From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: dkcombs@panix.com (David Combs) Newsgroups: gmane.emacs.help Subject: how to scan file for non-ascii chars (eg cut-n-paste from ms-word) Date: 8 Jan 2011 19:53:01 -0500 Organization: PANIX -- Public Access Networks Corp. Message-ID: NNTP-Posting-Host: lo.gmane.org X-Trace: dough.gmane.org 1294580300 21461 80.91.229.12 (9 Jan 2011 13:38:20 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Sun, 9 Jan 2011 13:38:20 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Sun Jan 09 14:38:13 2011 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PbvTG-0005qM-9C for geh-help-gnu-emacs@m.gmane.org; Sun, 09 Jan 2011 14:38:10 +0100 Original-Received: from localhost ([127.0.0.1]:53924 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PbvTF-0000k5-F5 for geh-help-gnu-emacs@m.gmane.org; Sun, 09 Jan 2011 08:38:09 -0500 Original-Path: usenet.stanford.edu!panix!panix3.panix.com!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 25 Original-NNTP-Posting-Host: panix3.panix.com Original-X-Trace: reader1.panix.com 1294534381 27226 166.84.1.3 (9 Jan 2011 00:53:01 GMT) Original-X-Complaints-To: abuse@panix.com Original-NNTP-Posting-Date: Sun, 9 Jan 2011 00:53:01 +0000 (UTC) Original-Xref: usenet.stanford.edu gnu.emacs.help:184130 X-Mailman-Approved-At: Sun, 09 Jan 2011 08:35:00 -0500 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:78324 Archived-At: When I 'cut-n-paste' from eg ms-word-produced document, into an emacs buffer (ie ascii), you get all kinds of "non-ascii" chars, eg left and right double-quotes, like these: Char: . (8221, #o20035, #x201d) point=250 of 4096 (6%) column=7 Char: . (8220, #o20034, #x201c) point=218 of 4096 (5%) column=42 accents, and so on. When I go to save the buffer, emacs will ask if I want to save it in eg japanese format. Not exactly what I want. What I'd like to do is change those "strange" characters to their plain-ascii "equivalent", so to speak. Like '"' for double quote (left OR right), etc. Surely I'm not the only one to experience this difficulty: what work-arounds have YOU found? Thanks! David