From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: gebser@speakeasy.net Newsgroups: gmane.emacs.help Subject: Re: How to convert .doc to plain text ascii in emacs. Date: Sun, 2 May 2004 10:44:17 -0400 (EDT) Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Message-ID: References: <87u0z0vuln.fsf@spacecentre.se> Reply-To: gebser@speakeasy.net NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Trace: sea.gmane.org 1083509412 1418 80.91.224.253 (2 May 2004 14:50:12 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sun, 2 May 2004 14:50:12 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Sun May 02 16:50:02 2004 Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1BKII6-00074p-00 for ; Sun, 02 May 2004 16:50:02 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.30) id 1BKIHg-0002ZJ-GG for geh-help-gnu-emacs@m.gmane.org; Sun, 02 May 2004 10:49:36 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.30) id 1BKIHH-0002XU-Sj for help-gnu-emacs@gnu.org; Sun, 02 May 2004 10:49:11 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.30) id 1BKICs-0001aE-4M for help-gnu-emacs@gnu.org; Sun, 02 May 2004 10:45:10 -0400 Original-Received: from [216.254.0.202] (helo=mail2.speakeasy.net) by monty-python.gnu.org with esmtp (TLSv1:DES-CBC3-SHA:168) (Exim 4.30) id 1BKICr-0001ZN-Ir for help-gnu-emacs@gnu.org; Sun, 02 May 2004 10:44:37 -0400 Original-Received: (qmail 24381 invoked from network); 2 May 2004 14:44:32 -0000 Original-Received: from dsl093-011-017.cle1.dsl.speakeasy.net (HELO heidegger.mousecar.net) ([66.93.11.17]) (envelope-sender ) by mail2.speakeasy.net (qmail-ldap-1.03) with AES256-SHA encrypted SMTP for ; 2 May 2004 14:44:32 -0000 Original-Newsgroups: gnu.emacs.help Original-To: GNU Emacs List In-Reply-To: <87u0z0vuln.fsf@spacecentre.se> X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.4 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.help:18201 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:18201 Thanks very much. Your elisp works great. There's one glitch (which I realize is from antiword): The three characters "\342\200\231" should be replaced by the single apostrophe character ('). To do this by hand, I did M-x replace-regexp Return C-q 342 Return C-q 200 Return C-q 231 Return Return ' Return but this does not find the intended string. The problem seems to be that C-q 342 is immediately (in the minibuffer) converted into an 'a' with a grave symbol over it. Putting the point on the backslash (\) preceding the 342 in the antiword-converted buffer and doing "C-u C-x =" indeed shows this a-with-grave character to be (0342, 226, 0xe2). To create a simple test case, do the following: Open an empty *scratch* buffer. Enter into it: C-q 342 Return C-q 200 Return C-q 231 Return. The first character that appears is the a-with-grave; the second and third characters appear properly as \200\231. It is, I think, the failure of C-q 342 to be represented as \342 which is the problem. What is the solution? tia, ken [....]