From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Charles Muller Newsgroups: gmane.emacs.help Subject: Re: character encoding Date: Mon, 21 Oct 2002 16:36:24 +0900 (JST) Sender: help-gnu-emacs-admin@gnu.org Message-ID: <20021021.163624.59467597.acmuller@gol.com> References: <87smz13p05.fsf@localhost.localdomain> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Trace: main.gmane.org 1035185775 27704 80.91.224.249 (21 Oct 2002 07:36:15 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 21 Oct 2002 07:36:15 +0000 (UTC) Cc: help-gnu-emacs@gnu.org Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 183X6j-0007Ch-00 for ; Mon, 21 Oct 2002 09:36:13 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10) id 183X3u-0001lb-00; Mon, 21 Oct 2002 03:33:18 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10) id 183X3c-0001d0-00 for help-gnu-emacs@gnu.org; Mon, 21 Oct 2002 03:33:00 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10) id 183X3Z-0001cI-00 for help-gnu-emacs@gnu.org; Mon, 21 Oct 2002 03:32:59 -0400 Original-Received: from smtp01.fields.gol.com ([203.216.5.131]) by monty-python.gnu.org with esmtp (Exim 4.10) id 183X3Z-0001bD-00 for help-gnu-emacs@gnu.org; Mon, 21 Oct 2002 03:32:57 -0400 Original-Received: from 203-216-51-222.dsl.gol.ne.jp ([203.216.51.222] helo=localhost) by smtp01.fields.gol.com with esmtp (Magnetic Fields) id 183X3P-00068n-00; Mon, 21 Oct 2002 16:32:47 +0900 Original-To: hlawson@triad.rr.com Original-Newsgroups: gnu.emacs.help In-Reply-To: <87smz13p05.fsf@localhost.localdomain> X-Mailer: Mew version 2.2 on Emacs 21.2 / Mule 5.0 (SAKAKI) X-Abuse-Complaints: abuse@gol.com Errors-To: help-gnu-emacs-admin@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.0.11 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.help:2786 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:2786 Hugh wrote: > > Sometimes when I cut and past "it's" from a web page into an emacs > buffer it transfers as "it?s". Ditto for other similar events. Most likely these are "curly" apostrophes that are inserted when people publish HTML by first writing it in a word processor like Word or Word Perfect, which use curly apostrophes and quotation marks by default, rather than standard, straight apostrophes. I suspect that if you copy this text into any other plain text editor (not only Emacs) the same thing will happen. I am working on a couple of different web projects where people submit these kinds of word processor generated HTML documents, and it is a real pain in the neck to have to clean them up all the time. Chuck --------------------------- Charles Muller Faculty of Humanities, Toyo Gakuen University Digital Dictionary of Buddhism and CJKV-English Dictionary [http://www.acmuller.net] Mobile Phone: 090-9310-1787