From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: user sees \xxx but is thwarted from searching for them Date: Wed, 17 Apr 2002 13:18:06 -0400 Sender: emacs-devel-admin@gnu.org Message-ID: References: <563-Tue16Apr2002170838+0300-eliz@is.elta.co.il> <200204171604.g3HG4ob24867@aztec.santafe.edu> Reply-To: Eli Zaretskii NNTP-Posting-Host: localhost.gmane.org X-Trace: main.gmane.org 1019064070 15231 127.0.0.1 (17 Apr 2002 17:21:10 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 17 Apr 2002 17:21:10 +0000 (UTC) Cc: Heinrich.Rommerskirchen@icn.siemen.de, emacs-devel@gnu.org Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.33 #1 (Debian)) id 16xt7G-0003xY-00 for ; Wed, 17 Apr 2002 19:21:10 +0200 Original-Received: from fencepost.gnu.org ([199.232.76.164]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 16xtQ3-0007tZ-00 for ; Wed, 17 Apr 2002 19:40:35 +0200 Original-Received: from localhost ([127.0.0.1] helo=fencepost.gnu.org) by fencepost.gnu.org with esmtp (Exim 3.34 #1 (Debian)) id 16xt7E-0005VI-00; Wed, 17 Apr 2002 13:21:08 -0400 Original-Received: from eliz by fencepost.gnu.org with local (Exim 3.34 #1 (Debian)) id 16xt4I-0005FV-00; Wed, 17 Apr 2002 13:18:06 -0400 Original-To: rms@gnu.org In-Reply-To: <200204171604.g3HG4ob24867@aztec.santafe.edu> (message from Richard Stallman on Wed, 17 Apr 2002 10:04:50 -0600 (MDT)) Errors-To: emacs-devel-admin@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.0.9 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Emacs development discussions. List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.devel:2711 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:2711 > From: Richard Stallman > Date: Wed, 17 Apr 2002 10:04:50 -0600 (MDT) > > I was in such a situation yesterday. Normally I use latin-1 encoding but > switched to language environment latin-9 to edit some files containing Euro > signs and forgot about this change. Then I loaded a 400 line DOS file > containing "-*- coding: cp850 -*-" in the first line. Emacs encoded all the > umlauts already in the file as latin-1 but encoded the typed umlauts as > latin-9. And after minor changes all over the file and a few interruptions > I didn't remember which parts were changed and which were old ... > > Does anyone have an idea for what we should do about this? Help Handa-san make the switch to Unicode ;-) The problem is that the target charset of cp850 is Latin-1, not Latin-9. OTOH, in a Latin-9 language environment, non-ASCII characters typed by the user are by default converted to Latin-9 characters. > Does the change to turn on unify-on-encoding fix this automatically? Yes, as long as the user doesn't type characters that are unique to Latin-1 and to Latin-9 (like if they use both the currency symbol and the Euro symbol in the same buffer). That is, assuming that the result, probably UTF-8, is not what the users expect. > Will the switch to native Unicode fix it? Yes.