From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Agustin Martin Newsgroups: gmane.emacs.devel Subject: Re: Bug 130397 (Was: Emacs - Ispell problem with i[no]german dictionary) Date: Wed, 19 May 2004 13:44:04 +0200 Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: <20040519114404.GA8916@agmartin.aq.upm.es> References: <20040517120658.GA6919@agmartin.aq.upm.es> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1084976141 17426 80.91.224.253 (19 May 2004 14:15:41 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 19 May 2004 14:15:41 +0000 (UTC) Cc: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Wed May 19 16:15:25 2004 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1BQRqv-0001HW-00 for ; Wed, 19 May 2004 16:15:25 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1BQRqv-000096-00 for ; Wed, 19 May 2004 16:15:25 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.34) id 1BQR8E-0001EI-3O for emacs-devel@quimby.gnus.org; Wed, 19 May 2004 09:29:14 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.34) id 1BQR7v-00018v-JS for emacs-devel@gnu.org; Wed, 19 May 2004 09:28:55 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.34) id 1BQPqW-0000Jn-9U for emacs-devel@gnu.org; Wed, 19 May 2004 08:07:23 -0400 Original-Received: from [138.100.4.49] (helo=edison.ccupm.upm.es) by monty-python.gnu.org with esmtp (Exim 4.34) id 1BQPWe-0006iR-No for emacs-devel@gnu.org; Wed, 19 May 2004 07:46:21 -0400 Original-Received: from corbu.aq.upm.es (corbu.aq.upm.es [138.100.41.3]) by edison.ccupm.upm.es (8.12.10/8.12.10) with ESMTP id i4JBiD4P025771; Wed, 19 May 2004 13:44:13 +0200 Original-Received: from mala.aq.upm.es (postfix@Agmartin.aq.upm.es [138.100.41.131]) by corbu.aq.upm.es (8.12.3/8.12.3/Debian-6.3) with ESMTP id i4JBSZrl015580; Wed, 19 May 2004 13:28:48 +0200 Original-Received: by mala.aq.upm.es (Postfix, from userid 1000) id BB962151B8; Wed, 19 May 2004 13:44:04 +0200 (CEST) Original-To: Kenichi Handa Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.6i X-MIME-Autoconverted: from 8bit to quoted-printable by edison.ccupm.upm.es id i4JBiD4P025771 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.4 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:23707 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:23707 On Tue, May 18, 2004 at 10:54:23AM -0400, Richard Stallman wrote: > If this is an issue of encodings, could you talk about it with > handa@etl.go.jp, and cc emavs-devel? Handa is the expert on > this part of Emacs. >=20 Hi,=20 As Richard suggested I am writing to you about this problem. I have been recently taking a look at some of the Debian spell related emacs21 bug reports and was trying to reproduce this one, as well as getting aditiona= l information. You can find the full thread at http://bugs.debian.org/130397 and this is what I saw when trying to reproduce the problem (same content= s as in my mail to Richard with a minor adition) Having french selected as ispell default I do: a) Start emacs with fr_FR@euro locale and manually type the (mispelled) french word d=E8plorable. Try ispell-word it. Bug reproduced, high bit= is not considered a word element. b) After previous step, I save the file containing that word and run emac= s on it again. ispell-word now works as expected and detects the complet= e mispelled word proposing the right fix. In both cases, emacs is called as $ LC_ALL=3Dfr_FR@euro emacs fr-test & c) If I now type again the mispelled word after the previous one, previou= s word is properly handled by ispell mode, but the bug is reproduced for the just typed one. Also, both '=E8' (previous and last one) clearly s= eem to have a different look when using LC_ALL=3Dfr_FR@euro. However, if I type it with a latin1 LC_ALL they look similar. d) If I save the file and re-edit, ispell-word now works well on both wor= ds. I have tested this with 'sid' Debian emacs21 (version 21.3+1-5) This seems to be a problem with the way emacs internally handles differen= t encodings. My guess is that emacs is handling differently the '=E8' chara= cter (In case of encoding problems in the mail, it is the grave accented=20 lowercase e) when typed in the fr_FR@euro locale than when file is read f= rom disk or typed in the fr_FR locale. This also sounds me to something I read somewhere about 'emacs utf-8 supp= ort is not yet complete' problem, but I cannot remember now where I found tha= t. I do not know if this is already fixed in CVS emacs, but is better to be sure that you are aware of this. Thanks, --=20 Agustin