From: Agustin Martin
Newsgroups: gmane.emacs.devel
Subject: Re: Bug 130397
Date: Fri, 7 Jan 2005 16:36:18 +0100
Message-ID: <20050107153618.GA1508@agmartin.aq.upm.es>
In-Reply-To: <200501050550.OAA13142@etlken.m17n.org>
To: Kenichi Handa, Stefan Monnier, emacs-devel@gnu.org, k.stevens@ieee.org, 130397@bugs.debian.org

On Wed, Jan 05, 2005 at 02:50:09PM +0900, Kenichi Handa wrote:
> Stefan Monnier writes:
> > But ispell.el should be able to automatically check whether the chars can be
> > safely encoded with the coding-system and if not (as in your example),
> > ispell.el will know that the word can't be checked by ispell and should
> > just be skipped (and maybe marked as "uncheckable").
>
> That seems to be a good approach.  But, just checking
> whether the chars is encodable with the coding-system is not
> enough.  For instance, entry for "francais" dict doesn't
> contain "ñ" in CASECHARS, but "español" is safely encodable
> by iso-8859-1.  So, the same error happens.
> For ispell.el to know that "español" is uncheckable, we anyway need the
> current database ispell-dictionary-alist.

Otherwise, expect something like "ispell and its process have different
character maps" during ispell-word, as well as some other possible errors.
This is for single-byte chars. When there is a char that cannot be encoded
in the dict encoding, the 'ispell misalignment' errors appear.

*Ken*, since you are being cc'ed: I vaguely remembered some info I read
somewhere about these misalignments. I finally found it,

http://lists.gnu.org/archive/html/emacs-devel/2002-09/msg01007.html

Essentially, it suggests that ispell-word (as well as flyspell) does not
show the misalignment problems because of the way words are passed to
ispell, while ispell-region (and so ispell-buffer) does. I have tested that
with an ad-hoc file: ispell-buffer gives the misalignment error while
flyspell-buffer does not. The suggestion is that this could be fixed by
making ispell-region iterate over words instead of over lines. Do you think
this would help to get rid of the misalignments, or are there other
drawbacks I am not aware of? I did not see any reply to that mail.

>
> By the way, isn't it possible to make that database
> automatically from *.aff?
>

Remember that there is also aspell, so we would have to use .aff when using
ispell and some other way when using aspell.

The way we do this is to trust dict maintainers to provide a file with all
the relevant info, kept up to date with the dict's current values.
ispell-dictionary-alist is rebuilt from that data, which is parsed at
dictionary installation. This way we try to make sure that all values
really match, that errors can be fixed more quickly by the dict maintainer
without needing a centralized maintainer to keep that alist up to date, and
that everything is based on the dicts actually installed.
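To make the point quoted at the top of this message concrete, here is a
rough sketch of the two-step check. It is plain Python, not ispell.el code,
and only meant as an illustration: the function name is_checkable and the
abbreviated FRENCH_CASECHARS set are made up for the example and are not
the real CASECHARS of any installed dictionary. A word is treated as
checkable only if it can be encoded in the dict's coding system and all of
its chars belong to the dict's CASECHARS.

```python
def is_checkable(word, coding_system, casechars):
    """Return True only if WORD can be encoded in CODING_SYSTEM and
    every character of WORD is one of the dictionary's CASECHARS."""
    try:
        word.encode(coding_system)
    except UnicodeEncodeError:
        # Some char does not exist in the dict encoding at all.
        return False
    # Encodable is not enough: each char must also be a word char
    # for this particular dictionary.
    return all(ch in casechars for ch in word)


# Hypothetical, abbreviated CASECHARS for a French dict (no "ñ").
FRENCH_CASECHARS = set(
    "abcdefghijklmnopqrstuvwxyz"
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "àâçèéêëîïôùûüÀÂÇÈÉÊËÎÏÔÙÛÜ"
)

# Encodable and made of CASECHARS only.
print(is_checkable("français", "iso-8859-1", FRENCH_CASECHARS))
# Encodable in iso-8859-1, but "ñ" is not in the French CASECHARS.
print(is_checkable("español", "iso-8859-1", FRENCH_CASECHARS))
# Not even encodable in iso-8859-1.
print(is_checkable("ĉevalo", "iso-8859-1", FRENCH_CASECHARS))
```

The second call is the case Kenichi describes: the encoding test alone
would let "español" through, so the per-dict data is still needed.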
By the way, in Emacs CVS the esperanto entry claims to use iso-8859-1
encoding, while it should be iso-8859-3, and that should be added to the
possible coding-system values.

Regarding this, we added a patch by Joao Cachopo to allow coding-system to
be any coding system supported by Emacs (http://bugs.debian.org/208518),
using

    (coding-system :tag "Coding System")

instead of

    (choice :tag "Coding system"
            (const iso-8859-1)
            (const iso-8859-2)
            (const koi8-r))

in both the ispell-local-dictionary-alist and ispell-dictionary-alist
defcustoms.

Cheers,

-- 
Agustin