From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Lionel Elie Mamane Newsgroups: gmane.emacs.devel Subject: Re: Bug 130397 (Was: Emacs - Ispell problem with i[no]german dictionary) Date: Mon, 10 Jan 2005 14:06:41 +0100 Message-ID: <20050110130641.GB13663@tofu.mamane.lu> References: <20040517120658.GA6919@agmartin.aq.upm.es> <20041217121515.GA2270@agmartin.aq.upm.es> <200412221237.VAA07262@etlken.m17n.org> <20041222171306.GA4462@agmartin.aq.upm.es> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0497421426==" X-Trace: sea.gmane.org 1105363674 19081 80.91.229.6 (10 Jan 2005 13:27:54 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 10 Jan 2005 13:27:54 +0000 (UTC) Cc: 130397@bugs.debian.org, emacs-devel@gnu.org, Kenichi Handa Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Jan 10 14:27:46 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1CnzKn-0000fU-00 for ; Mon, 10 Jan 2005 14:11:49 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1CnzWI-0005hJ-Tr for ged-emacs-devel@m.gmane.org; Mon, 10 Jan 2005 08:23:42 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1CnzUW-0004tE-6q for emacs-devel@gnu.org; Mon, 10 Jan 2005 08:21:52 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1CnzUM-0004ng-5r for emacs-devel@gnu.org; Mon, 10 Jan 2005 08:21:44 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1CnzUK-0004jN-6w for emacs-devel@gnu.org; Mon, 10 Jan 2005 08:21:40 -0500 Original-Received: from [213.84.114.29] (helo=tofu.mamane.lu) by monty-python.gnu.org with esmtp (TLSv1:RC4-SHA:128) (Exim 4.34) id 1CnzFw-0006yQ-9x for emacs-devel@gnu.org; Mon, 10 Jan 2005 08:06:48 -0500 Original-Received: from master by tofu.mamane.lu with local (Exim 4.34) id 1CnzFp-0003wv-BU; Mon, 10 Jan 2005 14:06:41 +0100 Original-To: Agustin Martin In-Reply-To: <20041222171306.GA4462@agmartin.aq.upm.es> X-Operating-System: GNU/Linux X-Request-PGP: http://www.mamane.lu/openpgp/dh4096.asc User-Agent: Mutt/1.5.6+20040907i X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:32092 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:32092 --===============0497421426== Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="d6Gm4EdcadzBjdND" Content-Disposition: inline --d6Gm4EdcadzBjdND Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Wed, Dec 22, 2004 at 06:13:06PM +0100, Agustin Martin wrote: > On Wed, Dec 22, 2004 at 09:37:32PM +0900, Kenichi Handa wrote: > Thanks for the tip. I am not maintaining emacs, but a package for > the common dictionaries setup (dictionaries-common) that provides a > recent and patched ispell.el for all the diferent emacsen flavours > ({x}emacs) to integrate the different dicts and spellchecking > engines in some way. I will be happy to test this once is included > in sid emacs. >>> I am playing with redefining ispell-get-coding-system function in >>> ispell.el so dict coding-system is changed to iso-8859-15 if was >>> originally iso-8859-1 and emacs has iso-8859-15 as >>> buffer-file-coding-system, something like >> But, anyway, I think the above function is too ad-hoc. As >> iso-8859-1 and iso-8859-15 contains different set of characters >> (even if they are few), it's not good to treat them as the same >> thing. > I was aware of this, but anyway thanks for reminding. Code is > probably too ad-hoc, but latin{0,1} thing is also a somewhat ad-hoc > scenario, where latin0 should have really be named as something like > iso-8859-1v2, that is, a revision. I cannot imagine somebody using a > iso-8859-2 dict and trying to write in a iso8859-1 buffer, but with > iso-8859-1 and iso-8859-15 that is happening too frequently. > So we have a lot of people that blindly select the locale @euro > variant without realizing its implications, and that iso-8859-1 and > iso-8859-15 are different, but very close encodings (from a > practical point of view, they are fully equivalent for most > languages but IIRC french and finnish). > The current state of ispell dicts in Debian is that ifrench is iso-8859-15 > as default (although has a real latin1 entry). > So the only language that might currently require extra work is > french, and for it I find reasonable to use for emacs as default the > iso-8859-15 entry (tagged as iso-8859-1 for the above sustem to > work). For this I would like to hear Lionel's point of view, since > he has put a lot of effort to make iso-8859-15 available for > spellchecking (Hi, Lionel). I think that if we do that, then latin1 text won't be spell-checked correctly: Ispell will try to insert "one half" and "one quarter" characters (the characters occupying the same place as OE and oe in latin9), won't it? > I personally do not like having separate iso-8859-15 entries unless > they are really required. For the above dicts, that would be for > french, and I am not at all sure that it is really required. Having separate entries that the user has to select manually is bad, but it is the best we can have with the current system if we want to keep correctness of the spell-checking, as far as I understand. Having the system (the combination of emacs + dicts-common + the dicts) select the right dictionary + options combination automatically based on the (language, encoding) pair (like "-d francais -T ~latin1" for french and latin1) would be cool from the user's POV. We have special entries for "(La)TeX", which can be seen as another encoding, so why not special entries for iso8859-15 (when necessary)? What is so fundamentally different about iso8859-15? --=20 Lionel --d6Gm4EdcadzBjdND Content-Type: application/pgp-signature; name="signature.asc" Content-Description: Digital signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.4 (GNU/Linux) iD8DBQFB4n3gscRzFz57S3MRAir3AJ9b/mHTVmldiyzJIMLfz04SFq4+bQCgqEm0 qXQpjPBXewrhQ7/mJwK/WaI= =n9lO -----END PGP SIGNATURE----- --d6Gm4EdcadzBjdND-- --===============0497421426== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Emacs-devel mailing list Emacs-devel@gnu.org http://lists.gnu.org/mailman/listinfo/emacs-devel --===============0497421426==--