From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: iso-8859-1 and non-latin-1 chars Date: Fri, 8 Nov 2002 08:42:00 +0900 (JST) Sender: emacs-devel-admin@gnu.org Message-ID: <200211072342.IAA10345@etlken.m17n.org> References: <200211071457.gA7EvoL08222@rum.cs.yale.edu> <2110-Thu07Nov2002182535+0200-eliz@is.elta.co.il> <200211071706.gA7H6hW09141@rum.cs.yale.edu> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: main.gmane.org 1036713471 8151 80.91.224.249 (7 Nov 2002 23:57:51 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Thu, 7 Nov 2002 23:57:51 +0000 (UTC) Cc: eliz@is.elta.co.il, monnier+gnu/emacs@rum.cs.yale.edu, emacs-devel@gnu.org Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 189wX0-00027L-00 for ; Fri, 08 Nov 2002 00:57:50 +0100 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 189wgD-00067y-00 for ; Fri, 08 Nov 2002 01:07:21 +0100 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10) id 189wWW-0002hx-00; Thu, 07 Nov 2002 18:57:20 -0500 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10) id 189wHs-0001se-00 for emacs-devel@gnu.org; Thu, 07 Nov 2002 18:42:12 -0500 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10) id 189wHo-0001pf-00 for emacs-devel@gnu.org; Thu, 07 Nov 2002 18:42:10 -0500 Original-Received: from tsukuba.m17n.org ([192.47.44.130]) by monty-python.gnu.org with esmtp (Exim 4.10) id 189wHm-0001lz-00 for emacs-devel@gnu.org; Thu, 07 Nov 2002 18:42:06 -0500 Original-Received: from fs.m17n.org (fs.m17n.org [192.47.44.2]) by tsukuba.m17n.org (8.11.6/3.7W-20010518204228) with ESMTP id gA7Ng1k03291; Fri, 8 Nov 2002 08:42:01 +0900 (JST) (envelope-from handa@m17n.org) Original-Received: from etlken.m17n.org (etlken.m17n.org [192.47.44.125]) by fs.m17n.org (8.11.3/3.7W-20010823150639) with ESMTP id gA7Ng0R15663; Fri, 8 Nov 2002 08:42:00 +0900 (JST) Original-Received: (from handa@localhost) by etlken.m17n.org (8.8.8+Sun/3.7W-2001040620) id IAA10345; Fri, 8 Nov 2002 08:42:00 +0900 (JST) Original-To: monnier+gnu/emacs@rum.cs.yale.edu In-reply-to: <200211071706.gA7H6hW09141@rum.cs.yale.edu> (monnier+gnu/emacs@rum.cs.yale.edu) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/21.2.92 (sparc-sun-solaris2.6) MULE/5.0 (SAKAKI) Errors-To: emacs-devel-admin@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.0.11 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Emacs development discussions. List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.devel:9246 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:9246 In article <200211071706.gA7H6hW09141@rum.cs.yale.edu>, "Stefan Monnier" writes: >> > When encoding text containing non-latin-1 chars with the latin-1 >> > coding-system, they get output as some kind of escape sequence. >> >> Yes. IIRC, this is hard-coded in the encoder's C code: it works as if >> latin-1 was actually iso-latin-1-wth-esc. > How can we change that ? This change will do. *** european.el.~1.75.~ Wed Nov 6 09:13:16 2002 --- european.el Fri Nov 8 08:32:12 2002 *************** *** 36,42 **** 'iso-latin-1 2 ?1 "ISO 2022 based 8-bit encoding for Latin-1 (MIME:ISO-8859-1)." '(ascii latin-iso8859-1 nil nil ! nil nil nil nil nil nil nil nil nil nil nil nil t) '((safe-charsets ascii latin-iso8859-1) (mime-charset . iso-8859-1))) --- 36,42 ---- 'iso-latin-1 2 ?1 "ISO 2022 based 8-bit encoding for Latin-1 (MIME:ISO-8859-1)." '(ascii latin-iso8859-1 nil nil ! nil nil nil nil nil nil nil nil nil nil nil t t) '((safe-charsets ascii latin-iso8859-1) (mime-charset . iso-8859-1))) Or, if this is a problem only for ispell, we can make series of "safe" coding-systems for ispell. Or, we can add a global flag, say `inhibit-unsafe-iso-escape, to tell encoding routine not to produces those escape sequences. Then, ispell can let-bind that variable to t on encoding. I think the last one is the best solution. What do you think? --- Ken'ichi HANDA handa@m17n.org