From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: "Stephen J. Turnbull" Newsgroups: gmane.emacs.devel Subject: Re: cannot encode a buffer?? Date: Wed, 25 Jun 2003 20:22:22 +0900 Organization: The XEmacs Project Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: <8765mul7v5.fsf@tleepslib.sk.tsukuba.ac.jp> References: <200306250752.QAA01036@etlken.m17n.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1056541120 1546 80.91.224.249 (25 Jun 2003 11:38:40 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 25 Jun 2003 11:38:40 +0000 (UTC) Cc: sds@gnu.org Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Wed Jun 25 13:38:36 2003 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19V8bk-0000Oh-00 for ; Wed, 25 Jun 2003 13:38:36 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 19V8fk-0004TF-00 for ; Wed, 25 Jun 2003 13:42:45 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.20) id 19V8bb-00054D-8b for emacs-devel@quimby.gnus.org; Wed, 25 Jun 2003 07:38:27 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.20) id 19V8ai-0004ob-Cv for emacs-devel@gnu.org; Wed, 25 Jun 2003 07:37:32 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.20) id 19V8Oh-0000vU-4F for emacs-devel@gnu.org; Wed, 25 Jun 2003 07:25:08 -0400 Original-Received: from tleepslib.sk.tsukuba.ac.jp ([130.158.98.109]) by monty-python.gnu.org with esmtp (Exim 4.20) id 19V8MG-0008Ob-Mk; Wed, 25 Jun 2003 07:22:36 -0400 Original-Received: from steve by tleepslib.sk.tsukuba.ac.jp with local (Exim 3.36 #1 (Debian)) id 19V8M2-0001zH-00; Wed, 25 Jun 2003 20:22:22 +0900 Original-To: Kenichi Handa In-Reply-To: <200306250752.QAA01036@etlken.m17n.org> (Kenichi Handa's message of "Wed, 25 Jun 2003 16:52:34 +0900 (JST)") User-Agent: Gnus/5.1001 (Gnus v5.10.1) XEmacs/21.4 (Portable Code, linux) Original-cc: emacs-devel@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1b5 Precedence: list List-Id: Emacs development discussions. List-Help: List-Post: List-Subscribe: , List-Archive: List-Unsubscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:15234 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:15234 >>>>> "Kenichi" == Kenichi Handa writes: Kenichi> If you force saving them by utf-8, the encoder generates Kenichi> a utf-8 byte sequence corresponding to U+FFFD for each of Kenichi> them. Isn't that a violation of the Unicode standard? I agree that the preferences of those who would rather that Emacs keep the different flavors of Han different should be respected. FWIW, I'd default `utf-translate-cjk-mode' to on (to encourage development of a Unicode-based way to disambiguate Unihan), but that does risk a lot of annoyance for Asian polyglots. However, if somebody insists on saving as UTF-8, the result should be unification of Japanese to Chinese (which after all can be read, if you can read both languages), not destruction of text. Ie, it should not be possible for Emacs to convert any JIS X 0208 character to U+FFFD, ever. -- Institute of Policy and Planning Sciences http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software.