From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Simon Josefsson Newsgroups: gmane.emacs.devel Subject: Re: More Cyrillic vs UTF-8 Date: Tue, 29 Apr 2003 01:08:53 +0200 Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: References: <84he8lovtc.fsf@lucy.is.informatik.uni-duisburg.de> <841xzphrr4.fsf@lucy.is.informatik.uni-duisburg.de> <8465p0l4jp.fsf@lucy.is.informatik.uni-duisburg.de> <200304281235.VAA11025@etlken.m17n.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1051572090 15730 80.91.224.249 (28 Apr 2003 23:21:30 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 28 Apr 2003 23:21:30 +0000 (UTC) Cc: kai.grossjohann@gmx.net Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Tue Apr 29 01:21:28 2003 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19AHvP-000422-00 for ; Tue, 29 Apr 2003 01:20:43 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 19AI3n-0001Zm-00 for ; Tue, 29 Apr 2003 01:29:23 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 19AHuR-0004gC-01 for emacs-devel@quimby.gnus.org; Mon, 28 Apr 2003 19:19:43 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10.13) id 19AHoZ-0003Hb-00 for emacs-devel@gnu.org; Mon, 28 Apr 2003 19:13:39 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10.13) id 19AHkC-0001uC-00 for emacs-devel@gnu.org; Mon, 28 Apr 2003 19:09:09 -0400 Original-Received: from 178.230.13.217.in-addr.dgcsystems.net ([217.13.230.178] helo=yxa.extundo.com) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 19AHk2-0001qx-00 for emacs-devel@gnu.org; Mon, 28 Apr 2003 19:08:58 -0400 Original-Received: from latte.josefsson.org (yxa.extundo.com [217.13.230.178]) (authenticated bits=0) by yxa.extundo.com (8.12.9/8.12.9) with ESMTP id h3SN8rbU004465 (version=TLSv1/SSLv3 cipher=EDH-RSA-DES-CBC3-SHA bits=168 verify=OK); Tue, 29 Apr 2003 01:08:54 +0200 Original-To: Kenichi Handa Mail-Copies-To: nobody X-Payment: hashcash 1.2 0:030428:handa@m17n.org:86db46b36845b850 X-Hashcash: 0:030428:handa@m17n.org:86db46b36845b850 X-Payment: hashcash 1.2 0:030428:kai.grossjohann@gmx.net:7ce827aff313179b X-Hashcash: 0:030428:kai.grossjohann@gmx.net:7ce827aff313179b X-Payment: hashcash 1.2 0:030428:emacs-devel@gnu.org:e1cb61354db670c5 X-Hashcash: 0:030428:emacs-devel@gnu.org:e1cb61354db670c5 In-Reply-To: <200304281235.VAA11025@etlken.m17n.org> (Kenichi Handa's message of "Mon, 28 Apr 2003 21:35:48 +0900 (JST)") User-Agent: Gnus/5.09002 (Oort Gnus v0.20) XEmacs/21.4 (Portable Code, linux) Original-cc: emacs-devel@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1b5 Precedence: list List-Id: Emacs development discussions. List-Help: List-Post: List-Subscribe: , List-Archive: List-Unsubscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:13523 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:13523 Kenichi Handa writes: > Richard Stallman writes: >> +* Encoding some characters as Unicode is rejected by Emacs. >> + >> +Emacs currently only supports the parts of the BMP whose codepoints >> +are in the ranges 0000-33ff and e000-ffff. If you try to save a file >> +containing characters with code points outside this range, Emacs will >> +suggest other compatible coding systems. > >> That is clearer; it's written in terms of behavior the user sees. >> I agree with the people who said that the codepoint numbers may not >> be clear enough. > > Perhaps, it is better to mention utf-translate-cjk mode as this. > > * Encoding some characters as Unicode (UTF-8) is rejected by Emacs. > > Emacs currently, by default, only supports the parts of the > BMP whose codepoints are in the ranges 0000-33ff and > e000-ffff. This excludes CJK, Yi, Music, and Maths. > > If you try to save a file containing characters with code > points outside this range, Emacs will suggest other > compatible coding systems. > > By turing Utf-Translate-Cjk mode on, many more CJK > characters are included in the support. This looks good. As for utf-translate-cjk, it does sounds like that functionality should be enabled by default. Is the only problem that loading them is slow? Perhaps it can be loaded lazily?