From: Kenichi Handa
Newsgroups: gmane.emacs.devel
Subject: Re: Unicode character read representation
Date: Tue, 24 Feb 2009 20:14:14 +0900
To: Chong Yidong
Cc: emacs-devel@gnu.org
References: <87hc2n28a4.fsf@cyd.mit.edu>
In-reply-to: <87hc2n28a4.fsf@cyd.mit.edu> (message from Chong Yidong on Sat, 21 Feb 2009 09:07:15 -0500)

In article <87hc2n28a4.fsf@cyd.mit.edu>, Chong Yidong writes:

> From objects.texi in the Lisp manual:

>   `\U00NNNNNN' represents the character whose Unicode code point is
>   `U+NNNNNN', if such a character is supported by Emacs.  If the
>   corresponding character is not supported, Emacs signals an error.

> Are there any Unicode code points not supported by Emacs,

No.

> or is this sentence obsolete?

Not completely obsolete, but it should be modified somehow.

First, #x0..#x3FFFFF are all valid Emacs character codes.  Some
U+NNNNNN values are valid Unicode code points but are "noncharacters"
(e.g. U+FFFE, U+FFFF), some are invalid Unicode code points
(U+110000..U+3FFFFF), and some are invalid both as Unicode code points
and as Emacs character codes (U+400000 and over).

Currently Emacs signals an error only for U+400000 and over, and I'm
not sure how strictly we should interpret the \U.. notation.

---
Kenichi Handa
handa@m17n.org
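
To make the ranges in the message concrete, here is a minimal Python sketch that sorts a numeric code into the categories described above. It is only an illustration, not how Emacs implements its reader: the function names and category labels are hypothetical, the upper bound #x3FFFFF and the "error only for U+400000 and over" behaviour are taken from the message as of its date, and the sketch assumes the Unicode codespace ends at U+10FFFF. Surrogate code points are not treated specially.

```python
# Illustrative only: names and labels below are hypothetical, not Emacs APIs.
MAX_UNICODE = 0x10FFFF     # assumed last Unicode code point
MAX_EMACS_CHAR = 0x3FFFFF  # last valid Emacs character code, per the message


def is_noncharacter(code: int) -> bool:
    """Return True for Unicode noncharacters.

    These are U+FDD0..U+FDEF plus the last two code points of every
    plane (U+FFFE, U+FFFF, U+1FFFE, U+1FFFF, ..., U+10FFFF).
    """
    if code < 0 or code > MAX_UNICODE:
        return False
    return 0xFDD0 <= code <= 0xFDEF or (code & 0xFFFE) == 0xFFFE


def classify(code: int) -> str:
    """Place a non-negative code into one of the message's categories."""
    if code < 0:
        raise ValueError("code must be non-negative")
    if code > MAX_EMACS_CHAR:
        return "invalid in both"        # U+400000 and over
    if code > MAX_UNICODE:
        return "emacs-only"             # valid Emacs code, not a Unicode code point
    if is_noncharacter(code):
        return "unicode noncharacter"   # e.g. U+FFFE, U+FFFF
    return "unicode"


def reader_would_signal_error(code: int) -> bool:
    """Model of the behaviour reported in the message: error only above #x3FFFFF."""
    return code > MAX_EMACS_CHAR


if __name__ == "__main__":
    for sample in (0x41, 0xFFFE, 0x110000, 0x3FFFFF, 0x400000):
        print(f"U+{sample:06X}: {classify(sample)}")
```

A stricter reading of the \U.. notation would amount to changing `reader_would_signal_error` to reject everything above `MAX_UNICODE`, and possibly the noncharacters as well; the message leaves that question open.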