From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: UCS-2BE Date: Fri, 01 Sep 2006 20:28:02 +0900 Message-ID: References: <878xl5x4lr.fsf@jurta.org> <44F6A74A.9040708@gnu.org> <44F6BC5B.8010504@gnu.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: sea.gmane.org 1157110184 11042 80.91.229.2 (1 Sep 2006 11:29:44 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Fri, 1 Sep 2006 11:29:44 +0000 (UTC) Cc: juri@jurta.org, emacs-devel@gnu.org, jasonr@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Sep 01 13:29:39 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1GJ7DL-00067u-LP for ged-emacs-devel@m.gmane.org; Fri, 01 Sep 2006 13:29:35 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GJ7DL-0005ut-5Q for ged-emacs-devel@m.gmane.org; Fri, 01 Sep 2006 07:29:35 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1GJ7D8-0005ue-1Y for emacs-devel@gnu.org; Fri, 01 Sep 2006 07:29:22 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1GJ7D6-0005tV-Gr for emacs-devel@gnu.org; Fri, 01 Sep 2006 07:29:21 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GJ7D6-0005tA-BG for emacs-devel@gnu.org; Fri, 01 Sep 2006 07:29:20 -0400 Original-Received: from [150.29.246.133] (helo=mx1.aist.go.jp) by monty-python.gnu.org with esmtp (Exim 4.52) id 1GJ7Mt-0000vH-2F; Fri, 01 Sep 2006 07:39:27 -0400 Original-Received: from smtp4.aist.go.jp ([150.29.246.12]) by mx1.aist.go.jp with ESMTP id k81BTCqx007809; Fri, 1 Sep 2006 20:29:12 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp4.aist.go.jp with ESMTP id k81BT7uO013674; Fri, 1 Sep 2006 20:29:09 +0900 (JST) env-from (handa@m17n.org) Original-Received: from handa by etlken with local (Exim 3.36 #1 (Debian)) id 1GJ7Bq-0006dO-00; Fri, 01 Sep 2006 20:28:02 +0900 Original-To: Andreas Schwab In-reply-to: (message from Andreas Schwab on Fri, 01 Sep 2006 11:01:14 +0200) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:59216 Archived-At: In article , Andreas Schwab writes: > The above quote is talking about "coded in N octets". If that's not about > serialisation, what else is it? To my understanding, it means 8*N bits here, and the wording "UCS-4 (Universal Character Set coded in 4 octets)" is just for explaining from where the the literal "UCS-4" comes. See this description in C.2. "As a consequence, UCS-4 can now be taken effectively as an alias for the Unicode encoding form UTF-32, ..." So, apparently UCS-4 is CEF here. By the way, Unicode itself is confusing in names. For instance, UTF-32 means both "UTF-32 encoding form (CEF)" and "UTF-32 encoding scheme (CES)". Unicode 4.1 says: "For historical reasons, the Unicode encoding schemes are also referred to as Unicode (or UCS) transformation formats (UTF). That term is, however, ambiguous between its usage for encoding forms and encoding schemes." --- Kenichi Handa handa@m17n.org