From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: Emacs 23 character code space Date: Wed, 26 Nov 2008 10:51:51 +0900 Message-ID: References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: ger.gmane.org 1227664336 29631 80.91.229.12 (26 Nov 2008 01:52:16 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 26 Nov 2008 01:52:16 +0000 (UTC) Cc: eliz@gnu.org, emacs-devel@gnu.org To: Stefan Monnier Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Nov 26 02:53:18 2008 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1L59ae-00043Y-Be for ged-emacs-devel@m.gmane.org; Wed, 26 Nov 2008 02:53:16 +0100 Original-Received: from localhost ([127.0.0.1]:47268 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1L59ZU-00062c-DQ for ged-emacs-devel@m.gmane.org; Tue, 25 Nov 2008 20:52:04 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1L59ZO-000623-K3 for emacs-devel@gnu.org; Tue, 25 Nov 2008 20:51:58 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1L59ZN-00061J-47 for emacs-devel@gnu.org; Tue, 25 Nov 2008 20:51:58 -0500 Original-Received: from [199.232.76.173] (port=36248 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1L59ZM-00061D-Vj for emacs-devel@gnu.org; Tue, 25 Nov 2008 20:51:57 -0500 Original-Received: from mx1.aist.go.jp ([150.29.246.133]:59402) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1L59ZK-00082E-7C; Tue, 25 Nov 2008 20:51:54 -0500 Original-Received: from rqsmtp1.aist.go.jp (rqsmtp1.aist.go.jp [150.29.254.115]) by mx1.aist.go.jp with ESMTP id mAQ1pqJM024893; Wed, 26 Nov 2008 10:51:52 +0900 (JST) env-from (handa@m17n.org) Original-Received: from smtp1.aist.go.jp by rqsmtp1.aist.go.jp with ESMTP id mAQ1pqc1023313; Wed, 26 Nov 2008 10:51:52 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp1.aist.go.jp with ESMTP id mAQ1ppWc006212; Wed, 26 Nov 2008 10:51:51 +0900 (JST) env-from (handa@m17n.org) Original-Received: from handa by etlken.m17n.org with local (Exim 4.69) (envelope-from ) id 1L59ZH-0005fX-Oa; Wed, 26 Nov 2008 10:51:51 +0900 In-reply-to: (message from Stefan Monnier on Sat, 22 Nov 2008 23:16:49 -0500) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/23.0.60 (i686-pc-linux-gnu) MULE/6.0 (HANACHIRUSATO) X-detected-operating-system: by monty-python.gnu.org: Solaris 9 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:106179 Archived-At: In article , Stefan Monnier writes: > > What exactly is meant here by ``8-bit characters''? Do you mean > > eight-bit raw bytes, or do you mean Unicode characters whose > > codepoints are below 256? > It should be eight-bit raw bytes. In some cases it's difficult to tell > the difference, so Emacs may occasionally accept latin-1 chars as stand > ins for eight-bit raw bytes. I classified characters into ASCII chars, non-ASCII chars, and 8-bit chars at the beginning of "@node Text Representations". And, unibyte text can contain ASCII and 8-bit chars, multibyte text can contain all chars. > I think we should state somewhere that unibyte strings and buffers > contain bytes only. And that multibyte strings and buffers contain > chars. And that bytes are a subset of chars. I'm not sure that is good as far as the Lisp reader returns a unibyte string from "abc". --- Kenichi Handa handa@ni.aist.go.jp