From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Andreas Schwab Newsgroups: gmane.emacs.devel Subject: Re: Unibyte characters, strings, and buffers Date: Sun, 30 Mar 2014 16:25:39 +0200 Message-ID: <87wqfblq58.fsf@igel.home> References: <831txozsqa.fsf@gnu.org> <83ppl7y30l.fsf@gnu.org> <87r45nouvx.fsf@uwakimon.sk.tsukuba.ac.jp> <8361myyac6.fsf@gnu.org> <87a9capqfr.fsf@uwakimon.sk.tsukuba.ac.jp> <83eh1mfd09.fsf@gnu.org> <87ob0pnyt6.fsf@uwakimon.sk.tsukuba.ac.jp> <87ioqxnhhk.fsf@uwakimon.sk.tsukuba.ac.jp> <87bnwpov7b.fsf@fencepost.gnu.org> <87eh1lnf4q.fsf@uwakimon.sk.tsukuba.ac.jp> <877g7dos88.fsf@fencepost.gnu.org> <87a9c8o2yq.fsf@uwakimon.sk.tsukuba.ac.jp> <87a9c8njqf.fsf@fencepost.gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1396189568 24996 80.91.229.3 (30 Mar 2014 14:26:08 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 30 Mar 2014 14:26:08 +0000 (UTC) Cc: emacs-devel@gnu.org To: David Kastrup Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Mar 30 16:26:02 2014 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WUGgT-00050a-D8 for ged-emacs-devel@m.gmane.org; Sun, 30 Mar 2014 16:26:01 +0200 Original-Received: from localhost ([::1]:44556 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WUGgS-0000c3-QQ for ged-emacs-devel@m.gmane.org; Sun, 30 Mar 2014 10:26:00 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:60704) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WUGgL-0000aD-Ni for emacs-devel@gnu.org; Sun, 30 Mar 2014 10:25:59 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WUGgG-00035X-Bz for emacs-devel@gnu.org; Sun, 30 Mar 2014 10:25:53 -0400 Original-Received: from mail-out.m-online.net ([212.18.0.9]:34153) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WUGgA-00033k-GY; Sun, 30 Mar 2014 10:25:42 -0400 Original-Received: from frontend1.mail.m-online.net (unknown [192.168.8.180]) by mail-out.m-online.net (Postfix) with ESMTP id 3fxfGs2Fn5z4KK3T; Sun, 30 Mar 2014 16:25:41 +0200 (CEST) Original-Received: from localhost (dynscan1.mnet-online.de [192.168.6.68]) by mail.m-online.net (Postfix) with ESMTP id 3fxfGs0tSQzbbgt; Sun, 30 Mar 2014 16:25:41 +0200 (CEST) X-Virus-Scanned: amavisd-new at mnet-online.de Original-Received: from mail.mnet-online.de ([192.168.8.180]) by localhost (dynscan1.mail.m-online.net [192.168.6.68]) (amavisd-new, port 10024) with ESMTP id l6AowGP3OnJw; Sun, 30 Mar 2014 16:25:39 +0200 (CEST) X-Auth-Info: /ybXqS9p6Y5gfAcLznbKjIe0Fq0YfjO0JFWzKMBZm7o= Original-Received: from igel.home (ppp-93-104-157-147.dynamic.mnet-online.de [93.104.157.147]) by mail.mnet-online.de (Postfix) with ESMTPA; Sun, 30 Mar 2014 16:25:39 +0200 (CEST) Original-Received: by igel.home (Postfix, from userid 1000) id 6D8802C0471; Sun, 30 Mar 2014 16:25:39 +0200 (CEST) X-Yow: --``I love KATRINKA because she drives a PONTIAC. We're going away now. I fed the cat. - Zippy'' In-Reply-To: <87a9c8njqf.fsf@fencepost.gnu.org> (David Kastrup's message of "Sun, 30 Mar 2014 11:01:12 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3 (gnu/linux) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.4.x X-Received-From: 212.18.0.9 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:171196 Archived-At: David Kastrup writes: > I don't think it gets much more transparent than "unibyte flag only > marks the valid Unicode-in-Emacs character range". I'm for the range > 0..255, Andreas for something like 0..127 U 4194176..4194303 which > IĀ find cumbersome for little return. Before decoding there is no charset information yet, so using anything other than the eight-bit charset would be wrong. After decoding, the eight-bit charset is used only for undecodable bytes. That preserves the distinction between encoded and decoded strings/buffers (except for the uninteresting trivial ASCII decoding) in a world without unibyte flag. Andreas. -- Andreas Schwab, schwab@linux-m68k.org GPG Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different."