From: Stefan Monnier
Newsgroups: gmane.emacs.devel,gmane.lisp.gcl.devel
Subject: Re: utf8 and emacs text/string multibyte representation
Date: Sat, 01 Nov 2014 20:56:03 -0400
To: David Kastrup
Cc: gcl-devel@gnu.org, emacs-devel@gnu.org
In-Reply-To: <87y4ru227h.fsf@fencepost.gnu.org> (David Kastrup's message of "Sat, 01 Nov 2014 19:41:22 +0100")

>> Actually, originally Emacs allowed you to treat text (buffers and
>> strings) either as sequences of characters or arrays of bytes, and

This was the case in Emacs-20.1, yes.  It was fixed by Emacs-20.3 or
maybe even 20.2 already.

> string-as-unibyte and string-as-multibyte most certainly are available
> for going from one to the other.

And they both suck.

> I have no idea whether this is a problem vector for creating
> inconsistent multibyte content.

It's mostly a problem in that it helps 8bit people stay stuck in the
confusion between bytes and characters.


        Stefan