From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: Fwd: Re: Inadequate documentation of silly characters on screen. Date: Thu, 19 Nov 2009 08:35:31 -0500 Message-ID: References: <20091118191258.GA2676@muc.de> <20091119082040.GA1720@muc.de> <874ooq8xay.fsf@wanchan.jasonrumney.net> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1258638165 25004 80.91.229.12 (19 Nov 2009 13:42:45 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 19 Nov 2009 13:42:45 +0000 (UTC) Cc: Alan Mackenzie , Andreas Schwab , emacs-devel@gnu.org To: Jason Rumney Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Nov 19 14:42:37 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1NB7HO-0004Ru-Se for ged-emacs-devel@m.gmane.org; Thu, 19 Nov 2009 14:42:35 +0100 Original-Received: from localhost ([127.0.0.1]:38591 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NB7HO-0005gd-D0 for ged-emacs-devel@m.gmane.org; Thu, 19 Nov 2009 08:42:34 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NB7Aj-0000zp-PW for emacs-devel@gnu.org; Thu, 19 Nov 2009 08:35:41 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1NB7Ae-0000uu-Ht for emacs-devel@gnu.org; Thu, 19 Nov 2009 08:35:40 -0500 Original-Received: from [199.232.76.173] (port=59136 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NB7Ae-0000um-DV for emacs-devel@gnu.org; Thu, 19 Nov 2009 08:35:36 -0500 Original-Received: from ironport2-out.teksavvy.com ([206.248.154.181]:3125 helo=ironport2-out.pppoe.ca) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1NB7Aa-0006k2-Lw; Thu, 19 Nov 2009 08:35:33 -0500 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AqYEAAfZBEvO+IIa/2dsb2JhbACBTdQthDsEgxGGWA X-IronPort-AV: E=Sophos;i="4.44,771,1249272000"; d="scan'208";a="49647461" Original-Received: from 206-248-130-26.dsl.teksavvy.com (HELO pastel.home) ([206.248.130.26]) by ironport2-out.pppoe.ca with ESMTP; 19 Nov 2009 08:35:31 -0500 Original-Received: by pastel.home (Postfix, from userid 20848) id 4828B8774; Thu, 19 Nov 2009 08:35:31 -0500 (EST) In-Reply-To: <874ooq8xay.fsf@wanchan.jasonrumney.net> (Jason Rumney's message of "Thu, 19 Nov 2009 21:21:41 +0800") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (gnu/linux) X-detected-operating-system: by monty-python.gnu.org: Genre and OS details not recognized. X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:117247 Archived-At: >> Nothing gets truncated. In Emacs 23 ?=F1 is simply the number 241, >> whereas in Emacs 22 is it the number 2289. You can put 2289 in a string >> in Emacs 23, but there is no defined unicode character with that value. > The bug here is likely that setting a character in a unibyte string to a > value between 160 and 255 does not result in an automatic conversion to > multibyte. That was correct in 22.3, since values in that range were > raw binary bytes outside of any character set, but in 23.1 they correspond > to valid Latin-1 codepoints. If you think of unibyte strings as sequences of bytes, it makes perfect sense to not automatically convert them to multibyte strings, since a sequence of bytes cannot hold the character =F1, only the byte 241. Stefan