From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.help Subject: Re: [Solved] RE: Differences between identical strings in Emacs lisp Date: Wed, 08 Apr 2015 14:59:51 +0300 Message-ID: <83egnuhlu0.fsf@gnu.org> References: <87pp7gu7by.fsf@kuiper.lan.informatimago.com> <83mw2khvc1.fsf@gnu.org> <834morj19g.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: QUOTED-PRINTABLE X-Trace: ger.gmane.org 1428494417 25631 80.91.229.3 (8 Apr 2015 12:00:17 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 8 Apr 2015 12:00:17 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Apr 08 14:00:02 2015 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YfoeH-0000CP-QB for geh-help-gnu-emacs@m.gmane.org; Wed, 08 Apr 2015 14:00:01 +0200 Original-Received: from localhost ([::1]:52269 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YfoeH-0005Mm-AH for geh-help-gnu-emacs@m.gmane.org; Wed, 08 Apr 2015 08:00:01 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:37791) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Yfoe6-0005Me-0B for help-gnu-emacs@gnu.org; Wed, 08 Apr 2015 07:59:50 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Yfoe2-00063J-MD for help-gnu-emacs@gnu.org; Wed, 08 Apr 2015 07:59:49 -0400 Original-Received: from mtaout29.012.net.il ([80.179.55.185]:55269) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Yfoe2-00062Y-FB for help-gnu-emacs@gnu.org; Wed, 08 Apr 2015 07:59:46 -0400 Original-Received: from conversion-daemon.mtaout29.012.net.il by mtaout29.012.net.il (HyperSendmail v2007.08) id <0NMH00400L1EIL00@mtaout29.012.net.il> for help-gnu-emacs@gnu.org; Wed, 08 Apr 2015 14:57:27 +0300 (IDT) Original-Received: from HOME-C4E4A596F7 ([87.69.4.28]) by mtaout29.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0NMH0006VL7OM840@mtaout29.012.net.il> for help-gnu-emacs@gnu.org; Wed, 08 Apr 2015 14:57:27 +0300 (IDT) In-reply-to: X-012-Sender: halo1@inter.net.il X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 80.179.55.185 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:103573 Archived-At: > From: J=FCrgen Hartmann > Date: Wed, 8 Apr 2015 13:01:16 +0200 >=20 > >> [About mapping between unibyte and multibyte strings] > >> > >> First I thought that some hidden decoding based on some charsets= or > >> coding > >> systems occurs. > > > > Actually, some sort of "decoding" does occur, albeit perhaps not = in > > the use cases you tried -- Emacs will sometimes silently convert > > unibyte characters to their locale-dependent multibyte equivalent= s. >=20 > On which occasion such a conversion is done? One example that comes to mind is (insert 160), i.e. when inserting text into a buffer. There are other examples, but I simply don't remember them at the moment. > Has this anything to do with the the charset that is individually > defined in language-info-alist for nearly each language environment= ? No, I think Emacs converts the value to the character that has the same Unicode codepoint. > It seems that all my related observations that puzzled me before ca= n be well > explained by the strict distinction between characters and raw byte= s and the > mapping between the latter's integer representations in the range > [0x80..0xFF] in an unibyte context and in the range [0x3FFF80..0x3F= FFFF] in a > multibyte context. Pretty much, yes.