From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.help Subject: Re: [Solved] RE: Differences between identical strings in Emacs lisp Date: Thu, 09 Apr 2015 08:32:09 -0400 Message-ID: References: <87pp7gu7by.fsf@kuiper.lan.informatimago.com> <83mw2khvc1.fsf@gnu.org> <834morj19g.fsf@gnu.org> <83egnuhlu0.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1428582779 3763 80.91.229.3 (9 Apr 2015 12:32:59 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 9 Apr 2015 12:32:59 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Thu Apr 09 14:32:48 2015 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YgBdX-0004tT-5u for geh-help-gnu-emacs@m.gmane.org; Thu, 09 Apr 2015 14:32:47 +0200 Original-Received: from localhost ([::1]:34311 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YgBdW-0000B7-5Y for geh-help-gnu-emacs@m.gmane.org; Thu, 09 Apr 2015 08:32:46 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:59085) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YgBdL-0000Ax-RA for help-gnu-emacs@gnu.org; Thu, 09 Apr 2015 08:32:36 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YgBdG-0000Eo-63 for help-gnu-emacs@gnu.org; Thu, 09 Apr 2015 08:32:35 -0400 Original-Received: from plane.gmane.org ([80.91.229.3]:58915) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YgBdF-0000Ec-Uq for help-gnu-emacs@gnu.org; Thu, 09 Apr 2015 08:32:30 -0400 Original-Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1YgBdE-0004n8-Ao for help-gnu-emacs@gnu.org; Thu, 09 Apr 2015 14:32:28 +0200 Original-Received: from 23.91.157.11 ([23.91.157.11]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 09 Apr 2015 14:32:28 +0200 Original-Received: from monnier by 23.91.157.11 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 09 Apr 2015 14:32:28 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 18 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 23.91.157.11 User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.0.50 (gnu/linux) Cancel-Lock: sha1:6hq+llGWpMJ6OwB/ljN2cS2fTgI= X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 80.91.229.3 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:103604 Archived-At: > I could imagine that the step from the equivalence char=byte to > char=unicode code point (long(er) integer) is not so difficult. But we have > in addition the UTF-8 representation. To what of the two latter--unicode code > point (integer, several bytes long) or its UTF-8 representation (sequence of > several bytes) does the term "multibyte" refer? multibyte refers to "string of characters". These have been represented internally using an iso-2022 encoding until Emacs-22 and since Emacs-23 they're represented internally with a utf-8 encoding. The name comes from the fact that each element can use up more than one byte. But that's just an internal detail that is mostly hidden from Elisp. To turn such a string of characters into a string of bytes you need to use things like encode-coding-(string|buffer), at which point you have to specify which encoding you want to use (e.g. utf-8). Stefan