From mboxrd@z Thu Jan 1 00:00:00 1970
From: Eli Zaretskii
Newsgroups: gmane.emacs.devel
Subject: Re: [Emacs-diffs] emacs-26 8f18d12: Improve documentation of decoding into a unibyte buffer
Date: Sat, 25 May 2019 22:59:02 +0300
Message-ID: <83sgt22tmh.fsf@gnu.org>
References: <20190525191039.14136.23307@vcs0.savannah.gnu.org> <20190525191040.CCD6C207F5@vcs0.savannah.gnu.org>
In-reply-to: (message from Stefan Monnier on Sat, 25 May 2019 15:41:46 -0400)
To: Stefan Monnier
Cc: emacs-devel@gnu.org

> From: Stefan Monnier
> Cc: emacs-devel@gnu.org
> Date: Sat, 25 May 2019 15:41:46 -0400
>
> > +length of the decoded text. If that buffer is a unibyte buffer
> > +(@pxref{Selecting a Representations}), the internal representation of
> > +the decoded text (@pxref{Text Representations}) is inserted into the
> > +buffer as individual bytes.
>
> If the decoded char is a byte between 128-255, is it inserted as
> a single byte or as the two-byte sequence used internally for those
> "eight-bit" chars?

The internal representation of the decoded text could include both.
If some of the bytes in the original byte stream couldn't be decoded
using the specified coding-system, they will be represented as raw
bytes, using 2-byte sequences. OTOH, Latin characters successfully
decoded into codepoints less than 256 will take 1 byte.

Again, this is just the internal representation of what was decoded.
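
To make the raw-byte case concrete, here is a rough Python model of it, not Emacs code. The function name `emacs_internal_bytes` is made up for illustration, and the two-byte layout used for raw bytes (lead byte 0xC0 or 0xC1, then one continuation byte) is my understanding of the internal encoding; please check it against src/character.h before relying on it. The sketch also assumes a UTF-8 coding-system.

```python
def emacs_internal_bytes(raw: bytes, coding: str = "utf-8") -> bytes:
    """Model (hypothetical helper) of the bytes a unibyte buffer would
    receive when `raw` is decoded with `coding`."""
    out = bytearray()
    # Python's "surrogateescape" handler maps each undecodable byte
    # 0x80..0xFF to the lone surrogate U+DC80..U+DCFF, which lets us
    # tell decoded characters apart from bytes that failed to decode.
    for ch in raw.decode(coding, errors="surrogateescape"):
        cp = ord(ch)
        if 0xDC80 <= cp <= 0xDCFF:
            byte = cp - 0xDC00  # recover the original undecodable byte
            # Assumed raw-byte layout: two bytes, 0xC0/0xC1 then 0x80..0xBF.
            out.append(0xC0 | ((byte >> 6) & 0x01))
            out.append(0x80 | (byte & 0x3F))
        else:
            # Successfully decoded characters: modelled as plain UTF-8.
            out.extend(ch.encode("utf-8"))
    return bytes(out)


if __name__ == "__main__":
    # "a" decodes normally; 0xFF is not valid UTF-8 and stays a raw byte.
    print(emacs_internal_bytes(b"a\xff"))  # b'a\xc1\xbf'
```

So in this model the ASCII "a" stays at one byte, while the undecodable 0xFF turns into a two-byte sequence.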