From mboxrd@z Thu Jan  1 00:00:00 1970
From: Eli Zaretskii
Newsgroups: gmane.emacs.devel
Subject: Re: Automatic (e)tags generation and incremental updates
Date: Sat, 16 Jan 2021 09:34:39 +0200
Message-ID: <83bldp9vxs.fsf@gnu.org>
References: <779a6328-9ca5-202a-25a2-b270c66fe6dd@yandex.ru>
 <8fc5e96c-ebb8-c668-9b2a-c7c4ee54c0b9@yandex.ru>
 <83r1mwltob.fsf@gnu.org>
 <0bee9ab4-46bc-b6fd-97b6-e26cc80f1610@yandex.ru>
 <875z45dbm7.fsf@tromey.com>
 <1e9c9572-52ee-339b-78a2-731b9eb5f3de@yandex.ru>
 <871resd93f.fsf@tromey.com>
 <83mtxffrou.fsf@gnu.org>
 <106abdbb-ce7a-4911-0831-149da3dccfb3@yandex.ru>
 <83o8hudwgo.fsf@gnu.org>
 <8335z6dql2.fsf@gnu.org>
 <3c688f2e-a32c-63b8-235b-8ef92e87fe83@yandex.ru>
 <83y2gyca4z.fsf@gnu.org>
 <09159508-db02-75f8-ec4e-692c62360905@yandex.ru>
 <837dogdgp6.fsf@gnu.org>
 <834kjkde1z.fsf@gnu.org>
 <8581b496-2093-42de-4e9d-deff8d4c9465@yandex.ru>
In-Reply-To: <8581b496-2093-42de-4e9d-deff8d4c9465@yandex.ru> (message from
 Dmitry Gutov on Sat, 16 Jan 2021 05:57:21 +0200)
To: Dmitry Gutov
Cc: philipk@posteo.net, tom@tromey.com, emacs-devel@gnu.org,
 john@yates-sheets.org
Xref: news.gmane.io gmane.emacs.devel:263092

> Cc: tom@tromey.com, john@yates-sheets.org, philipk@posteo.net,
>  emacs-devel@gnu.org
> From: Dmitry Gutov
> Date: Sat, 16 Jan 2021 05:57:21 +0200
>
> > We don't recode characters when they are valid UTF-8 sequences, but
> > you forget the raw bytes: they are converted from internal multibyte
> > representation to single bytes, and that requires walking the buffer
> > one character at a time.
> >
> > IOW, utf-8-emacs is the same as utf-8 for this purpose.
>
> So utf-8-emacs is not the same as "internal multibyte representation"?

No, not according to my reading of the code.  (The telltale sign is
that "C-h C" tells you utf-8-emacs has the usual 3 EOL variants,
something that makes no sense for the internal representation.)

If something like "dump internal representation" coding-system is
needed, we will have to add it, I think.
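
To make the point about raw bytes concrete, here is a rough Python
sketch of why encoding needs a walk over the buffer even when every
ordinary character is already valid UTF-8.  It is only an
illustration, not the actual Emacs encoder.  It assumes that the
internal representation stores a raw byte 0x80..0xFF as a two-byte
sequence led by 0xC0 or 0xC1; that layout is not stated in this
message and is an assumption here.  The function names are
hypothetical.

```python
def sequence_length(lead: int) -> int:
    """Byte length of the sequence that starts with lead byte `lead`.

    Uses UTF-8 style lead bytes, extended to 5-byte sequences
    (an assumption about the internal representation).
    """
    if lead < 0x80:
        return 1
    if lead < 0xE0:
        return 2
    if lead < 0xF0:
        return 3
    if lead < 0xF8:
        return 4
    return 5


def encode_internal(buf: bytes) -> bytes:
    """Copy ordinary sequences unchanged; collapse raw bytes to one byte.

    Input is assumed to be well-formed internal text.
    """
    out = bytearray()
    i = 0
    while i < len(buf):
        lead = buf[i]
        if lead in (0xC0, 0xC1):
            # Assumed raw-byte form: rebuild the single byte 0x80..0xFF
            # from the low bit of the lead and the 6 payload bits.
            out.append(0x80 | ((lead & 0x01) << 6) | (buf[i + 1] & 0x3F))
            i += 2
        else:
            # Ordinary character: the bytes pass through as they are.
            n = sequence_length(lead)
            out += buf[i:i + n]
            i += n
    return bytes(out)
```

Because a raw byte can appear anywhere, the loop has to inspect the
lead byte of every character; a plain block copy of the buffer would
leave the two-byte forms in the output.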