From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Dmitry Gutov Newsgroups: gmane.emacs.devel Subject: Re: Automatic (e)tags generation and incremental updates Date: Wed, 13 Jan 2021 17:52:16 +0200 Message-ID: References: <779a6328-9ca5-202a-25a2-b270c66fe6dd@yandex.ru> <8fc5e96c-ebb8-c668-9b2a-c7c4ee54c0b9@yandex.ru> <83r1mwltob.fsf@gnu.org> <0bee9ab4-46bc-b6fd-97b6-e26cc80f1610@yandex.ru> <875z45dbm7.fsf@tromey.com> <1e9c9572-52ee-339b-78a2-731b9eb5f3de@yandex.ru> <871resd93f.fsf@tromey.com> <83mtxffrou.fsf@gnu.org> <106abdbb-ce7a-4911-0831-149da3dccfb3@yandex.ru> <83o8hudwgo.fsf@gnu.org> <8335z6dql2.fsf@gnu.org> <3c688f2e-a32c-63b8-235b-8ef92e87fe83@yandex.ru> <83y2gyca4z.fsf@gnu.org> <09159508-db02-75f8-ec4e-692c62360905@yandex.ru> <837dogdgp6.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="38594"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 Cc: philipk@posteo.net, tom@tromey.com, emacs-devel@gnu.org, john@yates-sheets.org To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Wed Jan 13 16:54:37 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kziTk-0009xS-Hi for ged-emacs-devel@m.gmane-mx.org; Wed, 13 Jan 2021 16:54:36 +0100 Original-Received: from localhost ([::1]:43796 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kziTj-0007ml-IY for ged-emacs-devel@m.gmane-mx.org; Wed, 13 Jan 2021 10:54:35 -0500 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:47674) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kziRd-0005Rx-UE for emacs-devel@gnu.org; Wed, 13 Jan 2021 10:52:25 -0500 Original-Received: from mail-ed1-x535.google.com ([2a00:1450:4864:20::535]:44656) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kziRc-0000TR-07; Wed, 13 Jan 2021 10:52:25 -0500 Original-Received: by mail-ed1-x535.google.com with SMTP id p22so2425099edu.11; Wed, 13 Jan 2021 07:52:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=WOe/oZsjRbjA0oO52GjhSH7f+ZHxXeYX+5t0hgQejWA=; b=Ge58jCEdPoQEMPYjNGve9ZUMjOoI99rx78bvbmNjj+0oAWJOU1AbrJINGRdmDwF6JS 1xUtDltDoO4q0ixtS7wB2gZEB+r7fZNboIkimtSE1hBKUNnJ7njkPvEoP9o51Fhffd7t cFgPUy5AulaOo8sRN9/Bgm7J0Jks1065MSern7pCEjV2T56VNoe69/8hLbeFNY9UNACI xpGp/SnMcY0CBuxzaSAFah0tHiI1xSRoHNXFkGUII3rmx/VsJ5YVjlOHTFkdms8uD46n m2XwGCTv/w49xSNAHU7yzLv8TpQBV4GEo1y/o2A90BUm9Ks+66zdRA6SH35yPavuWt4D mLmw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:subject:to:cc:references:from:message-id :date:user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=WOe/oZsjRbjA0oO52GjhSH7f+ZHxXeYX+5t0hgQejWA=; b=JnEvxXK50fW1GuH4B9eioteMW8gvnskjSsFciTlukTBLq45FKwrD+Gs+DzedPVT0zP I9LUypjj6w4WWwcS0aDNGgwlwcQv2cKaKJdMnpColJLKlWDKlkDddCgdBJhwIM3Elbc0 5FS/uVqdEZw5Jr9q5qHMHZFbxuoFBMoztWHiaZ0s8Umq2Sz8b9Q6DRC/E3AQ2fpUrj9E b6d0vvORSxq25lmX0enlEJVN6vLk5O+IqQ7e69wyQiYywjfgMekOqlGnhIkY6b78O4rz TjOD1XlYQxS9d+mvKac1oiRa4V+FN1TJcJxi1ttSX0wg6qbmalVWLEF/cvZDYAHOWwdF iMpA== X-Gm-Message-State: AOAM5302GH0XeIa/E938c+BCJSXnLGC+pKWsilsy8fqJysyStLjy5lU0 LfJbJFBW20r2E9wwyCuj9VIlZWvn1nxxIA== X-Google-Smtp-Source: ABdhPJwrDfb36ZzhS0TwKiy4GS+aAUcAX8Ytl9UBAr1Qj8pY44wwNmgQgFuHGVQzjl+X0mTWaoi43A== X-Received: by 2002:a50:d60f:: with SMTP id x15mr2271785edi.224.1610553141463; Wed, 13 Jan 2021 07:52:21 -0800 (PST) Original-Received: from [192.168.0.6] ([46.251.119.176]) by smtp.googlemail.com with ESMTPSA id j23sm1037325edv.45.2021.01.13.07.52.19 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 13 Jan 2021 07:52:20 -0800 (PST) In-Reply-To: <837dogdgp6.fsf@gnu.org> Content-Language: en-US Received-SPF: pass client-ip=2a00:1450:4864:20::535; envelope-from=raaahh@gmail.com; helo=mail-ed1-x535.google.com X-Spam_score_int: -14 X-Spam_score: -1.5 X-Spam_bar: - X-Spam_report: (-1.5 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FORGED_FROMDOMAIN=0.249, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.248, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:263032 Archived-At: On 13.01.2021 17:01, Eli Zaretskii wrote: >> Cc: tom@tromey.com, john@yates-sheets.org, philipk@posteo.net, >> emacs-devel@gnu.org >> From: Dmitry Gutov >> Date: Wed, 13 Jan 2021 00:26:50 +0200 >> >> Anyway, if you have any suggestions regarding how to write the file >> quicker (and why utf-8-emacs coding didn't help), that would be great. > > Almost all the identifiers are ASCII, right? So maybe optimize 99.9% > of use cases by storing such tags tables in a unibyte buffer, read > with insert-file-contents-literally? All right, and that option is probably handled well enough already by the user choosing (l) in the prompt when the tags file is very big. > As for why utf-8-emacs didn't help: I'm not really sure why Stefan > thought it will. I mean, look at the code: it still encodes, just > differently. My (apparently faulty) intuition was that if utf-8-emacs is the memory representation of buffer text, converting it into that encoding can be faster because it could be done by copying from memory rather that having to do the work of recoding every character.