From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Stephen Leake Newsgroups: gmane.emacs.devel Subject: Re: parser error recovery algorithm vs treesit indentation "blinking" Date: Tue, 04 Apr 2023 09:00:16 -0700 Message-ID: <86zg7ntz73.fsf@stephe-leake.org> References: <87lejgsf0m.fsf@gmail.com> <83pm8s70o3.fsf@gnu.org> <83mt3u65vw.fsf@gnu.org> <87y1newqus.fsf@gmail.com> <83bkka5z7w.fsf@gnu.org> <871ql6a4d4.fsf@gmail.com> <83jzyy4776.fsf@gnu.org> <9F152CAA-6326-459F-84FF-87988B3A92B6@gmail.com> <868rf8vdse.fsf_-_@stephe-leake.org> <234fa18d-8b99-65c0-f4d0-161954888831@yandex.ru> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="37447"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Cc: John Yates , Alan Mackenzie , Yuan Fu , Eli Zaretskii , theodor thornhill , geza.herman@gmail.com, Daniel Colascione , emacs-devel@gnu.org To: Dmitry Gutov Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Tue Apr 04 18:02:45 2023 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1pjj7M-0009S9-EH for ged-emacs-devel@m.gmane-mx.org; Tue, 04 Apr 2023 18:02:44 +0200 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1pjj5m-0006Xn-FS; Tue, 04 Apr 2023 12:01:06 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pjj5D-00062W-Lk for emacs-devel@gnu.org; Tue, 04 Apr 2023 12:00:32 -0400 Original-Received: from alt-proxy28.mail.unifiedlayer.com ([74.220.216.123]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pjj56-0000vh-J7 for emacs-devel@gnu.org; Tue, 04 Apr 2023 12:00:29 -0400 Original-Received: from cmgw12.mail.unifiedlayer.com (unknown [10.0.90.127]) by progateway1.mail.pro1.eigbox.com (Postfix) with ESMTP id A1E8510167A6C for ; Tue, 4 Apr 2023 16:00:19 +0000 (UTC) Original-Received: from host2007.hostmonster.com ([67.20.76.71]) by cmsmtp with ESMTP id jj51pb6SRuftbjj51phKx1; Tue, 04 Apr 2023 16:00:19 +0000 X-Authority-Reason: nr=8 X-Authority-Analysis: v=2.4 cv=cPYlDnSN c=1 sm=1 tr=0 ts=642c4993 a=dWLzHQi6WpdymmZIwiVdBw==:117 a=Fln8i1WyhtedwaIJAdHvmw==:17 a=dLZJa+xiwSxG16/P+YVxDGlgEgI=:19 a=dKHAf1wccvYA:10:nop_rcvd_month_year a=vvvmwbhNdt4A:10:endurance_base64_authed_username_1 a=vaJtXVxTAAAA:8 a=NEAV23lmAAAA:8 a=9i_RQKNPAAAA:8 a=vODQOw8GXSEnmkx-UZEA:9 a=Ev4oQ7kfJBNsvnoXShoW:22 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=stephe-leake.org; s=default; h=Content-Type:MIME-Version:Message-ID:Date: References:In-Reply-To:Subject:Cc:To:From:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=0m3FD16iiXGXDtAkUE0uwT1d53bEcYSCeNzV54wjlA4=; b=ZEq07CHi0CKPyJmrs8zS9t2Hoi 3UzSv+O2nRn8gz/A2rGIEwbqZoM2Yd9RqUXm3Zh/StAzfGQfExvejnEVlqMvY+yc0je9JNjGHhXBY MqRmvryO0s0AIOWpDJo8sts0PcbWrjeS+tThRbuy8IwGWvIn2CITbX6DNfxcgI3QQYjymFM0HIgPM EVQxkpmDDJm5f76SeWw04JmKhu4Umuvb9+MoEuhUbxQKpDbbGcaUPpIzDxC2RfP4A2AeeGkpNcgbx 0rgogrUk7BXq8763pdarYQhSQ7Lg9lXGYqtuaQPVuv1ZZN5i0rbWvTplhgMnqUtUMQYxaaQEMlxHI Cf3ieTUg==; Original-Received: from 135-180-197-170.fiber.dynamic.sonic.net ([135.180.197.170]:57538 helo=DESKTOP-G20DCG1) by host2007.hostmonster.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.95) (envelope-from ) id 1pjj50-001KPd-Fv; Tue, 04 Apr 2023 10:00:18 -0600 In-Reply-To: <234fa18d-8b99-65c0-f4d0-161954888831@yandex.ru> (Dmitry Gutov's message of "Tue, 4 Apr 2023 16:40:10 +0300") X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - host2007.hostmonster.com X-AntiAbuse: Original Domain - gnu.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - stephe-leake.org X-BWhitelist: no X-Source-IP: 135.180.197.170 X-Source-L: No X-Exim-ID: 1pjj50-001KPd-Fv X-Source-Sender: 135-180-197-170.fiber.dynamic.sonic.net (DESKTOP-G20DCG1) [135.180.197.170]:57538 X-Source-Auth: stephen_leake@stephe-leake.org X-Email-Count: 9 X-Source-Cap: c3RlcGhlbGU7c3RlcGhlbGU7aG9zdDIwMDcuaG9zdG1vbnN0ZXIuY29t X-Local-Domain: yes Received-SPF: pass client-ip=74.220.216.123; envelope-from=stephen_leake@stephe-leake.org; helo=alt-proxy28.mail.unifiedlayer.com X-Spam_score_int: 16 X-Spam_score: 1.6 X-Spam_bar: + X-Spam_report: (1.6 / 5.0 requ) BAYES_00=-1.9, DKIM_INVALID=0.1, DKIM_SIGNED=0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SBL_CSS=3.335, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.devel:305109 Archived-At: Dmitry Gutov writes: > Here's a relevant discussion, but there's nothing about positions in > there: https://github.com/tree-sitter/tree-sitter/issues/224 Ah. So tree-sitter does do some insertion, but not as much as wisitoken wisitoken is also GLR, and uses the same approach to multiple error correction solutions, although the cost of insert/delete depends on the token; for example, it's cheaper to insert or delete parentheses, since they are often missing or extra while editing. I've posted a draft paper on the wisitoken error correction algorithm; https://stephe-leake.org/ada/error_correction_algorithm.pdf In the full algorithm used by ada-mode, there is some consideration about whether to insert tokens before or after new-lines, to address the indent issue you describe. -- -- Stephe