From mboxrd@z Thu Jan 1 00:00:00 1970
From: "Perry E. Metzger"
Newsgroups: gmane.emacs.devel
Subject: Re: Tree Sitter (was Re: cc-mode fontification feels random)
Date: Wed, 21 Jul 2021 15:15:01 -0400
Message-ID: <0c575ca7-d287-4699-02bd-65822c11bf5d@piermont.com>
References: <83o8cge4lg.fsf@gnu.org>
 <62e438b5-d27f-1d3c-69c6-11fe29a76d74@dancol.org>
 <83fsxsdxhu.fsf@gnu.org>
 <179f22a44d8.2816.cc5b3318d7e9908e2c46732289705cb0@dancol.org>
 <179f38c0370.2816.cc5b3318d7e9908e2c46732289705cb0@dancol.org>
 <236e62c2-be9b-b26d-8cd0-4b5a1a86e19a@dancol.org>
 <86mtqsoh3f.fsf@stephe-leake.org>
 <286d815e-d1a1-07ca-6696-a7f51923ab4e@piermont.com>
 <86wnpl6f0y.fsf@stephe-leake.org>
 <865yx45y7g.fsf@stephe-leake.org>
To: Daniel Colascione, Stefan Monnier, Stephen Leake
Cc: emacs-devel@gnu.org
List-Id: "Emacs development discussions."
Xref: news.gmane.io gmane.emacs.devel:271427

On 7/21/21 12:21, Daniel Colascione wrote:
> On 7/21/21 7:43 AM, Perry E. Metzger wrote:
>> Thought I would note that there's a substantial literature now on
>> incremental parsing, especially the sort that is needed for editor
>> tools. One doesn't need to reinvent the algorithms, they're out there
>> waiting to be used. The Tree Sitter project is based on previous
>> published work.
>
> There is indeed a big literature! I wish there were a bigger
> literature on *composable* incremental parsers though. IMHO, what we
> need is an incremental GLR system (yes, GLR is bad worst-case, but
> it's not a practical concern) that spits out a parse *forest* which we
> then pare down to a parse tree with ad-hoc syntactic consistency
> rules. Something like this naturally supports multi-language modes and
> incorporation of out-of-band semantic information.
>

Tree sitter handles GLR.

Perry