From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Daniel Colascione Newsgroups: gmane.emacs.devel Subject: Re: Tree Sitter (was Re: cc-mode fontification feels random) Date: Wed, 21 Jul 2021 18:16:05 -0700 Message-ID: <2e5ead63-624e-57bf-feaa-996f078fc782@dancol.org> References: <83o8cge4lg.fsf@gnu.org> <62e438b5-d27f-1d3c-69c6-11fe29a76d74@dancol.org> <83fsxsdxhu.fsf@gnu.org> <179f22a44d8.2816.cc5b3318d7e9908e2c46732289705cb0@dancol.org> <179f38c0370.2816.cc5b3318d7e9908e2c46732289705cb0@dancol.org> <236e62c2-be9b-b26d-8cd0-4b5a1a86e19a@dancol.org> <86mtqsoh3f.fsf@stephe-leake.org> <286d815e-d1a1-07ca-6696-a7f51923ab4e@piermont.com> <86wnpl6f0y.fsf@stephe-leake.org> <865yx45y7g.fsf@stephe-leake.org> <0c575ca7-d287-4699-02bd-65822c11bf5d@piermont.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="10150"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0 Cc: emacs-devel@gnu.org To: "Perry E. Metzger" , Stefan Monnier , Stephen Leake Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Thu Jul 22 03:19:00 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1m6NMZ-0002Ph-VB for ged-emacs-devel@m.gmane-mx.org; Thu, 22 Jul 2021 03:19:00 +0200 Original-Received: from localhost ([::1]:32772 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1m6NMY-00023h-Pn for ged-emacs-devel@m.gmane-mx.org; Wed, 21 Jul 2021 21:18:58 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:33716) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1m6NJt-0001Ib-0o for emacs-devel@gnu.org; Wed, 21 Jul 2021 21:16:13 -0400 Original-Received: from dancol.org ([2600:3c01:e000:3d8::1]:56948) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1m6NJq-0000bJ-Oo for emacs-devel@gnu.org; Wed, 21 Jul 2021 21:16:12 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=dancol.org; s=x; h=Content-Transfer-Encoding:Content-Type:In-Reply-To:MIME-Version:Date: Message-ID:From:References:Cc:To:Subject:Sender:Reply-To:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=HXTM2XK3SeuBQXjix4xgGXvbLoC1lzvKyzxtXBS0FYI=; b=Jku4vHHQkM6CHd0gIeZFMhnXCE +CpK7UYG7L1enpn1r5ZkW63fRPviWL3W1+iUXGteT2my+0hYxIruU3PqMOviGCHX87sHr5KhYIl5I kO0qy1VgxIJIoD2X4Jsw0/9/vx18YE4VQjbiId1Zz20BI/IiKbeGKN+I25KH4rEPxoC7PH0A6MaHd +oruqjXmBe9k29ZFo1LZFN5R1cPZAgP2jn4An7lceH6+bJ/HTGMsZGUwItjHkZQOppQT8jHMV2SK2 xW8V0e7LrwkJhE2g0zsZWAJAM7AEWWulHIe7jpAwCqB8cG4h1v0PKSZmCfofRsOMja63PY+uJOfrR pEZ1YGCA==; Original-Received: from [2604:4080:1321:8020:2761:c5fe:e373:3ed] (port=38820) by dancol.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.89) (envelope-from ) id 1m6NJm-0008MY-Mu; Wed, 21 Jul 2021 18:16:06 -0700 In-Reply-To: <0c575ca7-d287-4699-02bd-65822c11bf5d@piermont.com> Content-Language: en-US Received-SPF: pass client-ip=2600:3c01:e000:3d8::1; envelope-from=dancol@dancol.org; helo=dancol.org X-Spam_score_int: -21 X-Spam_score: -2.2 X-Spam_bar: -- X-Spam_report: (-2.2 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, NICE_REPLY_A=-0.117, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:271442 Archived-At: On 7/21/21 12:15 PM, Perry E. Metzger wrote: > On 7/21/21 12:21, Daniel Colascione wrote: >> On 7/21/21 7:43 AM, Perry E. Metzger wrote: >>> Thought I would note that there's a substantial literature now on >>> incremental parsing, especially the sort that is needed for editor >>> tools. One doesn't need to reinvent the algorithms, they're out >>> there waiting to be used. The Tree Sitter project is based on >>> previous published work. >> >> There is indeed a big literature! I wish there were a bigger >> literature on *composable* incremental parsers though. IMHO, what we >> need is an incremental GLR system (yes, GLR is bad worst-case, but >> it's not a practical concern) that spits out a parse *forest* which >> we then pare down to a parse tree with ad-hoc syntactic consistency >> rules. Something like this naturally supports multi-language modes >> and incorporation of out-of-band semantic information. >> > Tree sitter handles GLR. > Cool. How does it prune the parse forest?