From: Alan Mackenzie
Newsgroups: gmane.emacs.devel
Subject: Re: Indirect text properties
Date: Mon, 18 Nov 2019 18:06:59 +0000
Message-ID: <20191118180659.GA6204@ACM>
References: <20191117170527.GB11551@ACM>
To: Dmitry Gutov
Cc: Vitalie Spinu, emacs-devel@gnu.org

Hello, Dmitry.

On Mon, Nov 18, 2019 at 00:55:50 +0200, Dmitry Gutov wrote:
> On 17.11.2019 19:05, Alan Mackenzie wrote:
> > This is an idea I had a couple of years ago, and has recently resurfaced
> > in discussions with Dmitry (Subject: Several major modes).

> > The idea is that there could be several alternative sets of text
> > properties with the same symbol simultaneously in a buffer, the Lisp
> > code selecting which to use by binding a dynamic variable.  This would
> > be most useful for the syntax-table text property.

> Could char-property-alias-alist help?

That is kind of the other way round to indirect properties.  It gives
several names accessing one property, whereas indirect properties give
one name accessing several properties.

> > How would this work?  In textprop.c, the code would, on any access to a
> > text property, check its symbol's property 'indirect-text-property, and
> > if that is a non-nil symbol, access its value (another symbol) and use
> > that as the symbol for the text property instead.  It's easier to say in
> > code, which would look something like:

> >     #define TEXP_PROP_END_NAME(sym) \
> >       !NILP (itp = Fget (sym, Qindirect_text_property)) && SYMBOLP (itp) \
> >       && !NILP (etp = find_symbol_value (itp)) && SYMBOLP (etp) \
> >       ? etp : sym

> > To switch to a different set of, e.g., syntax-table text properties
> > it would suffice to bind the lisp variable i-t-p to, say, the gensym
> > syntax-table-13.  Of course low level caches, e.g. in syntax.c, would
> > have to be kept synchronised, too.

> It's a lot of work with likely some performance overhead even for the
> default case as well.  It sounds like it could be a piece of the puzzle,
> but let's see if we get the full picture first.

The performance overhead whilst the facility is not in use would be
tiny: a look-up of a symbol's property list which is highly likely to be
empty anyway.  This is in the context of text properties, which involve
traversing trees to find an "interval" containing the place we're
looking up.  But OK, we need firmer proposals.

> Also, I think most (all?) of this proposal could be implemented in Lisp
> by just setting the 'syntax-table' on the overlays that cover different
> submode regions.  With more overhead when setting but less overhead than
> accessing the values.

Overlays don't have a syntax-table property.  It could be implemented,
but it would probably slow down syntactic scanning a lot, since
parse-partial-sexp would have to check all the overlays on _each_
character, one by one.  Or it would need some optimisation, which might
be brittle.

> > So, what use would it be?  What I have proposed to Dmitry is having a
> > distinct set of syntax-table properties for each major mode chunk of an
> > MMM Mode ("multiple major mode") buffer.  Say syntax-table-13 would be
> > the set for a CC Mode chunk.  Outside of that chunk, every character
> > would be given a space syntax-table-13 text property.  This is the
> > critical thing.

> > Thus all actions dependent upon syntax (and there are a LOT), could be
> > performed by CC Mode in the chunk without the other chunks getting in
> > the way.  It may not even be necessary to narrow to the chunk.

> It doesn't seem like it covers all problematic cases.  Maybe not even the
> majority:

Possibly not.
> - Would this win over "local" syntax-table properties as assigned by
> syntax-table?  By the usual logic of how we implement property
> priorities, probably not.  But it should, for this to work.

I'm not clear what you mean by "local" syntax-table properties, or by
"assigned by syntax-table".  Sorry.

> - Some code can just be looking for certain characters instead of syntax
> classes with re-search-backward, etc.  It wouldn't be fooled either.  So
> this would likely require some "are we still in the same major mode"
> predicate.  At which point we might get by without the space-syntax-table
> swapping entirely.

Yes, that is true.  So the concept of region boundary could not be done
away with altogether.

> So what are the exact scenarios that your aim is to fix with this?

You mentioned having a C Mode "main" section in MMM Mode with embedded
other modes, possibly in the middle of CC Mode constructs.  Here, my new
scheme would win, since the other modes would just look like whitespace
to CC Mode whilst it is scanning for strings, comments, braces, etc.
This seems to be a big problem at the moment with CC Mode + MMM Mode.
The same applies to other major modes, many of which will want to use
syntax-ppss.

> /Cc Vitalie, he could have some ideas, maybe even tell us how Polymode
> maybe solves this problem already.

-- 
Alan Mackenzie (Nuremberg, Germany).