unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
* Define treesit-font-lock-level  as buffer local
@ 2024-11-07 10:30 Vincenzo Pupillo
  2024-11-07 11:03 ` Eli Zaretskii
  0 siblings, 1 reply; 13+ messages in thread
From: Vincenzo Pupillo @ 2024-11-07 10:30 UTC (permalink / raw)
  To: emacs-devel

Ciao,
I use tree-sitter for different languages, for some of them I prefer font lock level 3, for others I prefer level 4. 
treesit-font-lock-level, however, is not defined as buffer local and so there is no elegant way to have 
different levels for different languages. 
Would it be possible to define treesit-font-lock-level as a buffer local?

Thank you.

V.





^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Define treesit-font-lock-level  as buffer local
  2024-11-07 10:30 Define treesit-font-lock-level as buffer local Vincenzo Pupillo
@ 2024-11-07 11:03 ` Eli Zaretskii
  2024-11-07 11:08   ` Vincenzo Pupillo
  0 siblings, 1 reply; 13+ messages in thread
From: Eli Zaretskii @ 2024-11-07 11:03 UTC (permalink / raw)
  To: Vincenzo Pupillo, Yuan Fu; +Cc: emacs-devel

> From: Vincenzo Pupillo <v.pupillo@gmail.com>
> Date: Thu, 07 Nov 2024 11:30:03 +0100
> 
> Ciao,
> I use tree-sitter for different languages, for some of them I prefer font lock level 3, for others I prefer level 4. 
> treesit-font-lock-level, however, is not defined as buffer local and so there is no elegant way to have 
> different levels for different languages. 
> Would it be possible to define treesit-font-lock-level as a buffer local?

Is it a problem for you to do that in your init file?

Also, we could support a value of that variable which is a list, like
we do with font-lock-maximum-decoration.  This will be more useful
than making the variable buffer-local, since presumably your
preferences are per-mode, not per-buffer.  Yuan, WDYT?



^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Define treesit-font-lock-level  as buffer local
  2024-11-07 11:03 ` Eli Zaretskii
@ 2024-11-07 11:08   ` Vincenzo Pupillo
  2024-11-09  8:36     ` Yuan Fu
  0 siblings, 1 reply; 13+ messages in thread
From: Vincenzo Pupillo @ 2024-11-07 11:08 UTC (permalink / raw)
  To: Yuan Fu, Eli Zaretskii; +Cc: emacs-devel

In data giovedì 7 novembre 2024 12:03:49 Ora standard dell’Europa centrale, Eli Zaretskii ha scritto:
> > From: Vincenzo Pupillo <v.pupillo@gmail.com>
> > Date: Thu, 07 Nov 2024 11:30:03 +0100
> > 
> > Ciao,
> > I use tree-sitter for different languages, for some of them I prefer font lock level 3, for others I prefer level 4. 
> > treesit-font-lock-level, however, is not defined as buffer local and so there is no elegant way to have 
> > different levels for different languages. 
> > Would it be possible to define treesit-font-lock-level as a buffer local?
> 
> Is it a problem for you to do that in your init file?

No, it's not a problem.

> Also, we could support a value of that variable which is a list, like
> we do with font-lock-maximum-decoration.  This will be more useful
> than making the variable buffer-local, since presumably your
> preferences are per-mode, not per-buffer.  
Yes, this would be a better solution.

> Yuan, WDYT?
> 

V.





^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Define treesit-font-lock-level  as buffer local
  2024-11-07 11:08   ` Vincenzo Pupillo
@ 2024-11-09  8:36     ` Yuan Fu
  2024-11-09  8:54       ` Eli Zaretskii
  0 siblings, 1 reply; 13+ messages in thread
From: Yuan Fu @ 2024-11-09  8:36 UTC (permalink / raw)
  To: Vincenzo Pupillo; +Cc: Eli Zaretskii, emacs-devel



> On Nov 7, 2024, at 3:08 AM, Vincenzo Pupillo <v.pupillo@gmail.com> wrote:
> 
> In data giovedì 7 novembre 2024 12:03:49 Ora standard dell’Europa centrale, Eli Zaretskii ha scritto:
>>> From: Vincenzo Pupillo <v.pupillo@gmail.com>
>>> Date: Thu, 07 Nov 2024 11:30:03 +0100
>>> 
>>> Ciao,
>>> I use tree-sitter for different languages, for some of them I prefer font lock level 3, for others I prefer level 4. 
>>> treesit-font-lock-level, however, is not defined as buffer local and so there is no elegant way to have 
>>> different levels for different languages. 
>>> Would it be possible to define treesit-font-lock-level as a buffer local?
>> 
>> Is it a problem for you to do that in your init file?
> 
> No, it's not a problem.
> 
>> Also, we could support a value of that variable which is a list, like
>> we do with font-lock-maximum-decoration.  This will be more useful
>> than making the variable buffer-local, since presumably your
>> preferences are per-mode, not per-buffer.  
> Yes, this would be a better solution.
> 
>> Yuan, WDYT?
>> 

Perhaps even per-language, for multi-language modes. I think it’s a valid use case. IMO, specify the level by language is better than mode. For single language modes, using the language is equivalent to using the mode; for multi-language modes, using the language allows more flexibility. 

Yuan


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Define treesit-font-lock-level  as buffer local
  2024-11-09  8:36     ` Yuan Fu
@ 2024-11-09  8:54       ` Eli Zaretskii
  2024-11-10  8:04         ` Yuan Fu
  0 siblings, 1 reply; 13+ messages in thread
From: Eli Zaretskii @ 2024-11-09  8:54 UTC (permalink / raw)
  To: Yuan Fu; +Cc: v.pupillo, emacs-devel

> From: Yuan Fu <casouri@gmail.com>
> Date: Sat, 9 Nov 2024 00:36:31 -0800
> Cc: Eli Zaretskii <eliz@gnu.org>,
>  emacs-devel@gnu.org
> 
> >> Also, we could support a value of that variable which is a list, like
> >> we do with font-lock-maximum-decoration.  This will be more useful
> >> than making the variable buffer-local, since presumably your
> >> preferences are per-mode, not per-buffer.  
> > Yes, this would be a better solution.
> > 
> >> Yuan, WDYT?
> >> 
> 
> Perhaps even per-language, for multi-language modes. I think it’s a valid use case. IMO, specify the level by language is better than mode. For single language modes, using the language is equivalent to using the mode; for multi-language modes, using the language allows more flexibility. 

Patches welcome.



^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Define treesit-font-lock-level  as buffer local
  2024-11-09  8:54       ` Eli Zaretskii
@ 2024-11-10  8:04         ` Yuan Fu
  2024-11-23 12:20           ` Eli Zaretskii
  0 siblings, 1 reply; 13+ messages in thread
From: Yuan Fu @ 2024-11-10  8:04 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: Vincenzo Pupillo, emacs-devel



> On Nov 9, 2024, at 12:54 AM, Eli Zaretskii <eliz@gnu.org> wrote:
> 
>> From: Yuan Fu <casouri@gmail.com>
>> Date: Sat, 9 Nov 2024 00:36:31 -0800
>> Cc: Eli Zaretskii <eliz@gnu.org>,
>> emacs-devel@gnu.org
>> 
>>>> Also, we could support a value of that variable which is a list, like
>>>> we do with font-lock-maximum-decoration.  This will be more useful
>>>> than making the variable buffer-local, since presumably your
>>>> preferences are per-mode, not per-buffer.  
>>> Yes, this would be a better solution.
>>> 
>>>> Yuan, WDYT?
>>>> 
>> 
>> Perhaps even per-language, for multi-language modes. I think it’s a valid use case. IMO, specify the level by language is better than mode. For single language modes, using the language is equivalent to using the mode; for multi-language modes, using the language allows more flexibility. 
> 
> Patches welcome.

I’ll prepare one soon.

Yuan


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Define treesit-font-lock-level  as buffer local
  2024-11-10  8:04         ` Yuan Fu
@ 2024-11-23 12:20           ` Eli Zaretskii
  2024-11-29  5:49             ` Tree-sitter central configuration variable Yuan Fu
  0 siblings, 1 reply; 13+ messages in thread
From: Eli Zaretskii @ 2024-11-23 12:20 UTC (permalink / raw)
  To: Yuan Fu; +Cc: v.pupillo, emacs-devel

> From: Yuan Fu <casouri@gmail.com>
> Date: Sun, 10 Nov 2024 00:04:00 -0800
> Cc: Vincenzo Pupillo <v.pupillo@gmail.com>,
>  emacs-devel@gnu.org
> 
> 
> 
> > On Nov 9, 2024, at 12:54 AM, Eli Zaretskii <eliz@gnu.org> wrote:
> > 
> >> From: Yuan Fu <casouri@gmail.com>
> >> Date: Sat, 9 Nov 2024 00:36:31 -0800
> >> Cc: Eli Zaretskii <eliz@gnu.org>,
> >> emacs-devel@gnu.org
> >> 
> >>>> Also, we could support a value of that variable which is a list, like
> >>>> we do with font-lock-maximum-decoration.  This will be more useful
> >>>> than making the variable buffer-local, since presumably your
> >>>> preferences are per-mode, not per-buffer.  
> >>> Yes, this would be a better solution.
> >>> 
> >>>> Yuan, WDYT?
> >>>> 
> >> 
> >> Perhaps even per-language, for multi-language modes. I think it’s a valid use case. IMO, specify the level by language is better than mode. For single language modes, using the language is equivalent to using the mode; for multi-language modes, using the language allows more flexibility. 
> > 
> > Patches welcome.
> 
> I’ll prepare one soon.

Did you have time to work on this?



^ permalink raw reply	[flat|nested] 13+ messages in thread

* Tree-sitter central configuration variable
  2024-11-23 12:20           ` Eli Zaretskii
@ 2024-11-29  5:49             ` Yuan Fu
  2024-11-29  7:12               ` Eshel Yaron
  2024-11-29  8:04               ` Eli Zaretskii
  0 siblings, 2 replies; 13+ messages in thread
From: Yuan Fu @ 2024-11-29  5:49 UTC (permalink / raw)
  To: Eli Zaretskii
  Cc: Vincenzo Pupillo, Emacs Devel, Stefan Kangas, Stefan Monnier,
	Dmitry Gutov



> On Nov 23, 2024, at 4:20 AM, Eli Zaretskii <eliz@gnu.org> wrote:
> 
>> From: Yuan Fu <casouri@gmail.com>
>> Date: Sun, 10 Nov 2024 00:04:00 -0800
>> Cc: Vincenzo Pupillo <v.pupillo@gmail.com>,
>> emacs-devel@gnu.org
>> 
>> 
>> 
>>> On Nov 9, 2024, at 12:54 AM, Eli Zaretskii <eliz@gnu.org> wrote:
>>> 
>>>> From: Yuan Fu <casouri@gmail.com>
>>>> Date: Sat, 9 Nov 2024 00:36:31 -0800
>>>> Cc: Eli Zaretskii <eliz@gnu.org>,
>>>> emacs-devel@gnu.org
>>>> 
>>>>>> Also, we could support a value of that variable which is a list, like
>>>>>> we do with font-lock-maximum-decoration.  This will be more useful
>>>>>> than making the variable buffer-local, since presumably your
>>>>>> preferences are per-mode, not per-buffer.  
>>>>> Yes, this would be a better solution.
>>>>> 
>>>>>> Yuan, WDYT?
>>>>>> 
>>>> 
>>>> Perhaps even per-language, for multi-language modes. I think it’s a valid use case. IMO, specify the level by language is better than mode. For single language modes, using the language is equivalent to using the mode; for multi-language modes, using the language allows more flexibility. 
>>> 
>>> Patches welcome.
>> 
>> I’ll prepare one soon.
> 
> Did you have time to work on this?

Actually, I want to expand this to something that allows users to configure tree-sitter modes and toggle on/off tree-sitter features.

Since Emacs 29, I see many people ask about how to configure tree-sitter modes by setting some variable. It seems that people much prefer setting a variable than adding a major mode hook that calls some functions. Also, admittedly adding custom font-lock or indent rules aren’t very straightforward for users.

In Emacs 29, we went with the major-mode hook approach for customization since the major mode inheritance situation wasn’t yet clear, the hook approach provides most flexibility, and we don’t really know what we want. But at this point I think we can add another layer of convenience. As long as this convenience layer handles 90% of the use-cases and doesn’t add any confusion, it’ll be a net-gain.

What do you guys think about something like this:

(setq treesit-global-configuration
      '((c-ts-mode
         ;; Set treesit-font-lock-level to 4
         (font-lock-level . 4)
         ;; Disable tree-sitter’s outline support
         (outline . disable)
         ;; Enable these features on top of the default ones.
         (font-lock-enable . (function property variable))
         ;; Disable these features.
         (font-lock-disable . (definition))
         ;; Add extra font-lock rules
         (font-lock-extra-rules
          ( :feature 'my-rules
            :language 'c
            ((some_query) @some-face)))
         (simple-indent-extra-rules
          (c
           (matcher anchor offset))
          (d
           (matcher anchor offset)))
         )))

This config will apply to c-ts-mode or its derived mode. Users can add extra font-lock and indent rules by setting this variable.

One thing I don’t like is how it handles languages. In this POC language-specific settings are nested under the mode. I’m ok with mode-language hierarchy, but the nesting adds a lot of nesting levels to the variable. And the language nesting isn’t consistent, some settings have language nesting, some don’t.

Implementation wise, it shouldn’t be difficult. We just make treesit-major-mode-setup aware of this variable and use it to override stuff when setting things up.

Yuan


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Tree-sitter central configuration variable
  2024-11-29  5:49             ` Tree-sitter central configuration variable Yuan Fu
@ 2024-11-29  7:12               ` Eshel Yaron
  2024-11-29  8:42                 ` Yuan Fu
  2024-11-29  8:04               ` Eli Zaretskii
  1 sibling, 1 reply; 13+ messages in thread
From: Eshel Yaron @ 2024-11-29  7:12 UTC (permalink / raw)
  To: Yuan Fu
  Cc: Eli Zaretskii, Vincenzo Pupillo, Emacs Devel, Stefan Kangas,
	Stefan Monnier, Dmitry Gutov

Hi,

Yuan Fu <casouri@gmail.com> writes:

> Actually, I want to expand this to something that allows users to
> configure tree-sitter modes and toggle on/off tree-sitter features.
>
[...]
>
> What do you guys think about something like this:
>
> (setq treesit-global-configuration
>       '((c-ts-mode
>          ;; Set treesit-font-lock-level to 4
>          (font-lock-level . 4)
[...]
>          )))
>

FWIW, I think tree-sitter should be treated as an implementation detail,
not a user-facing concept.  Tree-sitter empowers major mode developers,
but users shouldn't have to know or care about tree-sitter to benefit
from it.  So a TS-specific configuration user option is not the best
approach IMO.  (For the same reason, I'm not a big fan of the "powered
by tree-sitter" that we have today in the docstring of various modes.)

Instead, when it comes to settings that apply across major modes, I
think we should focus on knobs that make sense whether or not a mode is
TS-based, and have the TS-based modes respect such universal settings.


Just my 2c,

Eshel



^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Tree-sitter central configuration variable
  2024-11-29  5:49             ` Tree-sitter central configuration variable Yuan Fu
  2024-11-29  7:12               ` Eshel Yaron
@ 2024-11-29  8:04               ` Eli Zaretskii
  2024-11-29  9:07                 ` Yuan Fu
  1 sibling, 1 reply; 13+ messages in thread
From: Eli Zaretskii @ 2024-11-29  8:04 UTC (permalink / raw)
  To: Yuan Fu; +Cc: v.pupillo, emacs-devel, stefankangas, monnier, dmitry

> From: Yuan Fu <casouri@gmail.com>
> Date: Thu, 28 Nov 2024 21:49:33 -0800
> Cc: Vincenzo Pupillo <v.pupillo@gmail.com>,
>  Emacs Devel <emacs-devel@gnu.org>,
>  Stefan Kangas <stefankangas@gmail.com>,
>  Stefan Monnier <monnier@iro.umontreal.ca>,
>  Dmitry Gutov <dmitry@gutov.dev>
> 
> Actually, I want to expand this to something that allows users to configure tree-sitter modes and toggle on/off tree-sitter features.
> 
> Since Emacs 29, I see many people ask about how to configure tree-sitter modes by setting some variable. It seems that people much prefer setting a variable than adding a major mode hook that calls some functions. Also, admittedly adding custom font-lock or indent rules aren’t very straightforward for users.
> 
> In Emacs 29, we went with the major-mode hook approach for customization since the major mode inheritance situation wasn’t yet clear, the hook approach provides most flexibility, and we don’t really know what we want. But at this point I think we can add another layer of convenience. As long as this convenience layer handles 90% of the use-cases and doesn’t add any confusion, it’ll be a net-gain.

We are discussing the solutions to these issues in this thread:

  https://lists.gnu.org/archive/html/emacs-devel/2024-11/msg00636.html

Feel free to chime in and state your opinions about the solutions
being proposed there.

> What do you guys think about something like this:
> 
> (setq treesit-global-configuration
>       '((c-ts-mode
>          ;; Set treesit-font-lock-level to 4
>          (font-lock-level . 4)
>          ;; Disable tree-sitter’s outline support
>          (outline . disable)
>          ;; Enable these features on top of the default ones.
>          (font-lock-enable . (function property variable))
>          ;; Disable these features.
>          (font-lock-disable . (definition))
>          ;; Add extra font-lock rules
>          (font-lock-extra-rules
>           ( :feature 'my-rules
>             :language 'c
>             ((some_query) @some-face)))
>          (simple-indent-extra-rules
>           (c
>            (matcher anchor offset))
>           (d
>            (matcher anchor offset)))
>          )))
> 
> This config will apply to c-ts-mode or its derived mode. Users can add extra font-lock and indent rules by setting this variable.

I'm not sure I understand what problem(s) this intends to solve?  Why
should we have a global variable that customizes several different
modes, if the specific customizations made by that variable depend on
the individual modes?

> One thing I don’t like is how it handles languages. In this POC language-specific settings are nested under the mode. I’m ok with mode-language hierarchy, but the nesting adds a lot of nesting levels to the variable. And the language nesting isn’t consistent, some settings have language nesting, some don’t.

We don't have a notion of "language", we only have major modes.  Emacs
30 makes it easier to specify settings common to modes that serve the
same "language" by introducing the notion of "extra parent" modes.
But that is only a small half-step; I don't think we have figured out
what should be the goal.



^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Tree-sitter central configuration variable
  2024-11-29  7:12               ` Eshel Yaron
@ 2024-11-29  8:42                 ` Yuan Fu
  0 siblings, 0 replies; 13+ messages in thread
From: Yuan Fu @ 2024-11-29  8:42 UTC (permalink / raw)
  To: Eshel Yaron
  Cc: Eli Zaretskii, Vincenzo Pupillo, Emacs Devel, Stefan Kangas,
	Stefan Monnier, Dmitry Gutov



> On Nov 28, 2024, at 11:12 PM, Eshel Yaron <me@eshelyaron.com> wrote:
> 
> Hi,
> 
> Yuan Fu <casouri@gmail.com> writes:
> 
>> Actually, I want to expand this to something that allows users to
>> configure tree-sitter modes and toggle on/off tree-sitter features.
>> 
> [...]
>> 
>> What do you guys think about something like this:
>> 
>> (setq treesit-global-configuration
>>      '((c-ts-mode
>>         ;; Set treesit-font-lock-level to 4
>>         (font-lock-level . 4)
> [...]
>>         )))
>> 
> 
> FWIW, I think tree-sitter should be treated as an implementation detail,
> not a user-facing concept.  Tree-sitter empowers major mode developers,
> but users shouldn't have to know or care about tree-sitter to benefit
> from it.  So a TS-specific configuration user option is not the best
> approach IMO.  (For the same reason, I'm not a big fan of the "powered
> by tree-sitter" that we have today in the docstring of various modes.)

I principle, I agree, and that’s what we’ve been doing: we expose tree-sitter as a mechanism for major modes to build on. However, the reality is, something like this just aren’t going to be transparent—users know it’s powered by tree-sitter, they’ll need to have tree-sitter knowledge for non-trivial customizations (eg, add a font-lock rule or indent rule), and the major mode is tied-in with tree-sitter, implementing the major mode with another parser library would require creating a new major mode.

Once you accept that these modes are tied with tree-sitter, a TS-specific configuration starts to seem pretty reasonable. Also, think of the advantage of using a single set of configuration: no need to read the manual and learn the variables etc for each individual major modes; as long as the mode uses tree-sitter features, you learn the configuration framework once and use it on every mode.

> Instead, when it comes to settings that apply across major modes, I
> think we should focus on knobs that make sense whether or not a mode is
> TS-based, and have the TS-based modes respect such universal settings.

Perhaps you mean something that configures things like font-lock, imenu, etc? (I can’t really think of a third example). That’s a bit ambitious, the return is unclear to me, and gods bless whoever wants to fight through emacs-devel for it :-) OTOH, a tree-sitter configuration variable is pretty concrete, I know people want it, and more or less noncontroversial (I hope).

Yuan


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Tree-sitter central configuration variable
  2024-11-29  8:04               ` Eli Zaretskii
@ 2024-11-29  9:07                 ` Yuan Fu
  2024-11-29 12:02                   ` Eli Zaretskii
  0 siblings, 1 reply; 13+ messages in thread
From: Yuan Fu @ 2024-11-29  9:07 UTC (permalink / raw)
  To: Eli Zaretskii
  Cc: Vincenzo Pupillo, Emacs Devel, Stefan Kangas, Stefan Monnier,
	dmitry



> On Nov 29, 2024, at 12:04 AM, Eli Zaretskii <eliz@gnu.org> wrote:
> 
>> From: Yuan Fu <casouri@gmail.com>
>> Date: Thu, 28 Nov 2024 21:49:33 -0800
>> Cc: Vincenzo Pupillo <v.pupillo@gmail.com>,
>> Emacs Devel <emacs-devel@gnu.org>,
>> Stefan Kangas <stefankangas@gmail.com>,
>> Stefan Monnier <monnier@iro.umontreal.ca>,
>> Dmitry Gutov <dmitry@gutov.dev>
>> 
>> Actually, I want to expand this to something that allows users to configure tree-sitter modes and toggle on/off tree-sitter features.
>> 
>> Since Emacs 29, I see many people ask about how to configure tree-sitter modes by setting some variable. It seems that people much prefer setting a variable than adding a major mode hook that calls some functions. Also, admittedly adding custom font-lock or indent rules aren’t very straightforward for users.
>> 
>> In Emacs 29, we went with the major-mode hook approach for customization since the major mode inheritance situation wasn’t yet clear, the hook approach provides most flexibility, and we don’t really know what we want. But at this point I think we can add another layer of convenience. As long as this convenience layer handles 90% of the use-cases and doesn’t add any confusion, it’ll be a net-gain.
> 
> We are discussing the solutions to these issues in this thread:
> 
>  https://lists.gnu.org/archive/html/emacs-devel/2024-11/msg00636.html
> 
> Feel free to chime in and state your opinions about the solutions
> being proposed there.

That thread is about turning tree-sitter modes on and off. I think Dmitry’s proposal seems pretty good, and I don’t have much to add. What I’m proposing here is rather about configuring tree-sitter modes: turning features (outline, imenu, etc) on and off, customizing things (font-lock, indent, etc).

> 
>> What do you guys think about something like this:
>> 
>> (setq treesit-global-configuration
>>      '((c-ts-mode
>>         ;; Set treesit-font-lock-level to 4
>>         (font-lock-level . 4)
>>         ;; Disable tree-sitter’s outline support
>>         (outline . disable)
>>         ;; Enable these features on top of the default ones.
>>         (font-lock-enable . (function property variable))
>>         ;; Disable these features.
>>         (font-lock-disable . (definition))
>>         ;; Add extra font-lock rules
>>         (font-lock-extra-rules
>>          ( :feature 'my-rules
>>            :language 'c
>>            ((some_query) @some-face)))
>>         (simple-indent-extra-rules
>>          (c
>>           (matcher anchor offset))
>>          (d
>>           (matcher anchor offset)))
>>         )))
>> 
>> This config will apply to c-ts-mode or its derived mode. Users can add extra font-lock and indent rules by setting this variable.
> 
> I'm not sure I understand what problem(s) this intends to solve?  Why
> should we have a global variable that customizes several different
> modes, if the specific customizations made by that variable depend on
> the individual modes?

I’m trying to solve several things: first off, turning on/off some tree-sitter features. Originally someone wants to turn off tree-sitter mode’s outline support. Technically they don’t really need to turn of tree-sitter’s outline, they just need to make the minor mode they’re using override tree-sitter’s outline configuration completely. Nevertheless, wanting to turn off a tree-sitter feature seems a reasonable ask so I want to support it.

Secondly, many people wants to configure tree-sitter modes by setting a variable. Right now, if one needs to add/remove some font-lock features they need to do this:

(defun x-mode-setup ()
  (treesit-font-lock-recompute-features '(feature-a) '(feature-b)))
(add-hook 'x-ts-mode-hook #'x-mode-setup)

If they want to add some custom font-lock rules:

(defun x-mode-setup ()
  (setq treesit-font-lock-settings
        (append treesit-font-lock-settings
                (treesit-font-lock-rules
                 :feature 'custom-feature
                 :language 'lang
                 ((some_node) @face))))
  (treesit-font-lock-recompute-features))
(add-hook 'x-ts-mode-hook #'x-mode-setup)

That might be a bit convoluted. I’m hoping that I can simplify things by the proposed variable.

> 
>> One thing I don’t like is how it handles languages. In this POC language-specific settings are nested under the mode. I’m ok with mode-language hierarchy, but the nesting adds a lot of nesting levels to the variable. And the language nesting isn’t consistent, some settings have language nesting, some don’t.
> 
> We don't have a notion of "language", we only have major modes.  Emacs
> 30 makes it easier to specify settings common to modes that serve the
> same "language" by introducing the notion of "extra parent" modes.
> But that is only a small half-step; I don't think we have figured out
> what should be the goal.

We’re talking about different things. The language I was talking about are the languages in a multi-language modes. For a HTML mode, you’ll have HTML, CSS, and Javascript, and the we might want to configure the three languages differently in the HTML major mode. Eg, add a custom indent rule for CSS in the mode.

Yuan




^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Tree-sitter central configuration variable
  2024-11-29  9:07                 ` Yuan Fu
@ 2024-11-29 12:02                   ` Eli Zaretskii
  0 siblings, 0 replies; 13+ messages in thread
From: Eli Zaretskii @ 2024-11-29 12:02 UTC (permalink / raw)
  To: Yuan Fu; +Cc: v.pupillo, emacs-devel, stefankangas, monnier, dmitry

> From: Yuan Fu <casouri@gmail.com>
> Date: Fri, 29 Nov 2024 01:07:37 -0800
> Cc: Vincenzo Pupillo <v.pupillo@gmail.com>,
>  Emacs Devel <emacs-devel@gnu.org>,
>  Stefan Kangas <stefankangas@gmail.com>,
>  Stefan Monnier <monnier@iro.umontreal.ca>,
>  dmitry@gutov.dev
> 
> >> (setq treesit-global-configuration
> >>      '((c-ts-mode
> >>         ;; Set treesit-font-lock-level to 4
> >>         (font-lock-level . 4)
> >>         ;; Disable tree-sitter’s outline support
> >>         (outline . disable)
> >>         ;; Enable these features on top of the default ones.
> >>         (font-lock-enable . (function property variable))
> >>         ;; Disable these features.
> >>         (font-lock-disable . (definition))
> >>         ;; Add extra font-lock rules
> >>         (font-lock-extra-rules
> >>          ( :feature 'my-rules
> >>            :language 'c
> >>            ((some_query) @some-face)))
> >>         (simple-indent-extra-rules
> >>          (c
> >>           (matcher anchor offset))
> >>          (d
> >>           (matcher anchor offset)))
> >>         )))
> >> 
> >> This config will apply to c-ts-mode or its derived mode. Users can add extra font-lock and indent rules by setting this variable.
> > 
> > I'm not sure I understand what problem(s) this intends to solve?  Why
> > should we have a global variable that customizes several different
> > modes, if the specific customizations made by that variable depend on
> > the individual modes?
> 
> I’m trying to solve several things: first off, turning on/off some tree-sitter features. Originally someone wants to turn off tree-sitter mode’s outline support. Technically they don’t really need to turn of tree-sitter’s outline, they just need to make the minor mode they’re using override tree-sitter’s outline configuration completely. Nevertheless, wanting to turn off a tree-sitter feature seems a reasonable ask so I want to support it.
> 
> Secondly, many people wants to configure tree-sitter modes by setting a variable. Right now, if one needs to add/remove some font-lock features they need to do this:
> 
> (defun x-mode-setup ()
>   (treesit-font-lock-recompute-features '(feature-a) '(feature-b)))
> (add-hook 'x-ts-mode-hook #'x-mode-setup)
> 
> If they want to add some custom font-lock rules:
> 
> (defun x-mode-setup ()
>   (setq treesit-font-lock-settings
>         (append treesit-font-lock-settings
>                 (treesit-font-lock-rules
>                  :feature 'custom-feature
>                  :language 'lang
>                  ((some_node) @face))))
>   (treesit-font-lock-recompute-features))
> (add-hook 'x-ts-mode-hook #'x-mode-setup)
> 
> That might be a bit convoluted. I’m hoping that I can simplify things by the proposed variable.

But we didn't support any such features with the "traditional" regexp-
and syntax-based fontifications, so why would tree-sitter based modes
be special in this regard?  Are you saying that tree-sitter based
modes somehow need to update rules for fontification, indentation, and
other stuff much more frequently than the "traditional" modes?  If so,
why?

Btw, I don't see the above hooks more convoluted than your
treesit-global-configuration variable.  They both need a good
understanding of tree-sitter grammars and of Lisp.

> > We don't have a notion of "language", we only have major modes.  Emacs
> > 30 makes it easier to specify settings common to modes that serve the
> > same "language" by introducing the notion of "extra parent" modes.
> > But that is only a small half-step; I don't think we have figured out
> > what should be the goal.
> 
> We’re talking about different things. The language I was talking about are the languages in a multi-language modes. For a HTML mode, you’ll have HTML, CSS, and Javascript, and the we might want to configure the three languages differently in the HTML major mode. Eg, add a custom indent rule for CSS in the mode.

How would the variable you propose help in doing that?  IOW, I guess I
don't see clearly enough the problem you want to solve in this case
and the solution you are proposing for that.  Please elaborate.




^ permalink raw reply	[flat|nested] 13+ messages in thread

end of thread, other threads:[~2024-11-29 12:02 UTC | newest]

Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-11-07 10:30 Define treesit-font-lock-level as buffer local Vincenzo Pupillo
2024-11-07 11:03 ` Eli Zaretskii
2024-11-07 11:08   ` Vincenzo Pupillo
2024-11-09  8:36     ` Yuan Fu
2024-11-09  8:54       ` Eli Zaretskii
2024-11-10  8:04         ` Yuan Fu
2024-11-23 12:20           ` Eli Zaretskii
2024-11-29  5:49             ` Tree-sitter central configuration variable Yuan Fu
2024-11-29  7:12               ` Eshel Yaron
2024-11-29  8:42                 ` Yuan Fu
2024-11-29  8:04               ` Eli Zaretskii
2024-11-29  9:07                 ` Yuan Fu
2024-11-29 12:02                   ` Eli Zaretskii

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).