unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
@ 2024-11-18 10:12 Guillaume Brunerie
  2024-11-24  9:59 ` Eli Zaretskii
  0 siblings, 1 reply; 10+ messages in thread
From: Guillaume Brunerie @ 2024-11-18 10:12 UTC (permalink / raw)
  To: 74415

Hello,

The `mouse-start-end` function (used in particular when double clicking on an
opening parenthesis to select the region from that opening parenthesis to the
matching closing parenthesis) does not respect syntax-table text properties to
determine if the character at point is an opening parenthesis. Looking at the
code, it is at line 1941 in lisp/mouse.el (and there are more instances further
down in the same file)
(https://github.com/emacs-mirror/emacs/blob/eee0ed8442aa78320a3e578ab290df145fb49624/lisp/mouse.el#L1941):

    ((and (= mode 1)
          (= start end)
          (char-after start)
          (= (char-syntax (char-after start)) ?\())

Note that the documentation of `char-syntax` says:

> If you’re trying to determine the syntax of characters in the buffer, this is
> probably the wrong function to use, because it can’t take ‘syntax-table’ text
> properties into account. Consider using ‘syntax-after’ instead.

The line just below does use `syntax-after`, but it looks like something that
was added later to work around a related bug. I think this function should be
refactored to only use `syntax-after` instead of `char-syntax`.

For context, I'm writing my own Typescript major mode where the < and > symbols
are sometimes balanced delimiters, sometimes not (determined via Tree-sitter). I
could get most things working using syntax-table text properties, like
forward-sexp and show-paren-mode, but not double-click selection due to this
issue.

Related: https://lists.gnu.org/archive/html/bug-gnu-emacs/2016-02/msg00988.html
In that issue, the opposite problem occurred, double clicking on an unmatched
open parenthesis did not take into account the "punctuation" text property. But
the fix only fixed one half of the issue (when a text property makes a
parenthesis into not a parenthesis) and I am facing the other half (a text
property makes a punctuation character into a parenthesis).

Best,
Guillaume





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-11-18 10:12 bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties Guillaume Brunerie
@ 2024-11-24  9:59 ` Eli Zaretskii
  2024-11-25 23:15   ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
  0 siblings, 1 reply; 10+ messages in thread
From: Eli Zaretskii @ 2024-11-24  9:59 UTC (permalink / raw)
  To: Guillaume Brunerie, Stefan Monnier; +Cc: 74415

> From: Guillaume Brunerie <guillaume.brunerie@gmail.com>
> Date: Mon, 18 Nov 2024 11:12:49 +0100
> 

^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-11-24  9:59 ` Eli Zaretskii
@ 2024-11-25 23:15   ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
  2024-11-28 16:02     ` Eli Zaretskii
  0 siblings, 1 reply; 10+ messages in thread
From: Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors @ 2024-11-25 23:15 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: Guillaume Brunerie, 74415

> Stefan, is there any reason not to use syntax-after everywhere in
> mouse.el?

Not that I can think of, no.
AFAIU, this code simply predates `syntax-after`.


        Stefan






^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-11-25 23:15   ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
@ 2024-11-28 16:02     ` Eli Zaretskii
  2024-11-28 20:03       ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
  2024-12-05  6:13       ` Guillaume Brunerie
  0 siblings, 2 replies; 10+ messages in thread
From: Eli Zaretskii @ 2024-11-28 16:02 UTC (permalink / raw)
  To: guillaume.brunerie, Stefan Monnier; +Cc: 74415

> From: Stefan Monnier <monnier@iro.umontreal.ca>
> Cc: Guillaume Brunerie <guillaume.brunerie@gmail.com>,  74415@debbugs.gnu.org
> Date: Mon, 25 Nov 2024 18:15:12 -0500
> 
> > Stefan, is there any reason not to use syntax-after everywhere in
> > mouse.el?
> 
> Not that I can think of, no.
> AFAIU, this code simply predates `syntax-after`.

Thanks.  Guillaume, does the patch below give good results?

diff --git a/lisp/mouse.el b/lisp/mouse.el
index 410e52b..766c4d8 100644
--- a/lisp/mouse.el
+++ b/lisp/mouse.el
@@ -632,7 +632,9 @@ context-menu-region
     (with-current-buffer (window-buffer (posn-window (event-end click)))
       (when (let* ((pos (posn-point (event-end click)))
                    (char (when pos (char-after pos))))
-              (or (and char (eq (char-syntax char) ?\"))
+              (or (and char (eq (syntax-class-to-char
+                                 (syntax-class (syntax-after pos)))
+                                ?\"))
                   (nth 3 (save-excursion (syntax-ppss pos)))))
         (define-key-after submenu [mark-string]
           `(menu-item "String"
@@ -1890,7 +1892,8 @@ mouse-skip-word
 If `mouse-1-double-click-prefer-symbols' is non-nil, skip over symbol.
 If DIR is positive skip forward; if negative, skip backward."
   (let* ((char (following-char))
-	 (syntax (char-to-string (char-syntax char)))
+	 (syntax (char-to-string
+                  (syntax-class-to-char (syntax-class (syntax-after (point))))))
          sym)
     (cond ((and mouse-1-double-click-prefer-symbols
                 (setq sym (bounds-of-thing-at-point 'symbol)))
@@ -1938,7 +1941,9 @@ mouse-start-end
         ((and (= mode 1)
               (= start end)
 	      (char-after start)
-              (= (char-syntax (char-after start)) ?\())
+              (= (syntax-class-to-char
+                  (syntax-class (syntax-after start)))
+                 ?\())
          (if (/= (syntax-class (syntax-after start)) 4) ; raw syntax code for ?\(
              ;; This happens in CC Mode when unbalanced parens in CPP
              ;; constructs are given punctuation syntax with
@@ -1953,7 +1958,9 @@ mouse-start-end
         ((and (= mode 1)
               (= start end)
 	      (char-after start)
-              (= (char-syntax (char-after start)) ?\)))
+              (= (syntax-class-to-char
+                  (syntax-class (syntax-after start)))
+                 ?\)))
          (if (/= (syntax-class (syntax-after start)) 5) ; raw syntax code for ?\)
              ;; See above comment about CC Mode.
              (signal 'scan-error (list "Unbalanced parentheses" start start))
@@ -1965,7 +1972,9 @@ mouse-start-end
 	((and (= mode 1)
               (= start end)
 	      (char-after start)
-              (= (char-syntax (char-after start)) ?\"))
+              (= (syntax-class-to-char
+                  (syntax-class (syntax-after start)))
+                 ?\"))
 	 (let ((open (or (eq start (point-min))
 			 (save-excursion
 			   (goto-char (- start 1))





^ permalink raw reply related	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-11-28 16:02     ` Eli Zaretskii
@ 2024-11-28 20:03       ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
  2024-12-05  6:13       ` Guillaume Brunerie
  1 sibling, 0 replies; 10+ messages in thread
From: Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors @ 2024-11-28 20:03 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: guillaume.brunerie, 74415

> Thanks.  Guillaume, does the patch below give good results?

the latch LGTM, but it bought up the following side note: maybe we
should add a `syntax-class-of-char` so we can replace

    (= (syntax-class-to-char FOO)
       BAR))

with

    (= FOO
       (syntax-class-of-char BAR))

where `syntax-class-of-char` can be pre-computed during compilation
(when BAR is a constant, which would be always the case in your patch).

Tho, maybe even better would be a `(syntax-class-p BAR FOO)`.

Anyway, nothing urgent.


        Stefan






^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-11-28 16:02     ` Eli Zaretskii
  2024-11-28 20:03       ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
@ 2024-12-05  6:13       ` Guillaume Brunerie
  2024-12-07 12:33         ` Eli Zaretskii
  1 sibling, 1 reply; 10+ messages in thread
From: Guillaume Brunerie @ 2024-12-05  6:13 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: Stefan Monnier, 74415

Den tors 28 nov. 2024 kl 17:02 skrev Eli Zaretskii <eliz@gnu.org>:
> Thanks.  Guillaume, does the patch below give good results?

Thank you, I haven’t managed to apply the patch locally yet but I
think that would work (I’m on Emacs 29.4, but I guess I might need to
get the development version of Emacs? The patch seems to fail on my
mouse.el).
But one thing I want to point out is that the two `(signal 'scan-error
[...])` seem to be dead code now, as they test the exact opposite of
what the previous test now does.
So I guess it should be implemented a bit differently if you want to
preserve the current behavior (have a signal 'scan-error' when double
clicking on unbalanced parentheses in CC mode).





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-12-05  6:13       ` Guillaume Brunerie
@ 2024-12-07 12:33         ` Eli Zaretskii
  2024-12-09 19:21           ` Guillaume Brunerie
  0 siblings, 1 reply; 10+ messages in thread
From: Eli Zaretskii @ 2024-12-07 12:33 UTC (permalink / raw)
  To: Guillaume Brunerie; +Cc: monnier, 74415

> From: Guillaume Brunerie <guillaume.brunerie@gmail.com>
> Date: Thu, 5 Dec 2024 07:13:20 +0100
> Cc: Stefan Monnier <monnier@iro.umontreal.ca>, 74415@debbugs.gnu.org
> 
> Den tors 28 nov. 2024 kl 17:02 skrev Eli Zaretskii <eliz@gnu.org>:
> > Thanks.  Guillaume, does the patch below give good results?
> 
> Thank you, I haven’t managed to apply the patch locally yet but I
> think that would work (I’m on Emacs 29.4, but I guess I might need to
> get the development version of Emacs? The patch seems to fail on my
> mouse.el).
> But one thing I want to point out is that the two `(signal 'scan-error
> [...])` seem to be dead code now, as they test the exact opposite of
> what the previous test now does.

Sorry, I don't understand.  The changes I proposed didn't touch the
lines that signals errors.  Are you saying that those errors were dead
code before these changes as well?  If not, could you please elaborate
on the issues you see with the changes I proposed?

> So I guess it should be implemented a bit differently if you want to
> preserve the current behavior (have a signal 'scan-error' when double
> clicking on unbalanced parentheses in CC mode).






^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-12-07 12:33         ` Eli Zaretskii
@ 2024-12-09 19:21           ` Guillaume Brunerie
  2024-12-13 17:05             ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
  0 siblings, 1 reply; 10+ messages in thread
From: Guillaume Brunerie @ 2024-12-09 19:21 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: monnier, 74415

Den lör 7 dec. 2024 kl 13:33 skrev Eli Zaretskii <eliz@gnu.org>:
>
> > From: Guillaume Brunerie <guillaume.brunerie@gmail.com>
> > Date: Thu, 5 Dec 2024 07:13:20 +0100
> > Cc: Stefan Monnier <monnier@iro.umontreal.ca>, 74415@debbugs.gnu.org
> >
> > Den tors 28 nov. 2024 kl 17:02 skrev Eli Zaretskii <eliz@gnu.org>:
> > > Thanks.  Guillaume, does the patch below give good results?
> >
> > Thank you, I haven’t managed to apply the patch locally yet but I
> > think that would work (I’m on Emacs 29.4, but I guess I might need to
> > get the development version of Emacs? The patch seems to fail on my
> > mouse.el).
> > But one thing I want to point out is that the two `(signal 'scan-error
> > [...])` seem to be dead code now, as they test the exact opposite of
> > what the previous test now does.
>
> Sorry, I don't understand.  The changes I proposed didn't touch the
> lines that signals errors.  Are you saying that those errors were dead
> code before these changes as well?  If not, could you please elaborate
> on the issues you see with the changes I proposed?

No, it was not dead code before, but changing the outer condition
makes it impossible for both the outer condition and the inner
condition to be true at the same time.
The current code is the following (inside a cond)

((and (= mode 1)
      (= start end)
      (char-after start)
      (= (char-syntax (char-after start)) ?\())
 (if (/= (syntax-class (syntax-after start)) 4) ; raw syntax code for ?\(
     ;; This happens in CC Mode when unbalanced parens in CPP
     ;; constructs are given punctuation syntax with
     ;; syntax-table text properties.  (2016-02-21).
     (signal 'scan-error (list "Containing expression ends prematurely"
                               start start))
   (list start
         (save-excursion
           (goto-char start)
           (forward-sexp 1)
           (point)))))

So the 'scan-error happens when the character is a parenthesis
character according to the syntax table (that's what is tested by (=
(char-syntax (char-after start)) ?\()) but has a text property telling
Emacs to treat it as something else than a parenthesis instead.
Changing the "char-syntax" test to (= (syntax-class-to-char
(syntax-class (syntax-after start))) ?\()) makes it so that the
'scan-error happens if the character is both a parenthesis and not a
parenthesis according to text properties, which is not possible.

In other words, it is not possible for both (= (syntax-class-to-char
(syntax-class (syntax-after start))) ?\()) and (/= (syntax-class
(syntax-after start)) 4) to be simultaneously true, as they are the
exact opposite of each other, so when they are nested conditions, the
inner one becomes dead code.

That said, it only seems to happen in rare edge cases (as in
https://lists.gnu.org/archive/html/bug-gnu-emacs/2016-02/msg00988.html),
as regular unmatched parentheses do seem to still trigger a different
"Unbalanced parentheses" scan error.

Apart from that, I have now managed to test the patch, and I can
confirm that it fixes my original issue





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-12-09 19:21           ` Guillaume Brunerie
@ 2024-12-13 17:05             ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
  2025-01-04 10:54               ` Eli Zaretskii
  0 siblings, 1 reply; 10+ messages in thread
From: Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors @ 2024-12-13 17:05 UTC (permalink / raw)
  To: Guillaume Brunerie; +Cc: Eli Zaretskii, 74415

> No, it was not dead code before, but changing the outer condition
> makes it impossible for both the outer condition and the inner
> condition to be true at the same time.
> The current code is the following (inside a cond)
>
> ((and (= mode 1)
>       (= start end)
>       (char-after start)
>       (= (char-syntax (char-after start)) ?\())
>  (if (/= (syntax-class (syntax-after start)) 4) ; raw syntax code for ?\(
>      ;; This happens in CC Mode when unbalanced parens in CPP
>      ;; constructs are given punctuation syntax with
>      ;; syntax-table text properties.  (2016-02-21).
>      (signal 'scan-error (list "Containing expression ends prematurely"
>                                start start))
>    (list start
>          (save-excursion
>            (goto-char start)
>            (forward-sexp 1)
>            (point)))))

I have the strong impression that this reflects the fact that the
if+signal was a workaround which we're now replacing with an actual fix.


        Stefan






^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties
  2024-12-13 17:05             ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
@ 2025-01-04 10:54               ` Eli Zaretskii
  0 siblings, 0 replies; 10+ messages in thread
From: Eli Zaretskii @ 2025-01-04 10:54 UTC (permalink / raw)
  To: Stefan Monnier; +Cc: guillaume.brunerie, 74415-done

> From: Stefan Monnier <monnier@iro.umontreal.ca>
> Cc: Eli Zaretskii <eliz@gnu.org>,  74415@debbugs.gnu.org
> Date: Fri, 13 Dec 2024 12:05:55 -0500
> 
> > No, it was not dead code before, but changing the outer condition
> > makes it impossible for both the outer condition and the inner
> > condition to be true at the same time.
> > The current code is the following (inside a cond)
> >
> > ((and (= mode 1)
> >       (= start end)
> >       (char-after start)
> >       (= (char-syntax (char-after start)) ?\())
> >  (if (/= (syntax-class (syntax-after start)) 4) ; raw syntax code for ?\(
> >      ;; This happens in CC Mode when unbalanced parens in CPP
> >      ;; constructs are given punctuation syntax with
> >      ;; syntax-table text properties.  (2016-02-21).
> >      (signal 'scan-error (list "Containing expression ends prematurely"
> >                                start start))
> >    (list start
> >          (save-excursion
> >            (goto-char start)
> >            (forward-sexp 1)
> >            (point)))))
> 
> I have the strong impression that this reflects the fact that the
> if+signal was a workaround which we're now replacing with an actual fix.

Evidently.  So I've now installed my changes on the master branch,
after removing the unneeded code which signals an error, and I'm
therefore closing this bug.





^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2025-01-04 10:54 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-11-18 10:12 bug#74415: 29.4; mouse-start-end does not respect syntax-table text properties Guillaume Brunerie
2024-11-24  9:59 ` Eli Zaretskii
2024-11-25 23:15   ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
2024-11-28 16:02     ` Eli Zaretskii
2024-11-28 20:03       ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
2024-12-05  6:13       ` Guillaume Brunerie
2024-12-07 12:33         ` Eli Zaretskii
2024-12-09 19:21           ` Guillaume Brunerie
2024-12-13 17:05             ` Stefan Monnier via Bug reports for GNU Emacs, the Swiss army knife of text editors
2025-01-04 10:54               ` Eli Zaretskii

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).