unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _
@ 2021-03-12 20:31 E. Choroba
  2021-03-13  0:27 ` bug#47112: Don't interpret '_' as word boundary [PATCH] Harald Jörg
  0 siblings, 1 reply; 3+ messages in thread
From: E. Choroba @ 2021-03-12 20:31 UTC (permalink / raw)
  To: 47112

The quote-like operators q, qq, qr, qw, qx, tr, y, m, and s can use word 
characters as quoting characters, but then they must be followed by
whitespace. In Perl, "_" is also a word character, but cperl-mode
doesn't follow this rule. E.g.

sub y_max {

is highlighted as if the y was the transliteration operator, but in
fact "y_max" is a subroutine name, as there's no space after "y".

 	Ch.





^ permalink raw reply	[flat|nested] 3+ messages in thread

* bug#47112: Don't interpret '_' as word boundary [PATCH]
  2021-03-12 20:31 bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ E. Choroba
@ 2021-03-13  0:27 ` Harald Jörg
  2021-03-18  7:06   ` bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ Lars Ingebrigtsen
  0 siblings, 1 reply; 3+ messages in thread
From: Harald Jörg @ 2021-03-13  0:27 UTC (permalink / raw)
  To: 47112

[-- Attachment #1: Type: text/plain, Size: 697 bytes --]

This is a border case which affects several dark corners in CPerl mode.
Per default, the '_' character is not a word character, but a symbol
character in Perl.

The bogus regexp in `cperl-find-pods-heres' checks the quote-like things
for "ending in a word boundary".  Per default there _is_ a word boundary
between 'y' and '_', and from here it goes downward.

It is wrong to change the search pattern to "symbol boundary" instead of
"word boundary": The ':' character is also a symbol in CPerl mode and
the regexp would then miss q:string: constructs.

The patch is a small change to eliminate the false detection of
quote constructs by yet another special case, plus some tests.
-- 
Cheers,
haj

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: cperl-mode: No word boundary in "y_" --]
[-- Type: text/x-diff, Size: 3799 bytes --]

From c68e15dc4bfceb6e084645f987fc401edb87f7d9 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Harald=20J=C3=B6rg?= <haj@posteo.de>
Date: Sat, 13 Mar 2021 00:57:30 +0100
Subject: [PATCH] ; cperl-mode: Don't interpret y_ as start of y// function.

* lisp/progmodes/cperl-mode.el (cperl-find-pods-heres): Avoid
treating underscores as word-terminators.

* test/lisp/progmodes/cperl-mode-tests.el
(cperl-test-bug-47112): Test case for that bug.
---
 lisp/progmodes/cperl-mode.el            | 33 ++++++++++++++-----------
 test/lisp/progmodes/cperl-mode-tests.el | 26 +++++++++++++++++++
 2 files changed, 44 insertions(+), 15 deletions(-)

diff --git a/lisp/progmodes/cperl-mode.el b/lisp/progmodes/cperl-mode.el
index 734797b3ad..7e7327d986 100644
--- a/lisp/progmodes/cperl-mode.el
+++ b/lisp/progmodes/cperl-mode.el
@@ -3926,21 +3926,24 @@ cperl-find-pods-heres
 		      bb (char-after (1- (match-beginning b1))) ; tmp holder
 		      ;; bb == "Not a stringy"
 		      bb (if (eq b1 10) ; user variables/whatever
-			     (and (memq bb (append "$@%*#_:-&>" nil)) ; $#y)
-				  (cond ((eq bb ?-) (eq c ?s)) ; -s file test
-					((eq bb ?\:) ; $opt::s
-					 (eq (char-after
-					      (- (match-beginning b1) 2))
-					     ?\:))
-					((eq bb ?\>) ; $foo->s
-					 (eq (char-after
-					      (- (match-beginning b1) 2))
-					     ?\-))
-					((eq bb ?\&)
-					 (not (eq (char-after ; &&m/blah/
-						   (- (match-beginning b1) 2))
-						  ?\&)))
-					(t t)))
+                             (or
+                              ; false positive: "y_" has no word boundary
+                              (save-match-data (looking-at "_"))
+			      (and (memq bb (append "$@%*#_:-&>" nil)) ; $#y)
+				   (cond ((eq bb ?-) (eq c ?s)) ; -s file test
+					 ((eq bb ?\:) ; $opt::s
+					  (eq (char-after
+					       (- (match-beginning b1) 2))
+					      ?\:))
+					 ((eq bb ?\>) ; $foo->s
+					  (eq (char-after
+					       (- (match-beginning b1) 2))
+					      ?\-))
+					 ((eq bb ?\&)
+					  (not (eq (char-after ; &&m/blah/
+						    (- (match-beginning b1) 2))
+						   ?\&)))
+					 (t t))))
 			   ;; <file> or <$file>
 			   (and (eq c ?\<)
 				;; Do not stringify <FH>, <$fh> :
diff --git a/test/lisp/progmodes/cperl-mode-tests.el b/test/lisp/progmodes/cperl-mode-tests.el
index 61e4ece49b..f0e15022d0 100644
--- a/test/lisp/progmodes/cperl-mode-tests.el
+++ b/test/lisp/progmodes/cperl-mode-tests.el
@@ -447,4 +447,30 @@ cperl-test-bug-45255
     ;; The yadda-yadda operator should not be in a string.
     (should (equal (nth 8 (cperl-test-ppss code "\\.")) nil))))
 
+(ert-deftest cperl-test-bug-47112 ()
+  "Check that in a bareword starting with a quote-like operator
+followed by an underscore is not interpreted as that quote-like
+operator.  Also check that a quote-like operator followed by a
+colon (which is, like ?_, a symbol in CPerl mode) _is_ identified
+as that quote like operator."
+  (with-temp-buffer
+    (funcall cperl-test-mode)
+    (insert "sub y_max { q:bar:; y _bar_foo_; }")
+    (goto-char (point-min))
+    (cperl-update-syntaxification (point-max))
+    (font-lock-fontify-buffer)
+    (search-forward "max")
+    (should (equal (get-text-property (match-beginning 0) 'face)
+                   'font-lock-function-name-face))
+    (search-forward "bar")
+    (should (equal (get-text-property (match-beginning 0) 'face)
+                   'font-lock-string-face))
+    ; perl-mode doesn't highlight
+    (when (eq cperl-test-mode #'cperl-mode)
+      (search-forward "_")
+      (should (equal (get-text-property (match-beginning 0) 'face)
+                     (if (eq cperl-test-mode #'cperl-mode)
+                         'font-lock-constant-face
+                       font-lock-string-face))))))
+
 ;;; cperl-mode-tests.el ends here
-- 
2.20.1


^ permalink raw reply related	[flat|nested] 3+ messages in thread

* bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _
  2021-03-13  0:27 ` bug#47112: Don't interpret '_' as word boundary [PATCH] Harald Jörg
@ 2021-03-18  7:06   ` Lars Ingebrigtsen
  0 siblings, 0 replies; 3+ messages in thread
From: Lars Ingebrigtsen @ 2021-03-18  7:06 UTC (permalink / raw)
  To: Harald Jörg; +Cc: 47112

haj@posteo.de (Harald Jörg) writes:

> The patch is a small change to eliminate the false detection of
> quote constructs by yet another special case, plus some tests.

Thanks; applied to Emacs 28.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no





^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2021-03-18  7:06 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2021-03-12 20:31 bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ E. Choroba
2021-03-13  0:27 ` bug#47112: Don't interpret '_' as word boundary [PATCH] Harald Jörg
2021-03-18  7:06   ` bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ Lars Ingebrigtsen

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).