* bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _
@ 2021-03-12 20:31 E. Choroba
2021-03-13 0:27 ` bug#47112: Don't interpret '_' as word boundary [PATCH] Harald Jörg
0 siblings, 1 reply; 3+ messages in thread
From: E. Choroba @ 2021-03-12 20:31 UTC (permalink / raw)
To: 47112
The quote-like operators q, qq, qr, qw, qx, tr, y, m, and s can use word
characters as quoting characters, but then they must be followed by
whitespace. In Perl, "_" is also a word character, but cperl-mode
doesn't follow this rule. E.g.
sub y_max {
is highlighted as if the y was the transliteration operator, but in
fact "y_max" is a subroutine name, as there's no space after "y".
Ch.
^ permalink raw reply [flat|nested] 3+ messages in thread
* bug#47112: Don't interpret '_' as word boundary [PATCH]
2021-03-12 20:31 bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ E. Choroba
@ 2021-03-13 0:27 ` Harald Jörg
2021-03-18 7:06 ` bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ Lars Ingebrigtsen
0 siblings, 1 reply; 3+ messages in thread
From: Harald Jörg @ 2021-03-13 0:27 UTC (permalink / raw)
To: 47112
[-- Attachment #1: Type: text/plain, Size: 697 bytes --]
This is a border case which affects several dark corners in CPerl mode.
Per default, the '_' character is not a word character, but a symbol
character in Perl.
The bogus regexp in `cperl-find-pods-heres' checks the quote-like things
for "ending in a word boundary". Per default there _is_ a word boundary
between 'y' and '_', and from here it goes downward.
It is wrong to change the search pattern to "symbol boundary" instead of
"word boundary": The ':' character is also a symbol in CPerl mode and
the regexp would then miss q:string: constructs.
The patch is a small change to eliminate the false detection of
quote constructs by yet another special case, plus some tests.
--
Cheers,
haj
[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: cperl-mode: No word boundary in "y_" --]
[-- Type: text/x-diff, Size: 3799 bytes --]
From c68e15dc4bfceb6e084645f987fc401edb87f7d9 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Harald=20J=C3=B6rg?= <haj@posteo.de>
Date: Sat, 13 Mar 2021 00:57:30 +0100
Subject: [PATCH] ; cperl-mode: Don't interpret y_ as start of y// function.
* lisp/progmodes/cperl-mode.el (cperl-find-pods-heres): Avoid
treating underscores as word-terminators.
* test/lisp/progmodes/cperl-mode-tests.el
(cperl-test-bug-47112): Test case for that bug.
---
lisp/progmodes/cperl-mode.el | 33 ++++++++++++++-----------
test/lisp/progmodes/cperl-mode-tests.el | 26 +++++++++++++++++++
2 files changed, 44 insertions(+), 15 deletions(-)
diff --git a/lisp/progmodes/cperl-mode.el b/lisp/progmodes/cperl-mode.el
index 734797b3ad..7e7327d986 100644
--- a/lisp/progmodes/cperl-mode.el
+++ b/lisp/progmodes/cperl-mode.el
@@ -3926,21 +3926,24 @@ cperl-find-pods-heres
bb (char-after (1- (match-beginning b1))) ; tmp holder
;; bb == "Not a stringy"
bb (if (eq b1 10) ; user variables/whatever
- (and (memq bb (append "$@%*#_:-&>" nil)) ; $#y)
- (cond ((eq bb ?-) (eq c ?s)) ; -s file test
- ((eq bb ?\:) ; $opt::s
- (eq (char-after
- (- (match-beginning b1) 2))
- ?\:))
- ((eq bb ?\>) ; $foo->s
- (eq (char-after
- (- (match-beginning b1) 2))
- ?\-))
- ((eq bb ?\&)
- (not (eq (char-after ; &&m/blah/
- (- (match-beginning b1) 2))
- ?\&)))
- (t t)))
+ (or
+ ; false positive: "y_" has no word boundary
+ (save-match-data (looking-at "_"))
+ (and (memq bb (append "$@%*#_:-&>" nil)) ; $#y)
+ (cond ((eq bb ?-) (eq c ?s)) ; -s file test
+ ((eq bb ?\:) ; $opt::s
+ (eq (char-after
+ (- (match-beginning b1) 2))
+ ?\:))
+ ((eq bb ?\>) ; $foo->s
+ (eq (char-after
+ (- (match-beginning b1) 2))
+ ?\-))
+ ((eq bb ?\&)
+ (not (eq (char-after ; &&m/blah/
+ (- (match-beginning b1) 2))
+ ?\&)))
+ (t t))))
;; <file> or <$file>
(and (eq c ?\<)
;; Do not stringify <FH>, <$fh> :
diff --git a/test/lisp/progmodes/cperl-mode-tests.el b/test/lisp/progmodes/cperl-mode-tests.el
index 61e4ece49b..f0e15022d0 100644
--- a/test/lisp/progmodes/cperl-mode-tests.el
+++ b/test/lisp/progmodes/cperl-mode-tests.el
@@ -447,4 +447,30 @@ cperl-test-bug-45255
;; The yadda-yadda operator should not be in a string.
(should (equal (nth 8 (cperl-test-ppss code "\\.")) nil))))
+(ert-deftest cperl-test-bug-47112 ()
+ "Check that in a bareword starting with a quote-like operator
+followed by an underscore is not interpreted as that quote-like
+operator. Also check that a quote-like operator followed by a
+colon (which is, like ?_, a symbol in CPerl mode) _is_ identified
+as that quote like operator."
+ (with-temp-buffer
+ (funcall cperl-test-mode)
+ (insert "sub y_max { q:bar:; y _bar_foo_; }")
+ (goto-char (point-min))
+ (cperl-update-syntaxification (point-max))
+ (font-lock-fontify-buffer)
+ (search-forward "max")
+ (should (equal (get-text-property (match-beginning 0) 'face)
+ 'font-lock-function-name-face))
+ (search-forward "bar")
+ (should (equal (get-text-property (match-beginning 0) 'face)
+ 'font-lock-string-face))
+ ; perl-mode doesn't highlight
+ (when (eq cperl-test-mode #'cperl-mode)
+ (search-forward "_")
+ (should (equal (get-text-property (match-beginning 0) 'face)
+ (if (eq cperl-test-mode #'cperl-mode)
+ 'font-lock-constant-face
+ font-lock-string-face))))))
+
;;; cperl-mode-tests.el ends here
--
2.20.1
^ permalink raw reply related [flat|nested] 3+ messages in thread
end of thread, other threads:[~2021-03-18 7:06 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2021-03-12 20:31 bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ E. Choroba
2021-03-13 0:27 ` bug#47112: Don't interpret '_' as word boundary [PATCH] Harald Jörg
2021-03-18 7:06 ` bug#47112: cperl-mode: Wrong highlighting of tr, y, q with _ Lars Ingebrigtsen
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.