* bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
@ 2024-09-12 17:53 John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
2024-09-13 7:43 ` Robert Pluim
0 siblings, 1 reply; 6+ messages in thread
From: John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors @ 2024-09-12 17:53 UTC (permalink / raw)
To: 73206
[-- Attachment #1: Type: text/plain, Size: 4991 bytes --]
1. Create foo.xml containing:
<foo>
<!--
foo
bar
-->
</foo>
Notice that we have a blank line in the xml comment.
2. Put the point on 'bar' and press TAB.
Notice that the 'bar' line is not indented correctly.
Solution
--------
The problem is that in nxml-compute-indent-in-delimited-token, there
is a (forward-line -1) which assumes that the prior line is indented. In
this case the prior line is a blank line and thus doesn't have
indentation. After the (forward-line -1), we need to keep going back
until we see a non-blank line:
(forward-line -1)
+ (while (looking-at "^[:blank:]*$")
+ (forward-line -1))
Attached is a patch for nxml-mode.el that fixes the issue.
Thanks
John
In GNU Emacs 28.2 (build 1, x86_64-pc-linux-gnu, GTK+ Version 3.24.38, cairo version 1.16.0)
of 2024-06-25, modified by Debian built on x86-conova-01
Windowing system distributor 'The X.Org Foundation', version 11.0.12101006
System Description: Debian GNU/Linux 12 (bookworm)
Configured using:
'configure --build x86_64-linux-gnu --prefix=/usr
--sharedstatedir=/var/lib --libexecdir=/usr/libexec
--localstatedir=/var/lib --infodir=/usr/share/info
--mandir=/usr/share/man --with-libsystemd --with-pop=yes
--enable-locallisppath=/etc/emacs:/usr/local/share/emacs/28.2/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/28.2/site-lisp:/usr/share/emacs/site-lisp
--with-sound=alsa --without-gconf --with-mailutils
--with-native-compilation --build x86_64-linux-gnu --prefix=/usr
--sharedstatedir=/var/lib --libexecdir=/usr/libexec
--localstatedir=/var/lib --infodir=/usr/share/info
--mandir=/usr/share/man --with-libsystemd --with-pop=yes
--enable-locallisppath=/etc/emacs:/usr/local/share/emacs/28.2/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/28.2/site-lisp:/usr/share/emacs/site-lisp
--with-sound=alsa --without-gconf --with-mailutils
--with-native-compilation --with-cairo --with-x=yes
--with-x-toolkit=gtk3 --with-toolkit-scroll-bars 'CFLAGS=-g -O2
-ffile-prefix-map=/build/reproducible-path/emacs-28.2+1=. -fstack-protector-strong
-Wformat -Werror=format-security -Wall' 'CPPFLAGS=-Wdate-time
-D_FORTIFY_SOURCE=2' LDFLAGS=-Wl,-z,relro'
Configured features:
ACL CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GPM GSETTINGS HARFBUZZ JPEG
JSON LCMS2 LIBOTF LIBSELINUX LIBSYSTEMD LIBXML2 M17N_FLT MODULES
NATIVE_COMP NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND THREADS TIFF
TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK3 ZLIB
Important settings:
value of $LANG: en_US.UTF-8
locale-coding-system: utf-8-unix
Major mode: Lisp Interaction
Minor modes in effect:
tooltip-mode: t
global-eldoc-mode: t
eldoc-mode: t
show-paren-mode: t
electric-indent-mode: t
mouse-wheel-mode: t
tool-bar-mode: t
menu-bar-mode: t
file-name-shadow-mode: t
global-font-lock-mode: t
font-lock-mode: t
blink-cursor-mode: t
auto-composition-mode: t
auto-encryption-mode: t
auto-compression-mode: t
line-number-mode: t
indent-tabs-mode: t
transient-mark-mode: t
Load-path shadows:
None found.
Features:
(shadow sort mail-extr emacsbug message rmc puny dired dired-loaddefs
rfc822 mml mml-sec epa derived epg rfc6068 epg-config gnus-util rmail
rmail-loaddefs auth-source cl-seq eieio eieio-core cl-macs
eieio-loaddefs password-cache json map text-property-search time-date
subr-x seq byte-opt gv bytecomp byte-compile cconv mm-decode mm-bodies
mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader cl-loaddefs
cl-lib sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils
iso-transl tooltip eldoc paren electric uniquify ediff-hook vc-hooks
lisp-float-type elisp-mode mwheel term/x-win x-win term/common-win x-dnd
tool-bar dnd fontset image regexp-opt fringe tabulated-list replace
newcomment text-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite emoji-zwj charscript
charprop case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget hashtable-print-readable backquote threads dbusbind
inotify lcms2 dynamic-setting system-font-setting font-render-setting
cairo move-toolbar gtk x-toolkit x multi-tty make-network-process
native-compile emacs)
Memory information:
((conses 16 68059 5432)
(symbols 48 6639 0)
(strings 32 19490 1559)
(string-bytes 1 664189)
(vectors 16 13926)
(vector-slots 8 297636 11873)
(floats 8 22 31)
(intervals 56 258 0)
(buffers 992 10))
[-- Attachment #2: nxml-mode.el.patch --]
[-- Type: text/x-diff, Size: 399 bytes --]
--- ./orig/nxml-mode.el 2024-09-12 09:10:46.580451579 -0400
+++ nxml-mode.el 2024-09-12 09:04:48.473571128 -0400
@@ -1522,6 +1522,8 @@
((progn
(goto-char pos)
(forward-line -1)
+ (while (looking-at "^[:blank:]*$")
+ (forward-line -1))
(<= (point) xmltok-start))
(goto-char (+ xmltok-start (length open-delim)))
(when (and (string= open-delim "<!--")
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
2024-09-12 17:53 bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
@ 2024-09-13 7:43 ` Robert Pluim
2024-09-15 12:08 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
0 siblings, 1 reply; 6+ messages in thread
From: Robert Pluim @ 2024-09-13 7:43 UTC (permalink / raw)
To: 73206; +Cc: ciolfi
>>>>> On Thu, 12 Sep 2024 13:53:26 -0400, John Ciolfi via "Bug reports for GNU Emacs, the Swiss army knife of text editors" <bug-gnu-emacs@gnu.org> said:
John> 1. Create foo.xml containing:
John> <foo>
John> <!--
John> foo
John> bar
-->
John> </foo>
John> Notice that we have a blank line in the xml comment.
John> 2. Put the point on 'bar' and press TAB.
John> Notice that the 'bar' line is not indented correctly.
John> Solution
John> --------
John> The problem is that in nxml-compute-indent-in-delimited-token, there
John> is a (forward-line -1) which assumes that the prior line is indented. In
John> this case the prior line is a blank line and thus doesn't have
John> indentation. After the (forward-line -1), we need to keep going back
John> until we see a non-blank line:
John> (forward-line -1)
John> + (while (looking-at "^[:blank:]*$")
John> + (forward-line -1))
John> Attached is a patch for nxml-mode.el that fixes the issue.
Your patch does the right thing, but for the wrong reasons. Regexp
character classes only have meaning with a bracket expression, so what
youʼre matching here is actually any lines containing zero or more of
':', 'b', 'l', 'a', 'n', or 'k'. IOW, you should use:
(looking-at "^[[:blank:]]*$")
Robert
--
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
2024-09-13 7:43 ` Robert Pluim
@ 2024-09-15 12:08 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
2024-09-28 8:45 ` Eli Zaretskii
0 siblings, 1 reply; 6+ messages in thread
From: John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors @ 2024-09-15 12:08 UTC (permalink / raw)
To: Robert Pluim, 73206@debbugs.gnu.org
[-- Attachment #1: Type: text/plain, Size: 1797 bytes --]
Thanks for catching my typo and for moving the fix along.
________________________________
From: Robert Pluim <rpluim@gmail.com>
Sent: Friday, September 13, 2024 3:43 AM
To: 73206@debbugs.gnu.org <73206@debbugs.gnu.org>
Cc: John Ciolfi <ciolfi@mathworks.com>
Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
>>>>> On Thu, 12 Sep 2024 13:53:26 -0400, John Ciolfi via "Bug reports for GNU Emacs, the Swiss army knife of text editors" <bug-gnu-emacs@gnu.org> said:
John> 1. Create foo.xml containing:
John> <foo>
John> <!--
John> foo
John> bar
-->
John> </foo>
John> Notice that we have a blank line in the xml comment.
John> 2. Put the point on 'bar' and press TAB.
John> Notice that the 'bar' line is not indented correctly.
John> Solution
John> --------
John> The problem is that in nxml-compute-indent-in-delimited-token, there
John> is a (forward-line -1) which assumes that the prior line is indented. In
John> this case the prior line is a blank line and thus doesn't have
John> indentation. After the (forward-line -1), we need to keep going back
John> until we see a non-blank line:
John> (forward-line -1)
John> + (while (looking-at "^[:blank:]*$")
John> + (forward-line -1))
John> Attached is a patch for nxml-mode.el that fixes the issue.
Your patch does the right thing, but for the wrong reasons. Regexp
character classes only have meaning with a bracket expression, so what
you’re matching here is actually any lines containing zero or more of
':', 'b', 'l', 'a', 'n', or 'k'. IOW, you should use:
(looking-at "^[[:blank:]]*$")
Robert
--
[-- Attachment #2: Type: text/html, Size: 3436 bytes --]
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
2024-09-15 12:08 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
@ 2024-09-28 8:45 ` Eli Zaretskii
2024-09-29 20:47 ` Stefan Kangas
0 siblings, 1 reply; 6+ messages in thread
From: Eli Zaretskii @ 2024-09-28 8:45 UTC (permalink / raw)
To: John Ciolfi, Stefan Kangas; +Cc: 73206, rpluim
Stefan, does the patch with the regexp fix look correct to you?
> Date: Sun, 15 Sep 2024 12:08:17 +0000
> msip_labels:
> From: John Ciolfi via "Bug reports for GNU Emacs,
> the Swiss army knife of text editors" <bug-gnu-emacs@gnu.org>
>
> Thanks for catching my typo and for moving the fix along.
> -----------------------------------------------------------------------------
> From: Robert Pluim <rpluim@gmail.com>
> Sent: Friday, September 13, 2024 3:43 AM
> To: 73206@debbugs.gnu.org <73206@debbugs.gnu.org>
> Cc: John Ciolfi <ciolfi@mathworks.com>
> Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
>
> >>>>> On Thu, 12 Sep 2024 13:53:26 -0400, John Ciolfi via "Bug reports for GNU Emacs, the Swiss army
> knife of text editors" <bug-gnu-emacs@gnu.org> said:
>
> John> 1. Create foo.xml containing:
>
> John> <foo>
> John> <!--
> John> foo
>
> John> bar
> -->
> John> </foo>
>
> John> Notice that we have a blank line in the xml comment.
>
> John> 2. Put the point on 'bar' and press TAB.
>
> John> Notice that the 'bar' line is not indented correctly.
>
> John> Solution
> John> --------
>
> John> The problem is that in nxml-compute-indent-in-delimited-token, there
> John> is a (forward-line -1) which assumes that the prior line is indented. In
> John> this case the prior line is a blank line and thus doesn't have
> John> indentation. After the (forward-line -1), we need to keep going back
> John> until we see a non-blank line:
>
> John> (forward-line -1)
> John> + (while (looking-at "^[:blank:]*$")
> John> + (forward-line -1))
>
> John> Attached is a patch for nxml-mode.el that fixes the issue.
>
> Your patch does the right thing, but for the wrong reasons. Regexp
> character classes only have meaning with a bracket expression, so what
> you’re matching here is actually any lines containing zero or more of
> ':', 'b', 'l', 'a', 'n', or 'k'. IOW, you should use:
>
> (looking-at "^[[:blank:]]*$")
>
> Robert
> --
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
2024-09-28 8:45 ` Eli Zaretskii
@ 2024-09-29 20:47 ` Stefan Kangas
2024-10-02 11:59 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
0 siblings, 1 reply; 6+ messages in thread
From: Stefan Kangas @ 2024-09-29 20:47 UTC (permalink / raw)
To: Eli Zaretskii, John Ciolfi; +Cc: 73206, rpluim
Eli Zaretskii <eliz@gnu.org> writes:
> Stefan, does the patch with the regexp fix look correct to you?
If we want to "keep going back until we see a non-blank line", surely
the fragment should read:
(while (looking-at "^$")
(forward-line -1))
Since
(looking-at "^[[:blank:]]*$")
will match both blank lines, and lines containing only blank space.
Which of the two do we want here?
I think it would also be good to add one or more tests here.
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
2024-09-29 20:47 ` Stefan Kangas
@ 2024-10-02 11:59 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
0 siblings, 0 replies; 6+ messages in thread
From: John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors @ 2024-10-02 11:59 UTC (permalink / raw)
To: Stefan Kangas, Eli Zaretskii; +Cc: 73206@debbugs.gnu.org, rpluim@gmail.com
[-- Attachment #1.1: Type: text/plain, Size: 2000 bytes --]
Hi
Using "^$" or "^[[:blank:]]*$" is fine. The difference is very minor. Consider foo.xml where the visibly "empty" line contains a single space and foo starts on the 1st column:
<foo>
<!--
bar
foo
-->
</foo>
When using "^$" and you type tab on the foo line, you'll get the following where foo starts on the 2nd column:
<foo>
<!--
bar
foo
-->
</foo>
If you use "^[[:blank:]]*$, you'll get what I expected:
<foo>
<!--
bar
foo
-->
</foo>
However, if you select all and indent-region, C-M-\ on the original you'll get the expected result with either "^$" or "^[[:blank:]]*$ because nxml-mode will pad out the space line. If the "empty" line truly blank (no spaces or tabs), then the two regex's behave identical.
I suggest for test cases, two versions of foo.xml where one version of it has the empty line truly blank (no spaces or tabs) and the other version contains a space in the "empty" line.
You can use the attached nxml-mode-indent-fix.el which overrides the broken function to try things out on a stock Emacs, emacs -Q.
Thanks
John
________________________________
From: Stefan Kangas <stefankangas@gmail.com>
Sent: Sunday, September 29, 2024 4:47 PM
To: Eli Zaretskii <eliz@gnu.org>; John Ciolfi <ciolfi@mathworks.com>
Cc: rpluim@gmail.com <rpluim@gmail.com>; 73206@debbugs.gnu.org <73206@debbugs.gnu.org>
Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
Eli Zaretskii <eliz@gnu.org> writes:
> Stefan, does the patch with the regexp fix look correct to you?
If we want to "keep going back until we see a non-blank line", surely
the fragment should read:
(while (looking-at "^$")
(forward-line -1))
Since
(looking-at "^[[:blank:]]*$")
will match both blank lines, and lines containing only blank space.
Which of the two do we want here?
I think it would also be good to add one or more tests here.
[-- Attachment #1.2: Type: text/html, Size: 9348 bytes --]
[-- Attachment #2: nxml-mode-indent-fix.el --]
[-- Type: text/x-emacs-lisp, Size: 1327 bytes --]
(defvar xmltok-start) ;; from xmltok.el
(defun nxml-compute-indent-in-delimited-token-override (pos open-delim close-delim)
"Return the indent for a line that start inside a token with delimiters.
OPEN-DELIM and CLOSE-DELIM are strings giving the opening and closing
delimiters. POS is the position of the first non-whitespace character
of the line. This expects the xmltok-* variables to be set up as by
`xmltok-forward'.
Fix: https://debbugs.gnu.org/cgi/bugreport.cgi?bug=73206"
(cond ((let ((end (+ pos (length close-delim))))
(and (<= end (point-max))
(string= (buffer-substring-no-properties pos end)
close-delim)))
(goto-char xmltok-start))
((progn
(goto-char pos)
(forward-line -1)
(while (looking-at "^[[:blank:]]*$")
(forward-line -1))
(<= (point) xmltok-start))
(goto-char (+ xmltok-start (length open-delim)))
(when (and (string= open-delim "<!--")
(looking-at " "))
(goto-char (1+ (point)))))
(t (back-to-indentation)))
(current-column))
(when (version<= emacs-version "29")
;; Assume https://debbugs.gnu.org/cgi/bugreport.cgi?bug=73206 will be fixed in Emacs 30
(eval-after-load 'nxml-mode
'(advice-add 'nxml-compute-indent-in-delimited-token
:override
#'nxml-compute-indent-in-delimited-token-override)))
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2024-10-02 11:59 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-09-12 17:53 bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
2024-09-13 7:43 ` Robert Pluim
2024-09-15 12:08 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
2024-09-28 8:45 ` Eli Zaretskii
2024-09-29 20:47 ` Stefan Kangas
2024-10-02 11:59 ` John Ciolfi via Bug reports for GNU Emacs, the Swiss army knife of text editors
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/emacs.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).