unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#36227: 26.1; sgml-mode indents as comment inside of -- text blocks
@ 2019-06-15 18:02 Akkana Peck
  2019-06-15 23:59 ` Noam Postavsky
  0 siblings, 1 reply; 3+ messages in thread
From: Akkana Peck @ 2019-06-15 18:02 UTC (permalink / raw)
  To: 36227

In emacs -Q, edit a file in html-mode (derived from sgml-mode),
e.g. visit /tmp/foo.html.

M-x auto-fill-mode <RET>

Type some nonsense words with ' -- ' in the middle, like this:

asd shdf ladshjkl sjdk -- asdfh jklsdfh

and keep typing until it wraps.

sgml-mode will wrap adding unwanted extra dashes and indentations,
like this:

asd shdf ladshjkl sjdk -- asdfh jklsdfh jasdklf hjsdkl hjsdaklf sdf --
		       -- jkldsfhj kalshfjkasdlfh jaksd

This happens because sgml-mode is confused about what constitutes a
comment in sgml/html.

At some point, someone clearly agreed that this was a bug and tried to
fix it. If you look at the sgml.el source and search for --,
you'll see comments like:

> (defvar sgml-specials '(?\")
>   "List of characters that have a special meaning for SGML mode.
> This list is used when first loading the `sgml-mode' library.
> The supported characters and potential disadvantages are:
>
>   ?\\\"	Makes \" in text start a string.
>   ?\\='	Makes \\=' in text start a string.
>   ?-	Makes -- in text start a comment.
>
> When only one of ?\\\" or ?\\=' are included, \"\\='\" or \\='\"\\=', as can be found in
> DTDs, start a string.  To partially avoid this problem this also makes these
> self insert as named entities depending on `sgml-quick-keys'.
>
> Including ?- has the problem of affecting dashes that have nothing to do
> with comments, so we normally turn it off.")

The dash is not included in sgml-specials by default, but the problem
affecting dashes that have nothing to do with comments happens anyway.
Setting sgml-specials to nil in the mode hook doesn't help either.

The problematic behavior seems to come from comment-line-break-function.
One workaround I've found is to add this in my html-mode and sgml-mode
hooks to prevent the special processing of line breaks inside comments:

    (kill-local-variable 'comment-line-break-function)

Maybe comment-line-break-function should check sgml-specials and return
without doing anything if - isn't one of the specials?



In GNU Emacs 26.1 (build 2, x86_64-pc-linux-gnu, GTK+ Version 3.24.4)
 of 2019-02-03, modified by Debian built on zam904
Windowing system distributor 'The X.Org Foundation', version 11.0.12004000
System Description:	Debian GNU/Linux 10 (buster)

Recent messages:
For information about GNU Emacs and the GNU system, type C-h C-a.

Configured using:
 'configure --build x86_64-linux-gnu --prefix=/usr
 --sharedstatedir=/var/lib --libexecdir=/usr/lib
 --localstatedir=/var/lib --infodir=/usr/share/info
 --mandir=/usr/share/man --enable-libsystemd --with-pop=yes
 --enable-locallisppath=/etc/emacs:/usr/local/share/emacs/26.1/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/26.1/site-lisp:/usr/share/emacs/site-lisp
 --with-sound=alsa --without-gconf --with-mailutils --build
 x86_64-linux-gnu --prefix=/usr --sharedstatedir=/var/lib
 --libexecdir=/usr/lib --localstatedir=/var/lib
 --infodir=/usr/share/info --mandir=/usr/share/man --enable-libsystemd
 --with-pop=yes
 --enable-locallisppath=/etc/emacs:/usr/local/share/emacs/26.1/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/26.1/site-lisp:/usr/share/emacs/site-lisp
 --with-sound=alsa --without-gconf --with-mailutils --with-x=yes
 --with-x-toolkit=gtk3 --with-toolkit-scroll-bars 'CFLAGS=-g -O2
 -fdebug-prefix-map=/build/emacs-26.1+1=. -fstack-protector-strong
 -Wformat -Werror=format-security -Wall' 'CPPFLAGS=-Wdate-time
 -D_FORTIFY_SOURCE=2' LDFLAGS=-Wl,-z,relro'

Configured features:
XPM JPEG TIFF GIF PNG RSVG IMAGEMAGICK SOUND GPM DBUS GSETTINGS NOTIFY
ACL LIBSELINUX GNUTLS LIBXML2 FREETYPE M17N_FLT LIBOTF XFT ZLIB
TOOLKIT_SCROLL_BARS GTK3 X11 THREADS LIBSYSTEMD LCMS2

Important settings:
  value of $LC_COLLATE: C
  value of $LANG: en_US.UTF-8
  locale-coding-system: utf-8-unix

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  eldoc-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort mail-extr emacsbug message rmc puny seq byte-opt gv
bytecomp byte-compile cconv cl-loaddefs cl-lib dired dired-loaddefs
format-spec rfc822 mml easymenu mml-sec password-cache epa derived epg
epg-config gnus-util rmail rmail-loaddefs mm-decode mm-bodies mm-encode
mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047
rfc2045 ietf-drums mm-util mail-prsvr mail-utils elec-pair time-date
mule-util tooltip eldoc electric uniquify ediff-hook vc-hooks
lisp-float-type mwheel term/x-win x-win term/common-win x-dnd tool-bar
dnd fontset image regexp-opt fringe tabulated-list replace newcomment
text-mode elisp-mode lisp-mode prog-mode register page menu-bar
rfn-eshadow isearch timer select scroll-bar mouse jit-lock font-lock
syntax facemenu font-core term/tty-colors frame cl-generic cham georgian
utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean
japanese eucjp-ms cp51932 hebrew greek romanian slovak czech european
ethiopic indian cyrillic chinese composite charscript charprop
case-table epa-hook jka-cmpr-hook help simple abbrev obarray minibuffer
cl-preloaded nadvice loaddefs button faces cus-face macroexp files
text-properties overlay sha1 md5 base64 format env code-pages mule
custom widget hashtable-print-readable backquote dbusbind inotify lcms2
dynamic-setting system-font-setting font-render-setting move-toolbar gtk
x-toolkit x multi-tty make-network-process emacs)

Memory information:
((conses 16 95390 8152)
 (symbols 48 20395 1)
 (miscs 40 45 93)
 (strings 32 28323 1158)
 (string-bytes 1 740749)
 (vectors 16 14651)
 (vector-slots 8 497220 10982)
 (floats 8 49 68)
 (intervals 56 262 0)
 (buffers 992 11))





^ permalink raw reply	[flat|nested] 3+ messages in thread

* bug#36227: 26.1; sgml-mode indents as comment inside of -- text blocks
  2019-06-15 18:02 bug#36227: 26.1; sgml-mode indents as comment inside of -- text blocks Akkana Peck
@ 2019-06-15 23:59 ` Noam Postavsky
  2021-02-01  9:42   ` Lars Ingebrigtsen
  0 siblings, 1 reply; 3+ messages in thread
From: Noam Postavsky @ 2019-06-15 23:59 UTC (permalink / raw)
  To: Akkana Peck; +Cc: 36227

severity 36227 minor
tags 36227 + confirmed
quit

Akkana Peck <akkana@shallowsky.com> writes:

> In emacs -Q, edit a file in html-mode (derived from sgml-mode),
> e.g. visit /tmp/foo.html.
>
> M-x auto-fill-mode <RET>
>
> Type some nonsense words with ' -- ' in the middle, like this:
>
> asd shdf ladshjkl sjdk -- asdfh jklsdfh
>
> and keep typing until it wraps.
>
> sgml-mode will wrap adding unwanted extra dashes and indentations,
> like this:
>
> asd shdf ladshjkl sjdk -- asdfh jklsdfh jasdklf hjsdkl hjsdaklf sdf --
> 		       -- jkldsfhj kalshfjkasdlfh jaksd

> The problematic behavior seems to come from comment-line-break-function.
> One workaround I've found is to add this in my html-mode and sgml-mode
> hooks to prevent the special processing of line breaks inside comments:
>
>     (kill-local-variable 'comment-line-break-function)
>
> Maybe comment-line-break-function should check sgml-specials and return
> without doing anything if - isn't one of the specials?

It's not to do with the sgml-specials, the problem is that the standard
auto-fill-function, `do-auto-fill', calls `default-indent-new-line'
which calls `comment-line-break-function' regardless of whether point is
inside a comment or not.  As far as I can tell, it works in most modes
because they disable auto-filling outside of comments in the first
place.  sgml-mode doesn't, because most of its non-comment text is just
plain text that should be filled normally.

So I think there should be a sgml-mode specific
comment-line-break-function, which checks whether it's in a comment or
not (presumably using syntax-ppss).





^ permalink raw reply	[flat|nested] 3+ messages in thread

* bug#36227: 26.1; sgml-mode indents as comment inside of -- text blocks
  2019-06-15 23:59 ` Noam Postavsky
@ 2021-02-01  9:42   ` Lars Ingebrigtsen
  0 siblings, 0 replies; 3+ messages in thread
From: Lars Ingebrigtsen @ 2021-02-01  9:42 UTC (permalink / raw)
  To: Noam Postavsky; +Cc: 36227, Akkana Peck

Noam Postavsky <npostavs@gmail.com> writes:

> So I think there should be a sgml-mode specific
> comment-line-break-function, which checks whether it's in a comment or
> not (presumably using syntax-ppss).

I've now done this in Emacs 28, and it seems to work correctly in the
scenario described.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no





^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2021-02-01  9:42 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-06-15 18:02 bug#36227: 26.1; sgml-mode indents as comment inside of -- text blocks Akkana Peck
2019-06-15 23:59 ` Noam Postavsky
2021-02-01  9:42   ` Lars Ingebrigtsen

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).