unofficial mirror of bug-gnu-emacs@gnu.org 
* bug#53493: 28.0.50; eww bug
@ 2022-01-24  4:33 Richard Stallman
  2022-01-24  8:57 ` Andreas Schwab
  0 siblings, 1 reply; 6+ messages in thread
From: Richard Stallman @ 2022-01-24  4:33 UTC (permalink / raw)
  To: 53493


I reproduced this myself with eww.  The failure does not happen
with lynx, so I think the bug is in eww, not in stallman.org.

Date: Sat, 22 Jan 2022 13:59:48 -0500
From: jmg <jmg@fedora.email>
To: rms@gnu.org
Subject: Re: Emacs, stallman.org
In-Reply-To: <E1nB8Bu-0003HK-UI@fencepost.gnu.org>
Message-ID: <DA7D2CB9-900B-4439-B658-101ECD93CF08@fedora.email>
Content-Type: multipart/alternative;
 boundary=----JZ1TNHIZYP9YHUM1856Q0WMM0ZE64J



Sure. Also: I just consider you to be an emacs expert. The steps are as 
follows:
1. Run Emacs.
2. M-x eww
3. Type in "stallman.org" and press Enter.
4. Go to the non-political articles and try to view each of them.

On January 21, 2022 11:39:54 PM EST, Richard Stallman <rms@gnu.org> wrote:
>[[[ To any NSA and FBI agents reading my email: please consider    ]]]
>[[[ whether defending the US Constitution against all enemies,     ]]]
>[[[ foreign or domestic, requires you to follow Snowden's example. ]]]
>
>I have never used EWW.  I don't know how.  Also, I am not the one who
>needs to debug this.
>
>Could you possibly give me concrete instructions, for someone who
>has never used EWW, for how to see the problem?  I could try it, then
>pass them along to the helpers who would debug this and fix it.
>
>
>-- 
>Dr Richard Stallman (https://stallman.org)
>Chief GNUisance of the GNU Project (https://gnu.org)
>Founder, Free Software Foundation (https://fsf.org)
>Internet Hall-of-Famer (https://internethalloffame.org)
>
>
>

-- 
Non-reluctant practical critic



In GNU Emacs 28.0.50 (build 1, x86_64-pc-linux-gnu, GTK+ Version 2.24.32, cairo version 1.15.10)
 of 2021-05-12 built on freetop
Repository revision: 47070ed39eda524d334e5f82dc7f4a50b8d3252c
Repository branch: master
System Description: Trisquel GNU/Linux Etiona (9.0)

Configured using:
 'configure --with-gnutls=ifavailable 'CFLAGS=-g -O0''

Configured features:
CAIRO DBUS FREETYPE GIF GLIB GPM GSETTINGS HARFBUZZ JPEG LIBOTF
LIBXML2 M17N_FLT MODULES NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND
THREADS TIFF TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK2 ZLIB

Important settings:
  value of $LANG: en_US.UTF-8
  locale-coding-system: utf-8-unix

Major mode: RMAIL

Minor modes in effect:
  shell-dirtrack-mode: t
  gpm-mouse-mode: t
  tooltip-mode: t
  global-eldoc-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  buffer-read-only: t
  line-number-mode: t
  transient-mark-mode: t
  abbrev-mode: t

Load-path shadows:
None found.

Features:
(shadow emacsbug conf-mode tls gnutls network-stream url-http url-gw
nsm url-cache url-auth cal-move pcmpl-gnu make-mode rect two-column
add-log pcmpl-unix battery pp descr-text org-element avl-tree ol-eww
ol-rmail ol-mhe ol-irc ol-info ol-gnus nnselect gnus-search gnus-art
mm-uu mml2015 mm-view mml-smime smime dig gnus-sum gnus-group
gnus-undo gnus-start gnus-dbus dbus gnus-cloud nnimap nnmail
mail-source utf7 netrc nnoo gnus-spec gnus-int gnus-range gnus-win
ol-docview ol-bibtex bibtex ol-bbdb ol-w3m org ob ob-tangle ob-ref
ob-lob ob-table ob-exp org-macro org-footnote org-src ob-comint
org-pcomplete org-list org-faces org-entities org-version
ob-emacs-lisp ob-core ob-eval org-table ol org-keys org-compat
org-macs org-loaddefs format-spec cal-menu calendar cal-loaddefs
doc-view image-mode exif edmacro sort compare-w tmm time-stamp
goto-addr noutline outline vc-git vc-dispatcher bug-reference info
eieio-opt speedbar ezimage dframe shortdoc mode-local texinfo
texinfo-loaddefs ffap smerge-mode diff-mode easy-mmode diff unrmail
tabify man pulse cl-extra parse-time iso8601 vc-cvs warnings cl-print
debug backtrace find-func help-fns radix-tree mule-util etags fileloop
generator xref project jka-compr ispell files-x grep compile
mhtml-mode css-mode smie eww xdg url-queue mm-url gnus nnheader
wid-edit color js imenu cc-mode cc-fonts cc-guess cc-menus cc-cmds
cc-styles cc-align cc-engine cc-vars cc-defs sgml-mode facemenu kmacro
dabbrev quail help-mode epa-mail thingatpt misearch multi-isearch
rmailkwd shell pcomplete comint ansi-color ring rmailsum shr kinsoku
svg xml dom qp rmailout mailalias sendmail rmailmm message rmc puny
rfc822 mml mml-sec epa epg epg-config gnus-util text-property-search
time-date mm-decode mm-bodies mm-encode mailabbrev gmm-utils
mailheader mail-parse rfc2231 rmail rmail-loaddefs rfc2047 rfc2045
ietf-drums mm-util mail-prsvr mail-utils dired-aux dired
dired-loaddefs t-mouse term/linux view derived paren cus-load advice
finder-inf package browse-url url url-proxy url-privacy url-expand
url-methods url-history url-cookie url-domsuf url-util mailcap
url-handlers url-parse auth-source cl-seq eieio eieio-core cl-macs
eieio-loaddefs password-cache json subr-x map url-vars seq byte-opt gv
bytecomp byte-compile cconv cl-loaddefs cl-lib iso-transl tooltip
eldoc electric uniquify ediff-hook vc-hooks lisp-float-type mwheel
term/x-win x-win term/common-win x-dnd tool-bar dnd fontset image
regexp-opt fringe tabulated-list replace newcomment text-mode
elisp-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet
lao korean japanese eucjp-ms cp51932 hebrew greek romanian slovak
czech european ethiopic indian cyrillic chinese composite charscript
charprop case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget hashtable-print-readable backquote threads dbusbind
inotify dynamic-setting system-font-setting font-render-setting cairo
move-toolbar gtk x-toolkit x multi-tty make-network-process emacs)

Memory information:
((conses 16 1325532 189125)
 (symbols 48 40669 7)
 (strings 32 241261 26610)
 (string-bytes 1 6896576)
 (vectors 16 74573)
 (vector-slots 8 2057351 120142)
 (floats 8 461 493)
 (intervals 56 172170 4486)
 (buffers 992 212))
[[[ To any NSA and FBI agents reading my email: please consider    ]]]
[[[ whether defending the US Constitution against all enemies,     ]]]
[[[ foreign or domestic, requires you to follow Snowden's example. ]]]


-- 
Dr Richard Stallman (https://stallman.org)
Chief GNUisance of the GNU Project (https://gnu.org)
Founder, Free Software Foundation (https://fsf.org)
Internet Hall-of-Famer (https://internethalloffame.org)








* bug#53493: 28.0.50; eww bug
  2022-01-24  4:33 bug#53493: 28.0.50; eww bug Richard Stallman
@ 2022-01-24  8:57 ` Andreas Schwab
  2022-01-24  9:59   ` Lars Ingebrigtsen
  0 siblings, 1 reply; 6+ messages in thread
From: Andreas Schwab @ 2022-01-24  8:57 UTC (permalink / raw)
  To: Richard Stallman; +Cc: 53493

On Jan 23 2022, Richard Stallman wrote:

> I reproduced this myself with eww.  The failure does not happen
> with lynx, so I think the bug is in eww, not in stallman.org.

The site is broken:

    <p>
    <a href="articles/made-for-you.html">Made for You    <p>

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 7578 EB47 D4E5 4D69 2510  2552 DF73 E780 A9DA AEC1
"And now for something completely different."






* bug#53493: 28.0.50; eww bug
  2022-01-24  8:57 ` Andreas Schwab
@ 2022-01-24  9:59   ` Lars Ingebrigtsen
  2022-01-25  4:16     ` Richard Stallman
  0 siblings, 1 reply; 6+ messages in thread
From: Lars Ingebrigtsen @ 2022-01-24  9:59 UTC (permalink / raw)
  To: Andreas Schwab; +Cc: Richard Stallman, 53493

Andreas Schwab <schwab@linux-m68k.org> writes:

> On Jan 23 2022, Richard Stallman wrote:
>
>> I reproduced this myself with eww.  The failure does not happen
>> with lynx, so I think the bug is in eww, not in stallman.org.
>
> The site is broken:
>
>     <p>
>     <a href="articles/made-for-you.html">Made for You    <p>

So libxml2 parses the rest of the page as belonging in that <a
href... (which is correct, but pedantic).  So there's not anything eww
can do here to fix this, and I'm closing this bug report.
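
The effect described above can be illustrated with a small sketch.  It
uses Python's standard html.parser rather than libxml2, so it only
tracks whether an <a> is still open and does not reproduce libxml2's
tree building.  The names AnchorTracker and text_with_link_state, the
"Later paragraph text" line, and the FIXED variant with an added </a>
are assumptions made for illustration; the thread does not show the
rest of the page or the actual fix.

```python
from html.parser import HTMLParser


class AnchorTracker(HTMLParser):
    """Record each text chunk and whether an <a> was still open when it appeared."""

    def __init__(self):
        super().__init__()
        self.open_anchors = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.open_anchors += 1

    def handle_endtag(self, tag):
        # Only an explicit </a> closes the link in this strict model.
        if tag == "a" and self.open_anchors > 0:
            self.open_anchors -= 1

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append((text, self.open_anchors > 0))


def text_with_link_state(html):
    """Return a list of (text, inside_link) pairs for the given markup."""
    parser = AnchorTracker()
    parser.feed(html)
    parser.close()
    return parser.chunks


# Markup quoted in the thread, followed by a hypothetical later paragraph.
BROKEN = (
    '<p>\n'
    '<a href="articles/made-for-you.html">Made for You    <p>\n'
    'Later paragraph text\n'
)
# Assumed repair: close the anchor explicitly.
FIXED = (
    '<p>\n'
    '<a href="articles/made-for-you.html">Made for You</a>    <p>\n'
    'Later paragraph text\n'
)

if __name__ == "__main__":
    print(text_with_link_state(BROKEN))
    print(text_with_link_state(FIXED))
```

With the unclosed anchor, every later text chunk is reported as being
inside the link; with the closing tag added, only the link text is.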

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no






* bug#53493: 28.0.50; eww bug
  2022-01-24  9:59   ` Lars Ingebrigtsen
@ 2022-01-25  4:16     ` Richard Stallman
  2022-01-25 12:23       ` Lars Ingebrigtsen
  0 siblings, 1 reply; 6+ messages in thread
From: Richard Stallman @ 2022-01-25  4:16 UTC (permalink / raw)
  To: Lars Ingebrigtsen; +Cc: schwab, 53493


  > > The site is broken:
  > >
  > >     <p>
  > >     <a href="articles/made-for-you.html">Made for You    <p>

Thanks for figuring that out.  I fixed that page.  But,

  > So libxml2 parses the rest of the page as belonging in that <a
  > href... (which is correct, but pedantic).  So there's not anything eww
  > can do here to fix this, and I'm closing this bug report.

why does Lynx handle the page correctly?
Wouldn't it be better for eww to do what Lynx does?

-- 
Dr Richard Stallman (https://stallman.org)
Chief GNUisance of the GNU Project (https://gnu.org)
Founder, Free Software Foundation (https://fsf.org)
Internet Hall-of-Famer (https://internethalloffame.org)








* bug#53493: 28.0.50; eww bug
  2022-01-25  4:16     ` Richard Stallman
@ 2022-01-25 12:23       ` Lars Ingebrigtsen
  2022-01-26  3:39         ` Richard Stallman
  0 siblings, 1 reply; 6+ messages in thread
From: Lars Ingebrigtsen @ 2022-01-25 12:23 UTC (permalink / raw)
  To: Richard Stallman; +Cc: schwab, 53493

Richard Stallman <rms@gnu.org> writes:

> why does Lynx handle the page correctly?
> Wouldn't it be better for eww to do what Lynx does?

Lynx uses a different HTML parser.  Emacs uses libxml2 to parse HTML,
and it parses HTML according to the standards, while Lynx has a more
DWIM parser.
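
To make the contrast concrete, here is a rough sketch of one possible
DWIM heuristic: treat the start of a block-level element as implicitly
ending any open link.  This is an assumption for illustration only; it
is not Lynx's actual algorithm and not something libxml2 offers.  It
uses Python's standard html.parser, and the names LenientLinkTracker,
BLOCK_TAGS, linked_text, and the "Later paragraph text" line are
hypothetical.

```python
from html.parser import HTMLParser


class LenientLinkTracker(HTMLParser):
    """Collect link text, optionally auto-closing <a> at block-level tags."""

    # Hypothetical set of tags that end an open link in lenient mode.
    BLOCK_TAGS = {"p", "div", "ul", "ol", "li", "table", "h1", "h2", "h3"}

    def __init__(self, auto_close):
        super().__init__()
        self.auto_close = auto_close
        self.in_link = False
        self.linked = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.in_link = True
        elif self.auto_close and tag in self.BLOCK_TAGS:
            # DWIM guess: the author forgot </a> before this block.
            self.in_link = False

    def handle_endtag(self, tag):
        if tag == "a":
            self.in_link = False

    def handle_data(self, data):
        text = data.strip()
        if text and self.in_link:
            self.linked.append(text)


def linked_text(html, auto_close):
    """Return the text chunks that end up inside a link."""
    parser = LenientLinkTracker(auto_close)
    parser.feed(html)
    parser.close()
    return parser.linked


# Markup quoted in the thread, followed by a hypothetical later paragraph.
PAGE = (
    '<p>\n'
    '<a href="articles/made-for-you.html">Made for You    <p>\n'
    'Later paragraph text\n'
)

if __name__ == "__main__":
    print(linked_text(PAGE, auto_close=False))  # strict: link never ends
    print(linked_text(PAGE, auto_close=True))   # lenient: <p> ends the link
```

The trade-off is that such a guess can also be wrong, for instance on
pages that deliberately wrap block content in a link.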

If you want Emacs to handle this differently, you'll have to file a bug
report with the libxml2 developers.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no






* bug#53493: 28.0.50; eww bug
  2022-01-25 12:23       ` Lars Ingebrigtsen
@ 2022-01-26  3:39         ` Richard Stallman
  0 siblings, 0 replies; 6+ messages in thread
From: Richard Stallman @ 2022-01-26  3:39 UTC (permalink / raw)
  To: Lars Ingebrigtsen; +Cc: schwab, 53493


  > Lynx uses a different HTML parser.  Emacs uses libxml2 to parse HTML,
  > and it parses HTML according to the standards, while Lynx has a more
  > DWIM parser.

The DWIM approach is much better for users.  I wish Emacs used it.
Rigidly following a standard is not the GNU way.

But since it would be a lot of work to change that, I won't push for it.

  > If you want Emacs to handle this differently, you'll have to file a bug
  > report with the libxml2 developers.

Thanks.  (I suppose they would say it's too much trouble.)

-- 
Dr Richard Stallman (https://stallman.org)
Chief GNUisance of the GNU Project (https://gnu.org)
Founder, Free Software Foundation (https://fsf.org)
Internet Hall-of-Famer (https://internethalloffame.org)






