From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: =?UTF-8?Q?=E7=A9=8D=E4=B8=B9=E5=B0=BC?= Dan Jacobson Newsgroups: gmane.emacs.bugs Subject: bug#24831: shr mangling messages Date: Mon, 31 Oct 2016 10:45:58 +0800 Message-ID: <87shrd6xsp.fsf_-_@jidanni.org> References: <87shrgvt8y.fsf@jidanni.org> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="=-=-=" X-Trace: blaine.gmane.org 1477882038 1367 195.159.176.226 (31 Oct 2016 02:47:18 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Mon, 31 Oct 2016 02:47:18 +0000 (UTC) Cc: Katsumi Yamaoka To: 24831@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Mon Oct 31 03:47:13 2016 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1c12cy-0007sk-DP for geb-bug-gnu-emacs@m.gmane.org; Mon, 31 Oct 2016 03:47:12 +0100 Original-Received: from localhost ([::1]:32782 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1c12d1-0002qf-22 for geb-bug-gnu-emacs@m.gmane.org; Sun, 30 Oct 2016 22:47:15 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:43431) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1c12cu-0002qZ-9Z for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:47:09 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1c12co-0002rX-SP for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:47:08 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:50364) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1c12co-0002rP-OZ for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:47:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1c12co-0002cm-GT for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:47:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: =?UTF-8?Q?=E7=A9=8D=E4=B8=B9=E5=B0=BC?= Dan Jacobson Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Mon, 31 Oct 2016 02:47:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 24831 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: X-Debbugs-Original-To: bug-gnu-emacs Original-Received: via spool by submit@debbugs.gnu.org id=B.147788198010028 (code B ref -1); Mon, 31 Oct 2016 02:47:02 +0000 Original-Received: (at submit) by debbugs.gnu.org; 31 Oct 2016 02:46:20 +0000 Original-Received: from localhost ([127.0.0.1]:37530 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1c12c8-0002bg-3r for submit@debbugs.gnu.org; Sun, 30 Oct 2016 22:46:20 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:52391) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1c12c6-0002bT-6S for submit@debbugs.gnu.org; Sun, 30 Oct 2016 22:46:18 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1c12c0-0002Ul-3l for submit@debbugs.gnu.org; Sun, 30 Oct 2016 22:46:13 -0400 Original-Received: from lists.gnu.org ([2001:4830:134:3::11]:37662) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1c12c0-0002UF-0g for submit@debbugs.gnu.org; Sun, 30 Oct 2016 22:46:12 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:43358) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1c12by-0002SB-DR for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:46:11 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1c12bv-0002Q1-9T for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:46:10 -0400 Original-Received: from homie.mail.dreamhost.com ([208.97.132.208]:40663 helo=homiemail-a2.g.dreamhost.com) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1c12bu-0002NC-SZ for bug-gnu-emacs@gnu.org; Sun, 30 Oct 2016 22:46:07 -0400 Original-Received: from homiemail-a2.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a2.g.dreamhost.com (Postfix) with ESMTP id C2111280070; Sun, 30 Oct 2016 19:46:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=jidanni.org; h=from:to:cc :subject:references:date:message-id:mime-version:content-type; s=jidanni.org; bh=4grULzmgRmXv2spZXIQudNHGHdc=; b=T/UY0mwpl+ryc j5GC1yyVXDvkUEJuvt/QIxz/qpWsmw31pHcFmCHgrn6WjzK1b0Q7Iy/Zm1lOWK6u QA6qwRYsTqLOVApgwU4D7+lqke60CUN9Zy5eRe5PJwQPjHdjcOS3SlsW9cPiZEXU IhCEOQQSw7AkiVY7t2K323INV65XT8= Original-Received: from jidanni.org (122-118-148-81.dynamic.hinet.net [122.118.148.81]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: jidanni@jidanni.org) by homiemail-a2.g.dreamhost.com (Postfix) with ESMTPSA id EA8D0280062; Sun, 30 Oct 2016 19:46:02 -0700 (PDT) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:125171 Archived-At: --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Gentelmen, the "shr" program is mangling messages. It could remove vital words, causing arguments: "I did include the address!" "No you didn't." "Yes I did. Your mail reader probably cut it out." We're talking data loss here. It may still be on the disk, but not shown to the user. True, the HTML might not be perfect, but at least Chromium, Firefox, etc. show it fine. >>>>> "KY" =3D=3D Katsumi Yamaoka writes: KY> Emacs-w3m renders it as: KY> http://w = =20 KY> Hi, you have a new email from Catherineme = =20 KY> [25] = =20 KY> View your inbox at http://www.travel-buddies.com/Inbox.aspx= =20 = =20 KY> =C2=A9 Travel Buddies 2015 | All rights reserved = =20 Hmmm, w3m -dump on the attachment shows the first URL in full. KY> However shr renders it as: KY> Travel Buddies =20 KY> =C2=A9 Travel Buddies 2015 | All rights reserved =20 KY> http://www.travel-buddies.com/=20 KY> *=20 KY> There lacks the "Hi, you have a new mail" message. The return KY> value of `libxml-parse-html-region' contains the message as KY> (h1 nil (span nil "Hi, you have a new email from") "Catherineme") KY> (p nil "View your inbox at " KY> (a ((href . "http://www.travel-buddies.com/Inbox.aspx")) KY> "http://www.travel-buddies.com/Inbox.aspx")) KY> regardless of whether all style specs are removed[1] or not KY> (three nil portions above are replaced with style specs if they KY> are not removed). So, style specs are not cause of not KY> displaying some meaningful message in an html mail, I believe. KY> In that case, making shr display images does not help. KY> I think there's something wrong in shr.el, and what you should KY> do would be to send a bug report to the Emacs bug team, i.e., KY> M-x report-emacs-bug, with the sample html part (I'm not so KY> familiar with recent shr, sorry). Note that a mail containing KY> html part might be rejected by the server, so putting it in your KY> web site separately would be better. KY> [1] I tested it by modifying mm-shr so as to remove style specs. OK I'll send the message, --=-=-= Content-Type: application/gzip Content-Disposition: attachment; filename=SHRcutOFFmessage.gz Content-Transfer-Encoding: base64 Content-Description: message H4sICCX/E1gAA1NIUmN1dE9GRm1lc3NhZ2UApVdtb+JIEv48/hUtRneT3GHT7RgMBqNhgNlFE5Io 5KJdjVaRsRvcM7bb224msFrdb78qA8Fkkkx2L1YQrq7uquept+ajkinJ5FxGGzILNLkMNbE7hHY8 p+nRDrEpaxnXXK9UZl4FOvZIL5OK58nmvVbBN56Y81UUCV5YoUz7oBly8Y1HHlngwYkMgySWhSYn zHYtCg/zGGXO2SmZb8gXEQVZJs6s3RdLqiW5Fzo2CJlMB1dOl9jNrUvgBmEtr0k95hCTwp/xiznL g9Qcxjz8ypV5y1UhZOYRlA6KIigKkREHjZqKtTpO02mats3a5AROa5q0bdLWKZGZ8SYvGKPUtSLF gxTdzUs4ewvnHHB65F/795kO9KrwyIWskyIENnxmtYniv6+E4hG8dIjmhS78n2+m53cA5Kfx3eXF +a93Nq0bb0rhdDybgbg+nUzHd6UEFerXw9vR3eTibjq7mnyCBbtOrv9zPp7djq9nk8sLn7mtlt20 HxwLsuUqWHLwBUSXSixFFiTmjfT21LL3z2Eb8QQCBf6+Tv1RYGOZCp4GIjHTtW0tD+qoTE5QWKpY qPNo9bNN21bHtdiZbdmM/nZqvIFceMYwObmC7wuxPi0Tg4xn05srIiIydNxBxx6yFqUf2dB4s5CK 9H4Eo98lNyteP8oq2vIc17PbkFUupWBvdHP6GC6i4OsSDMbbSjlg/F72ueMAqpblQJo7rR2wH1D1 DLz22BmOBjalg8ExtveVWvm7cAquIPQ2c03mOqbtUKgLamE+mNulwso41OxnULFAxQIVC1R+IzFP pD+a3V5TcM1tbREWqc6pc8RECYbjQkBOxmuRQiG2gRXYcMIzKCeZc7P05aV2AtpABZv/YTsXWPMf ODc/7mL9BB1PsMEcZIMx8m8K+WBguR0aBbOo8RF88EjtpjROPmyN14yXu1ylYqr2jVGgufeU/dbO /mw1/8JD7ZFf5YrEcDAJSMbvScqhXy25MZSZ5pk2bzY5HKT5WjdinSZdEsaBKrj2V4UZFKEQB00V ZMUC2t84C2UksqVHfl9JDWWdK5HpYJ5waA3TXWjMwQpbOKPOWcd1m9WVkYT4ISkQb9rGpU/n05l5 vUq4ORmBfC+Zbn01B6EuWQwTHmT7xUGmxVGPzKSGHgkM8ahOEhHyrODQKgutRLm/uvEqFkVcQvgr u26FQkOfgiKHwH6FMcbDlRJ6Q9oWJZgq5yJbrQkihTVM7zr5ts0BVLGY5dqsTuZBgY0Uw9ZgtAFR pK7nUA+K6G0b093ufGf2AeYQSYBp8FXkOY9A76cMVtGkOZMrFUI0sTwiobz/Nqbbbw2IvLFncwK1 2UPbjNpNdua4dseq9ID3L3aRPhhccl0uKw4kwcCKyvIyUTaXa49MLj5c/mIAFQjyDKKp+MJ7egKX PTsVReixtt05M4weJmG/F/Mg6ve00AnvHxdMr7GV9hpbnfI6MV+GMpHKPxvVfOMtt/GpwX7MSRLy JMmDCDMWFWitlBR5EB4kc6kirnYvvnEvIh3jG1TSP2rV4yuHK/iPSJCIZYYrkDmaq51V3/gbdh+s wqBBLwq9STi+p4GCcWtqmXt2M19399Z/BM83XmPowcxWx2P5mhQygXb4NnR9I3RDt3vA+wI12KY/ wIExF8tYlzo2zde+UbGxgGZiLoJUJBtvoEQADacUFeIP7jHL5ulOcF8e4s1lEnWBi9KK93ZR/nXJ DjBc8MBZCg/yAk6KdEkKFaKlWOvcazTu7+8tbKwQkaPW2hApXmcaiVxKK8+WNYikfmJfZRd6kTYq YEou4PYKTnS3oMEhfKmRBiSojvBDVanzjefI2yKrHL6HSB5CXob73jf2EWg2d8H+K1mG3vhG6c9j U+Zcag0zqjS427SXHSUFPL5RJsWB73fP84as5bHUsmjoeJXOC+h4FqO+YWEXumPUanbg885tudaX fPnuwbF3i0QG2kv4Qnd3NaBKljGtuu/6Pd9Aksn/AwaA7DM8Zq/NU8zSXUZC0vsGpD0eAKxnhyN2 Cs0AH1j+WdTJ5ngYl20WL0q+kfYauL1PhvDTh8NA5SmHHscAXf4qtyDBPWp1Kq7tLMNUy7i5z0+r CRq1/q0A8+CNIiKDpk3g91gv8A0SQ69+uQowmhPcY8EYhJo71vSN53V7jQAKIu//AJBvHJh+PaDy 4H3JlQmunk7yhy66TwoUPE5vt4ZHHCq4URbf0xL1XG+sdHC8X5nlqPC2c+KhhZXmCfaxXSiPyICI VMlwmxU2OgE+j9lwUaNaA76xr4Ja/5+hzDdd0oPbjcyW303WnRgvJk3yJxkkCSnrDYDglQgvNNHr SMGpjCMah7lh/A8jgc8k+g8AAA== --=-=-= Content-Type: text/plain here in this bug report about In GNU Emacs 24.5.1 (i686-pc-linux-gnu, GTK+ Version 3.21.5) of 2016-09-06 on x86-csail-01, modified by Debian. --=-=-=--