From: Ted Zlatanov
Newsgroups: gmane.emacs.bugs
Subject: bug#24831: shr mangling messages
Date: Fri, 04 Nov 2016 14:18:03 -0400
Organization: Теодор Златанов @ Cienfuegos
Message-ID: <87twbn3y90.fsf@lifelogs.com>
References: <87shrgvt8y.fsf@jidanni.org> <87oa1z5trs.fsf@jidanni.org>
To: Richard Stallman
Cc: larsi@gnus.org, 積丹尼 Dan Jacobson, 24831@debbugs.gnu.org, yamaoka@jpl.org
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.0.50 (gnu/linux)
List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors"

On Tue, 01 Nov 2016 13:16:52 -0400 Richard Stallman wrote:

>> Another idea would be first run it through a validator.
>> If valid, proceed as before.
>> If invalid, just spew out all the text nodes of the whole document,
>> separated by spaces.
RS> Do we have a validator in Emacs Lisp? Or would we run one as a child?
RS> What program is available?

IMHO validation is not a workable solution, both because of complexity
and because real-world HTML authors are incredibly skilled at writing
broken HTML that somehow renders in the browsers they support.

Ted