From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:470:142:3::10]:41599) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1iyqOX-0001lv-8e for guix-patches@gnu.org; Tue, 04 Feb 2020 00:05:06 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1iyqOW-0002TW-0h for guix-patches@gnu.org; Tue, 04 Feb 2020 00:05:05 -0500 Received: from debbugs.gnu.org ([209.51.188.43]:36556) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1iyqOU-0002Rq-GM for guix-patches@gnu.org; Tue, 04 Feb 2020 00:05:03 -0500 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1iyqOU-0006DW-BD for guix-patches@gnu.org; Tue, 04 Feb 2020 00:05:02 -0500 Subject: [bug#39410] [PATCH] gnu: netsurf: Fix entity parsing. Resent-Message-ID: Received: from eggs.gnu.org ([2001:470:142:3::10]:40781) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1iyqNn-0001O8-83 for guix-patches@gnu.org; Tue, 04 Feb 2020 00:04:20 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1iyqNm-00032a-1u for guix-patches@gnu.org; Tue, 04 Feb 2020 00:04:19 -0500 Received: from mout01.posteo.de ([185.67.36.141]:43473) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1iyqNl-0002mm-QS for guix-patches@gnu.org; Tue, 04 Feb 2020 00:04:17 -0500 Received: from submission (posteo.de [89.146.220.130]) by mout01.posteo.de (Postfix) with ESMTPS id D393A160060 for ; Tue, 4 Feb 2020 06:04:15 +0100 (CET) From: Eric Bavier Date: Mon, 3 Feb 2020 23:06:09 -0600 Message-Id: <20200204050609.27557-1-bavier@member.fsf.org> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-patches-bounces+kyle=kyleam.com@gnu.org Sender: "Guix-patches" To: 39410@debbugs.gnu.org Cc: Eric Bavier From: Eric Bavier From: Eric Bavier Follow-up to commit 31afa654c58cd7aa8bd11a771fa6eabcd766d443. * gnu/packages/web.scm (netsurf)[arguments]: In 'adjust-welcome' phase, e= nsure html entities are parsed and find their way back to the output. --- gnu/packages/web.scm | 11 +++++++++-- 1 file changed, 9 insertions(+), 2 deletions(-) diff --git a/gnu/packages/web.scm b/gnu/packages/web.scm index f6ae958ae3..60fd5cfd0f 100644 --- a/gnu/packages/web.scm +++ b/gnu/packages/web.scm @@ -6,7 +6,7 @@ ;;; Copyright =C2=A9 2015, 2016, 2017, 2018, 2019 Ricardo Wurmus ;;; Copyright =C2=A9 2018 Raoul Jean Pierre Bonnal ;;; Copyright =C2=A9 2015 Taylan Ulrich Bay=C4=B1rl=C4=B1/Kammer -;;; Copyright =C2=A9 2015, 2016, 2017, 2018, 2019 Eric Bavier +;;; Copyright =C2=A9 2015, 2016, 2017, 2018, 2019, 2020 Eric Bavier ;;; Copyright =C2=A9 2015 Eric Dvorsak ;;; Copyright =C2=A9 2016 Sou Bunnbu ;;; Copyright =C2=A9 2016 Jelle Licht @@ -5075,12 +5075,19 @@ w3c webidl files and a binding configuration file= .") ;; Leave the DOCTYPE header as is. (display (read-line in 'concat) out) (sxml->xml - (let rec ((sxml (xml->sxml in))) + (let rec ((sxml (xml->sxml in + #:default-entity-handler + (lambda (port name) + (string-append "" + (symbol->string name) + ""))))) ;; We'd like to use sxml-match here, but it can't ;; match against generic tag symbols... (match sxml (`(div (@ (class "links")) . ,rest) '()) + (`(ENTITY ,ent) + `(*ENTITY* ,ent)) ((x ...) (map rec x)) (x x))) --=20 2.25.0