From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:59218) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1do6dn-00076W-14 for guix-patches@gnu.org; Sat, 02 Sep 2017 07:31:11 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1do6di-0004or-6K for guix-patches@gnu.org; Sat, 02 Sep 2017 07:31:07 -0400 Received: from debbugs.gnu.org ([208.118.235.43]:32925) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1do6di-0004oi-33 for guix-patches@gnu.org; Sat, 02 Sep 2017 07:31:02 -0400 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1do6dh-0000Ep-T7 for guix-patches@gnu.org; Sat, 02 Sep 2017 07:31:01 -0400 Subject: [bug#28235] [PATCH 2/3] gnu: Add python-html5-parser, python2-html5-parser Resent-Message-ID: From: Marius Bakke In-Reply-To: <8760dbtsf0.fsf@gnu.org> References: <878ti7tsli.fsf@gnu.org> <8760dbtsf0.fsf@gnu.org> Date: Sat, 02 Sep 2017 13:30:11 +0200 Message-ID: <874lsll4n0.fsf@fastmail.com> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-patches-bounces+kyle=kyleam.com@gnu.org Sender: "Guix-patches" To: Roel Janssen , 28235@debbugs.gnu.org --=-=-= Content-Type: text/plain Content-Transfer-Encoding: quoted-printable Roel Janssen writes: > * gnu/packages/python.scm (python-html5-parser): New variable. > (python2-html5-parser: New variable. > --- > gnu/packages/python.scm | 29 +++++++++++++++++++++++++++++ > 1 file changed, 29 insertions(+) > > diff --git a/gnu/packages/python.scm b/gnu/packages/python.scm > index 9bf46fb6f..8629228db 100644 > --- a/gnu/packages/python.scm > +++ b/gnu/packages/python.scm > @@ -5868,6 +5868,35 @@ and written in Python.") > (define-public python2-html5lib-0.9 > (package-with-python2 python-html5lib-0.9)) >=20=20 > +(define-public python-html5-parser > + (package > + (name "python-html5-parser") > + (version "0.4.4") > + (source (origin > + (method url-fetch) > + (uri (pypi-uri "html5-parser" version)) > + (sha256 > + (base32 > + "1d8sxhl41ffh7qlk7wlsy17xw6slzx5v1yna9s72wx5qrpaa3wxr"))= )) > + (build-system python-build-system) > + (native-inputs > + `(("pkg-config" ,pkg-config))) > + (inputs > + `(("libxml2" ,libxml2))) > + (propagated-inputs > + `(("python-lxml" ,python-lxml) > + ("python-beautifulsoup4" ,python-beautifulsoup4))) > + (home-page "https://html5-parser.readthedocs.io") > + (synopsis "Fast C-based HTML5 parsing for Python") > + (description "This package provides a fast implementation of the HTM= L5 > +parsing spec for Python. Parsing is done in C using a variant of the gu= mbo > +parser. The gumbo parse tree is then transformed into an lxml tree, als= o in > +C, yielding parse times that can be a thirtieth of the html5lib parse ti= mes.") > + (license license:asl2.0))) The files 'src/as-libxml.[ch]' are GPL3. Everything else in this series LG= TM! --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQEzBAEBCgAdFiEEu7At3yzq9qgNHeZDoqBt8qM6VPoFAlmqlkMACgkQoqBt8qM6 VPrmTggArOLC266pyNNgTZz7r1XqNSp9F1uiaZuQdJPBzrbKpuf5zr5OTHL0myuw X+3HrqtrrnOF9jH+wicxn16PyN7t6EtUKlhEEmEYKiSRlO8tgZp7NbNgRvR9DPq+ aNk5zEsVueJVBDYxU2ZKUiIj0LU+2+2TiVxrBAxra2rD7mYa1JjuABA//q9OUNTv wBNGXvWr+myY2s+cxPvcYlW3W2EAvgHQGJj3A9QYZeDokUqa21TLyqgK5EQv7ach kFV1zpirmDgeJPXRqRGsJTr5yaO6o8eOOzrak2DOcSZ0wWnvswTm6Ye9EZUnWj77 ScMlkPzfsIf/br/kfwJvhOHzFXp5Rw== =sHzj -----END PGP SIGNATURE----- --=-=-=--