From mboxrd@z Thu Jan 1 00:00:00 1970
Received: from eggs.gnu.org ([2001:4830:134:3::10]:57989) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1eV1m5-0000KT-W5 for guix-patches@gnu.org; Fri, 29 Dec 2017 16:01:07 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1eV1m2-0008Fr-LO for guix-patches@gnu.org; Fri, 29 Dec 2017 16:01:06 -0500
Received: from debbugs.gnu.org ([208.118.235.43]:48311) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1eV1m2-0008FK-Gr for guix-patches@gnu.org; Fri, 29 Dec 2017 16:01:02 -0500
Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1eV1m2-0008Bw-6l for guix-patches@gnu.org; Fri, 29 Dec 2017 16:01:02 -0500
Subject: [bug#29902] [PATCH] gnu: Add html-xml-utils.
Resent-Message-ID:
Received: from eggs.gnu.org ([2001:4830:134:3::10]:57202) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1eV1lS-0008LA-Ok for guix-patches@gnu.org; Fri, 29 Dec 2017 16:00:30 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1eV1lP-0007XM-5L for guix-patches@gnu.org; Fri, 29 Dec 2017 16:00:26 -0500
Received: from dd5424.kasserver.com ([85.13.138.252]:51106) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1eV1lO-0007Tf-GO for guix-patches@gnu.org; Fri, 29 Dec 2017 16:00:22 -0500
From: Stefan =?UTF-8?Q?Reich=C3=B6r?=
Date: Fri, 29 Dec 2017 22:00:18 +0100
Message-ID: <151458121841.14489.3140405883925988155.stgit@sandburg>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 7bit
List-Id:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: guix-patches-bounces+kyle=kyleam.com@gnu.org
Sender: "Guix-patches"
To: 29902@debbugs.gnu.org

* gnu/packages/xml.scm (html-xml-utils): New variable.
---
 gnu/packages/xml.scm | 54 ++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 54 insertions(+)

diff --git a/gnu/packages/xml.scm b/gnu/packages/xml.scm
index 344d7c3..dde1964 100644
--- a/gnu/packages/xml.scm
+++ b/gnu/packages/xml.scm
@@ -1116,6 +1116,60 @@ match and extract data, and elements can be added, deleted or modified using
 XSLT and EXSLT.")
     (license license:x11)))
 
+(define-public html-xml-utils
+  (package
+    (name "html-xml-utils")
+    (version "7.4")
+    (source
+     (origin
+       (method url-fetch)
+       (uri (string-append
+             "https://www.w3.org/Tools/HTML-XML-utils/html-xml-utils-"
+             version ".tar.gz"))
+       (sha256
+        (base32
+         "04pgrahsfawnzd9pilvirs05pfdgsd7qwvw4dvkb42rgybhw6h95"))))
+    (build-system gnu-build-system)
+    (home-page "https://www.w3.org/Tools/HTML-XML-utils/")
+    (synopsis "Command line utilities to manipulate HTML and XML files")
+    (description "HTML-XML-utils provides a number of simple utilities for
+manipulating and converting HTML and XML files in various ways.  The suite
+consists of the following tools:
+
+@itemize
+ @item @command{asc2xml} convert from @code{UTF-8} to @code{&#nnn;} entities
+ @item @command{xml2asc} convert from @code{&#nnn;} entities to @code{UTF-8}
+ @item @command{hxaddid} add IDs to selected elements
+ @item @command{hxcite} replace bibliographic references by hyperlinks
+ @item @command{hxcite-mkbib} expand references and create bibliography
+ @item @command{hxclean} apply heuristics to correct an HTML file
+ @item @command{hxcopy} copy an HTML file while preserving relative links
+ @item @command{hxcount} count elements and attributes in HTML or XML files
+ @item @command{hxextract} extract selected elements
+ @item @command{hxincl} expand included HTML or XML files
+ @item @command{hxindex} create an alphabetically sorted index
+ @item @command{hxmkbib} create bibliography from a template
+ @item @command{hxmultitoc} create a table of contents for a set of HTML files
+ @item @command{hxname2id} move some @code{ID=} or @code{NAME=} from A elements to their parents
+ @item @command{hxnormalize} pretty-print an HTML file
+ @item @command{hxnsxml} convert output of hxxmlns back to normal XML
+ @item @command{hxnum} number section headings in an HTML file
+ @item @command{hxpipe} convert XML to a format easier to parse with Perl or AWK
+ @item @command{hxprintlinks} number links and add table of URLs at end of an HTML file
+ @item @command{hxprune} remove marked elements from an HTML file
+ @item @command{hxref} generate cross-references
+ @item @command{hxselect} extract elements that match a (CSS) selector
+ @item @command{hxtoc} insert a table of contents in an HTML file
+ @item @command{hxuncdata} replace CDATA sections by character entities
+ @item @command{hxunent} replace HTML predefined character entities by @code{UTF-8}
+ @item @command{hxunpipe} convert output of hxpipe back to XML format
+ @item @command{hxunxmlns} replace \"global names\" by XML Namespace prefixes
+ @item @command{hxwls} list links in an HTML file
+ @item @command{hxxmlns} replace XML Namespace prefixes by \"global names\"
+@end itemize
+")
+    (license license:expat)))
+
 (define-public xlsx2csv
   (package
     (name "xlsx2csv")