From: Lennart Borgman
Newsgroups: gmane.emacs.devel
Subject: Re: Linking Emacs with libxml2
Date: Mon, 6 Sep 2010 20:44:06 +0200
To: emacs-devel@gnu.org

On Mon, Sep 6, 2010 at 5:21 PM, Lars Magne Ingebrigtsen wrote:
> Apparently libxml2 comes with a parser for "real world" HTML, which is
> very intriguing:
>
> http://www.xmlsoft.org/html/libxml-HTMLparser.html
>
> If Emacs provided a native interface to this function, we could say
>
> (parse-html "file.html")
> => (:html (:head ...) (:body ...))
>
> and get a nice parse tree out very fast.  (Parsing HTML from Emacs Lisp
> is rather slow.)
>
> Has this been discussed before and rejected?  It seems like an obvious
> idea, and would enable both easier extraction of data from HTML files,
> as well as writing a (simple) HTML renderer in Emacs Lisp.

It was discussed before here:

http://lists.gnu.org/archive/html/emacs-devel/2007-06/msg01147.html

Wasn't there a problem with linking to external libraries at that time?