From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: Linking Emacs with libxml2 Date: Mon, 06 Sep 2010 23:08:40 +0200 Message-ID: References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1283807333 7258 80.91.229.12 (6 Sep 2010 21:08:53 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 6 Sep 2010 21:08:53 +0000 (UTC) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Sep 06 23:08:52 2010 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Osivr-0002kn-5W for ged-emacs-devel@m.gmane.org; Mon, 06 Sep 2010 23:08:51 +0200 Original-Received: from localhost ([127.0.0.1]:57495 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Osivq-00060o-Hs for ged-emacs-devel@m.gmane.org; Mon, 06 Sep 2010 17:08:50 -0400 Original-Received: from [140.186.70.92] (port=34023 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Osivk-00060j-Dw for emacs-devel@gnu.org; Mon, 06 Sep 2010 17:08:45 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1Osivj-0004qm-Dn for emacs-devel@gnu.org; Mon, 06 Sep 2010 17:08:44 -0400 Original-Received: from impaqm3.telefonica.net ([213.4.138.3]:60370) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Osivj-0004qb-79 for emacs-devel@gnu.org; Mon, 06 Sep 2010 17:08:43 -0400 Original-Received: from IMPmailhost3.adm.correo ([10.20.102.124]) by IMPaqm3.telefonica.net with bizsmtp id 3X0e1f03d2h2L9m3PZ8hvG; Mon, 06 Sep 2010 23:08:41 +0200 Original-Received: from ceviche.home ([83.61.51.178]) by IMPmailhost3.adm.correo with BIZ IMP id 3Z8g1f00B3qhCuj1jZ8h45; Mon, 06 Sep 2010 23:08:41 +0200 X-Brightmail-Tracker: AAAAAA== X-TE-authinfo: authemail="monnier$movistar.es" |auth_email="monnier@movistar.es" X-TE-AcuTerraCos: auth_cuTerraCos="cosuitnetc01" Original-Received: by ceviche.home (Postfix, from userid 20848) id 75B89660BB; Mon, 6 Sep 2010 23:08:40 +0200 (CEST) In-Reply-To: (Lars Magne Ingebrigtsen's message of "Mon, 06 Sep 2010 17:21:09 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:129725 Archived-At: > Apparently libxml2 comes with a parser for "real world" HTML, which is > very intriguing: [...] > Has this been discussed before and rejected? It seems like an obvious > idea, and would enable both easier extraction of data from HTML files, > as well as writing a (simple) HTML renderer in Emacs Lisp. It's an obvious idea, but I think it's a fair bit of work: - you'll probably want your function to be able to read from a buffer rather than from a file (reading from a file would slow down the operation to a point where using a separate xml-to-elisp executable isn't that much worse). - parsing HTML is the easy part, rendering it in Emacs is a lot more difficult. Stefan