From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Lars Magne Ingebrigtsen Newsgroups: gmane.emacs.devel Subject: Re: Problems with xml-parse-string Date: Fri, 24 Sep 2010 21:06:52 +0200 Organization: Programmerer Ingebrigtsen Message-ID: References: <87vd5ymptn.fsf@stupidchicken.com> <87vd5x7ty2.fsf@stupidchicken.com> <87vd5wo48a.fsf@stupidchicken.com> <8739t03q2g.fsf@stupidchicken.com> <87k4mb2mfu.fsf@stupidchicken.com> <87pqw3nm4y.fsf@stupidchicken.com> <87iq1v0yi2.fsf@stupidchicken.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1285355247 15659 80.91.229.12 (24 Sep 2010 19:07:27 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Fri, 24 Sep 2010 19:07:27 +0000 (UTC) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Sep 24 21:07:26 2010 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1OzDcB-0002St-Q6 for ged-emacs-devel@m.gmane.org; Fri, 24 Sep 2010 21:07:24 +0200 Original-Received: from localhost ([127.0.0.1]:34700 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1OzDcB-0002WP-A2 for ged-emacs-devel@m.gmane.org; Fri, 24 Sep 2010 15:07:23 -0400 Original-Received: from [140.186.70.92] (port=36011 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1OzDc2-0002UI-8g for emacs-devel@gnu.org; Fri, 24 Sep 2010 15:07:17 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1OzDbx-00084c-N5 for emacs-devel@gnu.org; Fri, 24 Sep 2010 15:07:14 -0400 Original-Received: from lo.gmane.org ([80.91.229.12]:40739) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1OzDbx-00084R-CI for emacs-devel@gnu.org; Fri, 24 Sep 2010 15:07:09 -0400 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1OzDbs-00027G-Nh for emacs-devel@gnu.org; Fri, 24 Sep 2010 21:07:04 +0200 Original-Received: from cm-84.215.34.171.getinternet.no ([84.215.34.171]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Fri, 24 Sep 2010 21:07:04 +0200 Original-Received: from larsi by cm-84.215.34.171.getinternet.no with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Fri, 24 Sep 2010 21:07:04 +0200 X-Injected-Via-Gmane: http://gmane.org/ Mail-Followup-To: emacs-devel@gnu.org Original-Lines: 101 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: cm-84.215.34.171.getinternet.no Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAAFVBMVEXZz8utoaJvZWRGPTui l5giGxqRh4jjC5C/AAACb0lEQVQ4jVXTvXLjIBAA4MXjSy0Uxr0XSX0gqLYcogcQQ28hzPs/wu0i +ZJjnMafl/0j0MjjXLR1Jgzel1IQC0xSsgmpg9U2huz9TFTA+6kGvEfrIp1F1Zh/oGLEyBImWQX2 31OAcYEjQpaqgt+hD3R/jego5AfoJhdtjSgHvG6KLmj6iyHl3/Cke8K2WDql/AZHYPSIw4AZa1nw ShHCorsi13lOE/e4Q0tgjV5v0KiS8k/EFyVF1BMAnOet+4FnjAYHXHlmbRo9SR2JoiYI0lWuUrwX c4CXF4LFzPeHeAixDTsoz7ldsMOKD7gJtaGZuCq/9x1tp/AGYr5r/bpKym/eA26pgTdtNNrsyw5U FCcHDwJ7RNsxTLSQnvozQ4YWYMak7fXVeU/9mQbgQg2KZMxwQEsLCuMLNlxK3uG9ToogMTwx8EwY vrVzmkBUeFVFoJ+9s+YBbxWsNt2R4/PpemNWKqlG6DHvoD7oiVgkqMfaYVZ+JmhXGyj5C5zFnHeY bTDW3ECtmaZFK+P3y3DXzrhlgnaikk8xDGUvl8BaFyYQN7rpT7B4URVUro/weqRYjrFTuVuFUfL3 J3rcWu45JNL+aO4rZQBqNejrDvfEOaLpGGiVAT8ZJtkWerE6mkwgCBZaVIULR1DF3CFDsHVW3l+4 XPowvPEGnN9BzY5Ghcj1nun/xNibFAWyl20fDOqBcyiqvO7sBLTG9csgbgNfJTUN9PNMyaCUTJ5O eeaByGSjHu/c6pAe5C10E9ANTZusXgV3lAcpANpmbipcUBsl6tSEPDdwlvwjkEKlEVsp4NepIsS6 +vX/ryvQm5NQF/AXyDvPSRzF/6oAAAAASUVORK5CYII= Mail-Copies-To: never X-Now-Playing: Marianne Faithfull's _Dangerous Acquaintances_: "Tenderness" User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:y8x9gaT4mib2k1kFb5BT+yEFX2Y= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:130797 Archived-At: Chong Yidong writes: >> To take a concrete example: You want the src of the img node you have. >> >> xml.el: (cdr (assq 'img (cadr node))) >> sxml.el: (if (and (consp (cadr node)) >> (eq (caadr node) '@)) >> (cadr (assq 'img node))) >> >> (And I'm not even sure that's correct. It's probably not. Which is my >> point.) >> >> libxml: (cdr (assq :img (cdr node))) >> >> (The difference between libxml and xml.c for attributes is minuscule.) > > I think this example is confused. If you're scanning at top-level, > without descent, the code for all three cases is practically identical: > it's a simple assq. Sorry, "img" in the examples should be replaced with "src". But I don't understand what you mean with "top-level". This example was about getting an attribute. Chong Yidong writes: > I think that code might just be crufty. Here's an implementation, > assuming sxml format: > > (defun nnrss-find-el (tag data &optional found-list) > "Find the all matching elements in the data. > Careful with this on large documents!" > (nreverse (nnrss-find-el-1 tag data))) > > (defun nnrss-find-el-1 (tag data &optional found-list) > (and (consp data) > (not (eq (car data) '@)) > (if (equal tag (car data)) > (push data found-list) > (dolist (bit (cdr data)) > (setq found-list > (nnrss-find-el-1 tag bit found-list))))) > found-list) Here's the libxml version: (defun nnrss-find-el (tag node) (let (result) (dolist (elem (cdr node)) (when (eq (car elem) tag) (push elem result)) (when (consp (cdr elem)) (setq result (nconc (nnrss-find-el tag elem) result)))) result)) (Of course, if we were allowed to use CL constructions, it would be something like (untested, because I have to run): (defun nnrss-find-el (tag node) (loop for elem in (cdr node) if (eq (car elem) tag) collect elem when (consp (cdr elem)) append (nnrss-find-el tag node))) Can get much clearer than that, can it? > Doesn't seem too horrific. Here's the example tree: It's not too horrific. It's just that it's not as nice as it could be. And if you deal with these structures all the time, you get to recreate the (and (consp data) (not (eq (car data) '@))) in every single branch of every single little trivial function you write (and read). > (setq test-sxml-tree > '(tag (@ (attr1 "value1") > (attr2 "value2")) > "Free text" > (foo "Text node 1") > (bar "Text node 2") > (baz > "More free text" > (foo "Text node 3") > (bar "Text node 4")))) (setq test-libxml-tree '(tag (:attr1 . "value1") (:attr2 . "value2") (text . "Free text") (foo (text . "Text node 1")) (bar (text . "Text node 2")) (baz (text . "More free text") (foo (text . "Text node 3")) (bar (text . "Text node 4"))))) -- (domestic pets only, the antidote for overdose, milk.) larsi@gnus.org * Lars Magne Ingebrigtsen