From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Tim Landscheidt Newsgroups: gmane.emacs.help Subject: Re: elisps dom library doesn't work as I expect Date: Tue, 09 May 2023 05:32:21 +0000 Organization: https://www.tim-landscheidt.de/ Message-ID: <87fs866pei.fsf@vagabond.tim-landscheidt.de> References: <87ttwmgnxg.fsf@mail.de> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="35149"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.2 (gnu/linux) Cc: help-gnu-emacs@gnu.org To: Stefan Huchler Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Tue May 09 07:33:01 2023 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1pwFy8-0008vO-8C for geh-help-gnu-emacs@m.gmane-mx.org; Tue, 09 May 2023 07:33:00 +0200 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1pwFxd-0001I6-NI; Tue, 09 May 2023 01:32:29 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pwFxb-0001Hx-VU for help-gnu-emacs@gnu.org; Tue, 09 May 2023 01:32:28 -0400 Original-Received: from andalucia.tim-landscheidt.de ([2a01:4f8:1c1c:d4d0::1]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pwFxa-0005zj-3N for help-gnu-emacs@gnu.org; Tue, 09 May 2023 01:32:27 -0400 Original-Received: from [195.226.160.202] (port=41742 helo=vagabond) by andalucia.tim-landscheidt.de with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1pwFxV-00043P-Th; Tue, 09 May 2023 05:32:21 +0000 In-Reply-To: <87ttwmgnxg.fsf@mail.de> (Stefan Huchler's message of "Tue, 09 May 2023 05:54:03 +0200") Received-SPF: pass client-ip=2a01:4f8:1c1c:d4d0::1; envelope-from=tim@tim-landscheidt.de; helo=andalucia.tim-landscheidt.de X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.help:143541 Archived-At: Stefan Huchler wrote: > I try to webscrape ebay auctions, but I can't scape the name as example: > [=E2=80=A6] > (print (length (dom-elements > (dom-by-tag ebay-tracker-auction 'meta) > 'itemprop "name")))) > ;; data that I try to scrape: > ;; > (ebay-tracker--query-test "185887279856") > Also the dom-elements funktion is not described on the gnu website, yet > it's the only way to find elements with this library that don't use > class or id as attributes. shouldn't that code find this node? dom-by-tag returns a list of DOM elements; however, dom-elements expects a single DOM element as its second ar- gument. So you need to iterate over the list of DOM ele- ments returned by dom-by-tag and call dom-elements on each, or use dom-search, etc. Tim