From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Stefan Huchler Newsgroups: gmane.emacs.help Subject: elisps dom library doesn't work as I expect Date: Tue, 09 May 2023 05:54:03 +0200 Message-ID: <87ttwmgnxg.fsf@mail.de> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="12010"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.2 (gnu/linux) To: help-gnu-emacs@gnu.org Cancel-Lock: sha1:CgZMBWJzEmjYgUGP4BojtmckYWA= Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Tue May 09 05:54:59 2023 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1pwERG-0002xE-El for geh-help-gnu-emacs@m.gmane-mx.org; Tue, 09 May 2023 05:54:58 +0200 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1pwEQa-0002Oc-ED; Mon, 08 May 2023 23:54:16 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pwEQY-0002O9-Au for help-gnu-emacs@gnu.org; Mon, 08 May 2023 23:54:14 -0400 Original-Received: from ciao.gmane.io ([116.202.254.214]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pwEQW-0003uY-K0 for help-gnu-emacs@gnu.org; Mon, 08 May 2023 23:54:14 -0400 Original-Received: from list by ciao.gmane.io with local (Exim 4.92) (envelope-from ) id 1pwEQT-00021v-6q for help-gnu-emacs@gnu.org; Tue, 09 May 2023 05:54:09 +0200 X-Injected-Via-Gmane: http://gmane.org/ Received-SPF: pass client-ip=116.202.254.214; envelope-from=geh-help-gnu-emacs@m.gmane-mx.org; helo=ciao.gmane.io X-Spam_score_int: -13 X-Spam_score: -1.4 X-Spam_bar: - X-Spam_report: (-1.4 / 5.0 requ) BAYES_00=-1.9, FREEMAIL_FORGED_FROMDOMAIN=0.249, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.25, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.help:143540 Archived-At: I try to webscrape ebay auctions, but I can't scape the name as example: (defun ebay-tracker--query-test (id) "Webscrape ebay auction by ID." (setq ebay-tracker-running 't) (let* ((auction-url (format "https://www.ebay.com/itm/%s" id))) (request auction-url :parser (lambda () (goto-char (point-min)) (re-search-forward "^$") ;; (next-line) (libxml-parse-html-region (point) (point-max))) :headers '(("Content-Type" . "application/json")) :success (cl-function (lambda (&key data &allow-other-keys) (when data (setq ebay-tracker-auction data) (message "success") (setq ebay-tracker-running nil)))))) (while ebay-tracker-running (sleep-for 0.03)) (print (length (dom-elements (dom-by-tag ebay-tracker-auction 'meta) 'itemprop "name")))) ;; data that I try to scrape: ;; (ebay-tracker--query-test "185887279856") Also the dom-elements funktion is not described on the gnu website, yet it's the only way to find elements with this library that don't use class or id as attributes. shouldn't that code find this node?