From: Felix Gruber <felgru@posteo.net>
To: 55044@debbugs.gnu.org
Cc: Felix Gruber <felgru@posteo.net>
Subject: [bug#55044] [PATCH 8/8] gnu: Add python-scrapy.
Date: Wed, 20 Apr 2022 17:28:04 +0000 [thread overview]
Message-ID: <20220420172804.8849-8-felgru@posteo.net> (raw)
In-Reply-To: <20220420172518.8609-1-felgru@posteo.net>
* gnu/packages/python-web.scm (python-scrapy): New variable.
---
gnu/packages/python-web.scm | 60 +++++++++++++++++++++++++++++++++++++
1 file changed, 60 insertions(+)
diff --git a/gnu/packages/python-web.scm b/gnu/packages/python-web.scm
index da3f9cf980..f4ff4f494c 100644
--- a/gnu/packages/python-web.scm
+++ b/gnu/packages/python-web.scm
@@ -6519,3 +6519,63 @@ by asyncio.")
HTML and XML using XPath and CSS selectors, optionally combined with
regular expressions.")
(license license:bsd-3)))
+
+(define-public python-scrapy
+ (package
+ (name "python-scrapy")
+ (version "2.6.1")
+ (source
+ (origin
+ (method url-fetch)
+ (uri (pypi-uri "Scrapy" version))
+ (sha256
+ (base32 "09rqalbwcz9ix8h0992mzjs50sssxsmmh8w9abkrqchgknjmbzan"))))
+ (build-system python-build-system)
+ (arguments
+ `(#:phases
+ (modify-phases %standard-phases
+ (replace 'check
+ (lambda* (#:key tests? #:allow-other-keys)
+ (when tests?
+ (invoke "pytest"
+ ;; requires network access
+ "--ignore" "tests/test_command_check.py"
+ "-k"
+ (string-append
+ ;; Failing for unknown reasons
+ "not test_server_set_cookie_domain_suffix_public_private"
+ " and not test_user_set_cookie_domain_suffix_public_private"
+ " and not test_pformat")
+ "tests")))))))
+ (propagated-inputs
+ (list python-botocore ; Optional: For S3FeedStorage class.
+ python-cryptography
+ python-cssselect
+ python-itemadapter
+ python-itemloaders
+ python-lxml
+ python-parsel
+ python-protego
+ python-pydispatcher
+ python-pyopenssl
+ python-queuelib
+ python-service-identity
+ python-setuptools
+ python-tldextract
+ python-twisted
+ python-w3lib
+ python-zope-interface))
+ (native-inputs
+ (list python-pytest
+ python-pyftpdlib
+ python-sybil
+ python-testfixtures
+ python-uvloop
+ ))
+ (home-page "https://scrapy.org")
+ (synopsis "A high-level Web Crawling and Web Scraping framework")
+ (description "Scrapy is a fast high-level web crawling and web
+scraping framework, used to crawl websites and extract structured data
+from their pages. It can be used for a wide range of purposes, from data
+mining to monitoring and automated testing.")
+ (license license:bsd-3)))
--
2.30.2
next prev parent reply other threads:[~2022-04-20 17:29 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-04-20 17:25 [bug#55044] [PATCH 0/8] Add python-scrapy Felix Gruber
2022-04-20 17:27 ` [bug#55044] [PATCH 1/8] gnu: Add python-sybil Felix Gruber
2022-04-20 17:27 ` [bug#55044] [PATCH 2/8] gnu: Add python-pydispatcher Felix Gruber
2022-04-20 17:27 ` [bug#55044] [PATCH 3/8] gnu: Add python-queuelib Felix Gruber
2022-04-20 17:28 ` [bug#55044] [PATCH 4/8] gnu: Add python-itemadapter Felix Gruber
2022-04-20 17:28 ` [bug#55044] [PATCH 5/8] gnu: Add python-protego Felix Gruber
2022-04-20 17:28 ` [bug#55044] [PATCH 6/8] gnu: Add python-parsel Felix Gruber
2022-04-20 17:28 ` [bug#55044] [PATCH 7/8] gnu: Add python-itemloaders Felix Gruber
2022-04-20 17:28 ` Felix Gruber [this message]
2022-05-02 13:14 ` bug#55044: [PATCH 0/8] Add python-scrapy Ludovic Courtès
2022-05-02 16:22 ` [bug#55044] " Felix Gruber
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20220420172804.8849-8-felgru@posteo.net \
--to=felgru@posteo.net \
--cc=55044@debbugs.gnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/guix.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.