* web scraping update and a new issue
@ 2012-02-05 14:17 Catonano
0 siblings, 0 replies; only message in thread
From: Catonano @ 2012-02-05 14:17 UTC (permalink / raw)
To: guile-user
[-- Attachment #1: Type: text/plain, Size: 868 bytes --]
People,
should anyone be interested, here are some updates aout my web scraping
initiative
First things first: I owe some apologies, I indicated some issues,
privately, to Ian but I was blabbering. I was just confused. Those issues
emerge with some sites only and they do NOT emerge with my radio station
site. I applied the Ian's patch for chunked respones to my branch and it
works like a charm.
Yesterday I had fun like I hadn't since a long time.
Now I'm running into a new issue: the web server puts a cookie in the
response and I suspect it attempts to acces that cookie at the time of my
subsequent request. I also suspect that it doesn't succeed in accessing
that so the subsequent response contains some incorrect results
Not so when I test the sequence with Firefox.
Is there anything concerning the cookies I should be aware of ?
Thanks for any hint
[-- Attachment #2: Type: text/html, Size: 935 bytes --]
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2012-02-05 14:17 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-02-05 14:17 web scraping update and a new issue Catonano
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).