all messages for Emacs-related lists mirrored at yhetil.org
* Abbreviated newspaper articles with eww
@ 2015-05-05 12:50 Michael Heerdegen
  2015-05-05 15:04 ` tomas
From: Michael Heerdegen @ 2015-05-05 12:50 UTC
  To: Emacs mailing list

Hello,

I want to read newspaper articles on "www.sueddeutsche.de" with eww.
But some articles are appearing incomplete (there is a paywall on the
site, but the articles I mean are not behind it).

Here is an example:

http://www.sueddeutsche.de/politik/reparationszahlungen-gewolltes-missverstaendnis-1.2463623

With firefox or w3m, I get a complete article.  With eww, it is
abbreviated after:

| Bundesrepublik gebe endlich zu, Zahlungen prüfen zu müssen. Muss sie ...

Not all articles are affected, only some.

I tried to

(setq-default
 url-user-agent
 (concat "User-Agent: Emacs-w3m/" emacs-w3m-version " " w3m-version "\r\n"))

hoping this would help, but it made no difference.  The article
is already fetched incompletely by url-retrieve:

(url-retrieve
 "http://www.sueddeutsche.de/politik/reparationszahlungen-gewolltes-missverstaendnis-1.2463623"
 (lambda (&rest args) (pop-to-buffer (current-buffer))))

gives me the abbreviated version.
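
One way to check whether the truncation already happens at the HTTP
level, rather than in eww's rendering, is to measure the response body
and compare it with what another client receives.  The following is
only a minimal sketch: `my/url-body-size' is a hypothetical helper name,
not part of url.el or eww, and it assumes the headers end at the first
blank line of the response buffer.

```elisp
(require 'url)

;; Hypothetical helper for illustration; not part of url.el or eww.
(defun my/url-body-size (url)
  "Fetch URL synchronously and return the size of the response body.
The size is counted in characters of the response buffer.
Return nil if the request fails or no header/body separator is found."
  (let ((buf (url-retrieve-synchronously url)))
    (when buf
      (unwind-protect
          (with-current-buffer buf
            (goto-char (point-min))
            ;; Assumption: the headers end at the first blank line.
            (when (re-search-forward "\r?\n\r?\n" nil t)
              (- (point-max) (point))))
        ;; Always clean up the temporary response buffer.
        (kill-buffer buf)))))
```

Calling it with the article URL above and comparing the number with the
size of the same page saved from Firefox or w3m would show whether the
server itself sends a shorter document.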

How can I get eww to show the complete article?


Thanks,

Michael.




* Re: Abbreviated newspaper articles with eww
  2015-05-05 12:50 Abbreviated newspaper articles with eww Michael Heerdegen
@ 2015-05-05 15:04 ` tomas
  2015-05-05 15:52   ` Michael Heerdegen
From: tomas @ 2015-05-05 15:04 UTC
  To: Michael Heerdegen; +Cc: Emacs mailing list

On Tue, May 05, 2015 at 02:50:17PM +0200, Michael Heerdegen wrote:
> Hello,
>
> I want to read newspaper articles on "www.sueddeutsche.de" with eww.
> But some articles are appearing incomplete (there is a paywall on the
> site, but the articles I mean are not behind it).
>
> Here is an example:
>
> http://www.sueddeutsche.de/politik/reparationszahlungen-gewolltes-missverstaendnis-1.2463623
>
> With firefox or w3m, I get a complete article.  With eww, it is
> abbreviated after:
>
> | Bundesrepublik gebe endlich zu, Zahlungen prüfen zu müssen. Muss sie ...
>
> Not all articles are affected, only some.

Works for me (I know, I know...)

I had never tried eww until now (yikes!), so this was a handy chance,
but don't expect too much expertise.

Some sites have a kind of "soft" paywall, which shows you (or whatever
their fuzzball of code thinks is "you") a limited number of articles
in full, after that they come in mutilated. The ellipsis in your quote
seems to hint at some "deliberate" action and not at an accident (the
version I got just continues there, no ellipsis).

Regards
-- tomás




* Re: Abbreviated newspaper articles with eww
  2015-05-05 15:04 ` tomas
@ 2015-05-05 15:52   ` Michael Heerdegen
  2015-05-05 16:13     ` tomas
From: Michael Heerdegen @ 2015-05-05 15:52 UTC
  To: help-gnu-emacs

Hello Tomas,

> Some sites have a kind of "soft" paywall, which shows you (or whatever
> their fuzzball of code thinks is "you") a limited number of articles
> in full, after that they come in mutilated. The ellipsis in your quote
> seems to hint at some "deliberate" action and not at an accident (the
> version I got just continues there, no ellipsis).

I think you are right.  I got confused by the "fuzz" part.  I think they
have such a "soft" paywall.  Dunno why I thought the problem would come
from Emacs... ;-)


Many Thanks,

Michael.





* Re: Abbreviated newspaper articles with eww
  2015-05-05 15:52   ` Michael Heerdegen
@ 2015-05-05 16:13     ` tomas
From: tomas @ 2015-05-05 16:13 UTC
  To: Michael Heerdegen; +Cc: help-gnu-emacs

On Tue, May 05, 2015 at 05:52:06PM +0200, Michael Heerdegen wrote:
> Hello Tomas,
>
> [...] Dunno why I thought the problem would come
> from Emacs... ;-)

It *never* comes from Emacs, does it? ;-)

cheers
-- t


