* Clipboard madness
@ 2009-12-18 3:41 Lennart Borgman
2009-12-18 3:58 ` Miles Bader
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Lennart Borgman @ 2009-12-18 3:41 UTC (permalink / raw)
To: Emacs-Devel devel
I am trying to send copy the summary text from an article in
PDF-format which I am reading with Acrobat Reader. It does not work
well at all in this case. For example ">" becomes "4". Some spaces
disappears.
I do not know what Emacs does when copying from clip board. However
buffer-file-coding-system is a variable defined in `C source code'.
Its value is utf-8-dos
Local in buffer kisely-sum.txt; global value is utf-8-dos
The same thing seems to happen with the default iso-latin-1-dos.
I am doing this on w32 with recent builds.
The PDF I am reading is the full-text PDF from The British Journal of
Psychiatry 2007 v. 191, p. 373-374 (Kisely and Campbell). It is
available on this page:
http://bjp.rcpsych.org/content/vol191/issue5/
I am just copying the text in "Summary", after that word (from
"Supervised" to the end of the paragraph).
Can anyone please shed some light on this?
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Clipboard madness
2009-12-18 3:41 Clipboard madness Lennart Borgman
@ 2009-12-18 3:58 ` Miles Bader
2009-12-18 4:00 ` Lennart Borgman
2009-12-18 4:22 ` Kenichi Handa
2009-12-18 8:57 ` tomas
2 siblings, 1 reply; 7+ messages in thread
From: Miles Bader @ 2009-12-18 3:58 UTC (permalink / raw)
To: Lennart Borgman; +Cc: Emacs-Devel devel
Lennart Borgman <lennart.borgman@gmail.com> writes:
> I am trying to send copy the summary text from an article in
> PDF-format which I am reading with Acrobat Reader. It does not work
> well at all in this case. For example ">" becomes "4". Some spaces
...
> http://bjp.rcpsych.org/content/vol191/issue5/
>
> I am just copying the text in "Summary", after that word (from
> "Supervised" to the end of the paragraph).
>
> Can anyone please shed some light on this?
I don't think it's an Emacs (or Windows) problem -- I same behavior when
pasting that text into e.g. gnome-terminal, on Debian.
-Miles
--
Guilt, n. The condition of one who is known to have committed an indiscretion,
as distinguished from the state of him who has covered his tracks.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Clipboard madness
2009-12-18 3:58 ` Miles Bader
@ 2009-12-18 4:00 ` Lennart Borgman
2009-12-18 4:06 ` Miles Bader
0 siblings, 1 reply; 7+ messages in thread
From: Lennart Borgman @ 2009-12-18 4:00 UTC (permalink / raw)
To: Miles Bader; +Cc: Emacs-Devel devel
On Fri, Dec 18, 2009 at 4:58 AM, Miles Bader <miles@gnu.org> wrote:
> Lennart Borgman <lennart.borgman@gmail.com> writes:
>> I am trying to send copy the summary text from an article in
>> PDF-format which I am reading with Acrobat Reader. It does not work
>> well at all in this case. For example ">" becomes "4". Some spaces
> ...
>> http://bjp.rcpsych.org/content/vol191/issue5/
>>
>> I am just copying the text in "Summary", after that word (from
>> "Supervised" to the end of the paragraph).
>>
>> Can anyone please shed some light on this?
>
> I don't think it's an Emacs (or Windows) problem -- I same behavior when
> pasting that text into e.g. gnome-terminal, on Debian.
Thanks for testing. Are you using Acrobat Reader?
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Clipboard madness
2009-12-18 4:00 ` Lennart Borgman
@ 2009-12-18 4:06 ` Miles Bader
2009-12-19 0:30 ` Lennart Borgman
0 siblings, 1 reply; 7+ messages in thread
From: Miles Bader @ 2009-12-18 4:06 UTC (permalink / raw)
To: Lennart Borgman; +Cc: Emacs-Devel devel
Lennart Borgman <lennart.borgman@gmail.com> writes:
>>> Can anyone please shed some light on this?
>>
>> I don't think it's an Emacs (or Windows) problem -- I same behavior when
>> pasting that text into e.g. gnome-terminal, on Debian.
>
> Thanks for testing. Are you using Acrobat Reader?
Nope (I used Evince, which is a Gtk app that uses poppler for PDF
stuff), so not an Acrobat bug either.
I'm wondering if it's just a wonky PDF file...
-Miles
--
Erudition, n. Dust shaken out of a book into an empty skull.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Clipboard madness
2009-12-18 3:41 Clipboard madness Lennart Borgman
2009-12-18 3:58 ` Miles Bader
@ 2009-12-18 4:22 ` Kenichi Handa
2009-12-18 8:57 ` tomas
2 siblings, 0 replies; 7+ messages in thread
From: Kenichi Handa @ 2009-12-18 4:22 UTC (permalink / raw)
To: Lennart Borgman; +Cc: emacs-devel
In article <e01d8a50912171941h68dacd88g740c45786c6ffaae@mail.gmail.com>, Lennart Borgman <lennart.borgman@gmail.com> writes:
> I am trying to send copy the summary text from an article in
> PDF-format which I am reading with Acrobat Reader. It does not work
> well at all in this case. For example ">" becomes "4". Some spaces
> disappears.
[...]
> The PDF I am reading is the full-text PDF from The British Journal of
> Psychiatry 2007 v. 191, p. 373-374 (Kisely and Campbell). It is
> available on this page:
> http://bjp.rcpsych.org/content/vol191/issue5/
> I am just copying the text in "Summary", after that word (from
> "Supervised" to the end of the paragraph).
I see that problem on GNU/Linux Ubunto too, not only with
Emacs but also with the other applications; gnome-terminal,
gedit, etc. Even more, I see the problem when I paste it to
the search box of acroread itself.
So, I think there's no way to fix it on Emacs side.
---
Kenichi Handa
handa@m17n.org
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Clipboard madness
2009-12-18 3:41 Clipboard madness Lennart Borgman
2009-12-18 3:58 ` Miles Bader
2009-12-18 4:22 ` Kenichi Handa
@ 2009-12-18 8:57 ` tomas
2 siblings, 0 replies; 7+ messages in thread
From: tomas @ 2009-12-18 8:57 UTC (permalink / raw)
To: Lennart Borgman; +Cc: Emacs-Devel devel
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On Fri, Dec 18, 2009 at 04:41:44AM +0100, Lennart Borgman wrote:
> I am trying to send copy the summary text from an article in
> PDF-format which I am reading with Acrobat Reader. It does not work
> well at all in this case. For example ">" becomes "4". Some spaces
> disappears.
>
> I do not know what Emacs does when copying from clip board. However
[...]
> http://bjp.rcpsych.org/content/vol191/issue5/
>
> I am just copying the text in "Summary", after that word (from
> "Supervised" to the end of the paragraph).
>
>
> Can anyone please shed some light on this?
Running pdftotext over it shows heavily mangled text (things are
misplaced, etc). I get the impression that text was "deduced" from the
laayout after the fact, as if someone running OCR over a print.
Surely some distiller abomination :-(
Regards
- -- tomás
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
iD8DBQFLK0P7Bcgs9XrR2kYRAlV0AJ9FnQe0/N8SjlNP3qnMHIJEJLPzVQCfbzUI
jaDPHNPpP+QsR/h6Z1k+cMc=
=7uii
-----END PGP SIGNATURE-----
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Clipboard madness
2009-12-18 4:06 ` Miles Bader
@ 2009-12-19 0:30 ` Lennart Borgman
0 siblings, 0 replies; 7+ messages in thread
From: Lennart Borgman @ 2009-12-19 0:30 UTC (permalink / raw)
To: Miles Bader; +Cc: Emacs-Devel devel
On Fri, Dec 18, 2009 at 5:06 AM, Miles Bader <miles@gnu.org> wrote:
> Lennart Borgman <lennart.borgman@gmail.com> writes:
>>>> Can anyone please shed some light on this?
>>>
>>> I don't think it's an Emacs (or Windows) problem -- I same behavior when
>>> pasting that text into e.g. gnome-terminal, on Debian.
>>
>> Thanks for testing. Are you using Acrobat Reader?
>
> Nope (I used Evince, which is a Gtk app that uses poppler for PDF
> stuff), so not an Acrobat bug either.
>
> I'm wondering if it's just a wonky PDF file...
Yes, it looks like that. I asked here and got some advice on how it is
possible to further look into such problems:
http://forums.adobe.com/message/2468131#2468131
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2009-12-19 0:30 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-12-18 3:41 Clipboard madness Lennart Borgman
2009-12-18 3:58 ` Miles Bader
2009-12-18 4:00 ` Lennart Borgman
2009-12-18 4:06 ` Miles Bader
2009-12-19 0:30 ` Lennart Borgman
2009-12-18 4:22 ` Kenichi Handa
2009-12-18 8:57 ` tomas
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/emacs.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).