unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
* How to convert .doc to txt
@ 2003-10-30 18:02 Don Saklad
  2003-10-30 18:28 ` roodwriter
                   ` (2 more replies)
  0 siblings, 3 replies; 7+ messages in thread
From: Don Saklad @ 2003-10-30 18:02 UTC (permalink / raw)


In rmail in emacs, how do you convert .doc to txt ?...

It converted to pdf
using the free thing at
http://preprints.cern.ch/Convert?emailGuide
but that new pdf wouldn't convert to ascii plain text.


Nor did that new pdf convert to ascii plain text using the free thing at
http://www.adobe.com/products/acrobat/access_onlinetools.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: How to convert .doc to txt
  2003-10-30 18:02 How to convert .doc to txt Don Saklad
@ 2003-10-30 18:28 ` roodwriter
  2003-11-11 14:14 ` Alex Ott
  2003-11-11 17:00 ` Burton Samograd
  2 siblings, 0 replies; 7+ messages in thread
From: roodwriter @ 2003-10-30 18:28 UTC (permalink / raw)


Don Saklad wrote:

> In rmail in emacs, how do you convert .doc to txt ?...
> 
> It converted to pdf
> using the free thing at
> http://preprints.cern.ch/Convert?emailGuide
> but that new pdf wouldn't convert to ascii plain text.
> 
> 
> Nor did that new pdf convert to ascii plain text using the free thing at
> http://www.adobe.com/products/acrobat/access_onlinetools.html


I should start this out with the caveat that for some reason I can't receive 
e-mail on Emacs. But for what it's worth.

Emacs doesn't have a built-in function to do this, sadly. For converting 
.docs to text I use antiword. (I have a Linux box.) I save the .doc from my 
regular e-mail program, go into Dired, and use the ! command to use 
antiword. This translates the .doc into text in its own window. I then copy 
it and paste it into another buffer.

I don't do this by hand anymore. I made a macro to do all this and bound a 
shortcut key sequence to it.

Linux has several other ways to convert .docs to other things. Word2x 
(uncapitalized) converts Word 6 files to LaTeX or text. wvWare (that's how 
it's capitalized) converts to HTML. 

There are more.

Be aware that none are perfect and there are some variations in the text 
output. These are the ones I thought worked best. Best thing to do is 
experiment.

I do what I can do avoid Word files. Whenever someone sends me one I 
politely point out that not everyone has Word on their machine or will go 
out and buy it just to read a few lines of text. I also note that plain 
text, which is just a simple cutting and pasting job, can be read by 
everyone at no risk. Many people avoid attachments because that's where the 
viruses live.

It can also be easier for the sender because with plain text they're not 
tempted to prettify what they're writing. If you need pretty there are PDFs 
and HTMLs.

I've found that if you're nice and reasonable about it, most people will 
comply. They had just never thought about it. I've even had a few thank me. 
Only one has refused outright not to do it. Says she doesn't have the time. 
I can't make her understand it probably takes less time since she won't be 
wasting it on formatting.

So about twice a month I get a .doc file from her. The others I get are from 
people sending me something for the first time or from people that just 
forget.

The Free Software Foundation has a suggested text you can send back, but I 
think it's too strong. Worth reading for ideas, though.

Hope this helps.

--Rod

-- 
Author of "Linux for Non-Geeks--Clear-eyed Answered for Practical Consumers" 
and "Boring Stories from Uncle Rod." Both are available at 
http://www.rodwriterpublishing.com/index.html

To reply by e-mail, take the extra "o" out of the name.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: How to convert .doc to txt
  2003-10-30 18:02 How to convert .doc to txt Don Saklad
  2003-10-30 18:28 ` roodwriter
@ 2003-11-11 14:14 ` Alex Ott
  2003-11-11 15:10   ` Bruce Ingalls
  2003-11-11 17:00 ` Burton Samograd
  2 siblings, 1 reply; 7+ messages in thread
From: Alex Ott @ 2003-11-11 14:14 UTC (permalink / raw)


>>>>> "DS" == Don Saklad writes:
 DS> In rmail in emacs, how do you convert .doc to txt ?...

You can use catdoc programm as converter .doc, .xls, .rtf to txt

-- 
With best wishes, Alex Ott	http://xtalk.msk.su/~ott/

The solution to a problem changes the nature of the problem.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: How to convert .doc to txt
  2003-11-11 14:14 ` Alex Ott
@ 2003-11-11 15:10   ` Bruce Ingalls
  2003-11-11 17:27     ` Thien-Thi Nguyen
  0 siblings, 1 reply; 7+ messages in thread
From: Bruce Ingalls @ 2003-11-11 15:10 UTC (permalink / raw)


Alex Ott wrote:
>>>>>>"DS" == Don Saklad writes:
> 
>  DS> In rmail in emacs, how do you convert .doc to txt ?...
> 
> You can use catdoc programm as converter .doc, .xls, .rtf to txt

In the contrib section of the http://EMacro.sf.net project, there is
elisp code to call catdoc.

There was such little interest in importing Word documents, that I
pulled out the code. Frankly, you'll get better results by launching 
Abiword or Open Office, then exporting, unless you run console mode,
only, on your incompatible box.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: How to convert .doc to txt
  2003-10-30 18:02 How to convert .doc to txt Don Saklad
  2003-10-30 18:28 ` roodwriter
  2003-11-11 14:14 ` Alex Ott
@ 2003-11-11 17:00 ` Burton Samograd
  2 siblings, 0 replies; 7+ messages in thread
From: Burton Samograd @ 2003-11-11 17:00 UTC (permalink / raw)


Don Saklad <dsaklad@nestle.ai.mit.edu> writes:

> In rmail in emacs, how do you convert .doc to txt ?...

Antiword gives good results:

http://www.winfield.demon.nl/

-- 
burton samograd
http://kruhftwerk.dyndns.org

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: How to convert .doc to txt
  2003-11-11 15:10   ` Bruce Ingalls
@ 2003-11-11 17:27     ` Thien-Thi Nguyen
  2003-11-11 19:27       ` Dan Anderson
  0 siblings, 1 reply; 7+ messages in thread
From: Thien-Thi Nguyen @ 2003-11-11 17:27 UTC (permalink / raw)


Bruce Ingalls <bingalls@nj.rr.BLOCK.spam.com> writes:

> launching Abiword
> or Open Office, then exporting, unless you run console mode

aren't those packages supposed to be scriptable?
anything scriptable can be controlled by emacs
(presuming the scriptability is rich enough).

thi

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: How to convert .doc to txt
  2003-11-11 17:27     ` Thien-Thi Nguyen
@ 2003-11-11 19:27       ` Dan Anderson
  0 siblings, 0 replies; 7+ messages in thread
From: Dan Anderson @ 2003-11-11 19:27 UTC (permalink / raw)
  Cc: help-gnu-emacs

On Tue, 2003-11-11 at 12:27, Thien-Thi Nguyen wrote:
> Bruce Ingalls <bingalls@nj.rr.BLOCK.spam.com> writes:
> 
> > launching Abiword
> > or Open Office, then exporting, unless you run console mode
> 
> aren't those packages supposed to be scriptable?
> anything scriptable can be controlled by emacs
> (presuming the scriptability is rich enough).

If you knew what you were doing you could crack open the source, make
some modifications and recompile openoffice to take command line options
like:

oofficce --convert --from ./foo.doc --to ./foo.txt

Of course, FAIK, it's already included in the code.  Check out the
documentation for OpenOffice for more details.

-Dan

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2003-11-11 19:27 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-10-30 18:02 How to convert .doc to txt Don Saklad
2003-10-30 18:28 ` roodwriter
2003-11-11 14:14 ` Alex Ott
2003-11-11 15:10   ` Bruce Ingalls
2003-11-11 17:27     ` Thien-Thi Nguyen
2003-11-11 19:27       ` Dan Anderson
2003-11-11 17:00 ` Burton Samograd

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).