unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
* When is a text file not a text file?
@ 2004-01-02 17:24 sebyte
  2004-01-09  9:43 ` Oliver Scholz
  0 siblings, 1 reply; 4+ messages in thread
From: sebyte @ 2004-01-02 17:24 UTC (permalink / raw)


Hi all,

I often use the command 'html2text <htmlfile>' in a *shell* buffer to view html 
files, and nine times out of ten the file is displayed beautifully, (with all 
the html tags removed etc).  However, the command 'html2text -o <newfilename> 
<htmlfile>', writes a file to disk which when opened in an Emacs buffer, (or any 
text editor for that matter), displays more formatting tags than actual text! 
In fact, it appears as if only some of the text is to be found buried amongst 
all the tags!  Yet returning to the *shell* buffer and issuing the command 'cat 
<newfilename>' diplays the file as it is meant to be seen once more.

No doubt there is a simple explanation for this, but damned if I know where to 
even start!  I don't believe it as an html2text issue as I have observed similar 
behaviour before with html files I've downloaded.

TIA for any explanations of what's actually going on here.

sebyte

P.S. html2text is available through Fink, (for potentially interested OS X users 
out there).

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2004-01-09 18:48 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-01-02 17:24 When is a text file not a text file? sebyte
2004-01-09  9:43 ` Oliver Scholz
2004-01-09 18:17   ` sebyte
2004-01-09 18:48     ` Oliver Scholz

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).