unofficial mirror of emacs-devel@gnu.org 
* html2text
From: Alfred M. Szmidt @ 2004-10-31 18:25 UTC


html2text is quite nice, but it doesn't turn every HTML file into
something readable.  The following patch makes it handle some "newer"
tags that have cropped up (<strong> and <em>).  It still doesn't make
things as nice as they could be, though: tables and comments are
still left intact.

I guess a better way to do this would be to convert all known tags to
something nice, and then just strip whatever tags are left.
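
The two-pass idea above (convert known tags first, then strip the
rest) could be sketched roughly as follows.  This is only an
illustration and is not part of the patch below or of html2text.el:
the names `my-html-known-tags' and `my-html-to-text', and the choice
of "*", "/" and "_" as markers, are assumptions made for the example.

```elisp
;; Hypothetical sketch; these names do not exist in html2text.el.
(defvar my-html-known-tags
  '(("b" . "*") ("strong" . "*")
    ("i" . "/") ("em" . "/")
    ("u" . "_"))
  "Alist mapping a tag name to the marker replacing its open and close tags.")

(defun my-html-to-text (html)
  "Return HTML as plain text: convert known tags, then strip the rest."
  (with-temp-buffer
    (insert html)
    (let ((case-fold-search t))
      ;; Pass 1: replace each known opening or closing tag with its marker.
      ;; The tag name must be followed by whitespace or ">", so "b" does
      ;; not match "<blockquote>".
      (dolist (entry my-html-known-tags)
        (goto-char (point-min))
        (while (re-search-forward
                (concat "</?" (regexp-quote (car entry))
                        "\\(?:[ \t\n][^>]*\\)?>")
                nil t)
          (replace-match (cdr entry) t t)))
      ;; Pass 2a: delete comments, including an unterminated one.
      (goto-char (point-min))
      (while (search-forward "<!--" nil t)
        (let ((start (match-beginning 0)))
          (search-forward "-->" nil 'move)
          (delete-region start (point))))
      ;; Pass 2b: delete every tag that is still left.
      (goto-char (point-min))
      (while (re-search-forward "<[^>]*>" nil t)
        (replace-match "" t t)))
    (buffer-string)))
```

With these assumed markers,
(my-html-to-text "<p>Hello <strong>world</strong><!-- x --></p>")
returns "Hello *world*".  Table markup is simply dropped in pass 2b, so
cell text runs together; handling tables nicely would need a converter
of its own.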

diff -ur html2text.el html2text.el.new
--- html2text.el	2004-10-31 19:23:06.000000000 +0100
+++ html2text.el.new	2004-10-31 19:23:46.000000000 +0100
@@ -75,8 +75,10 @@
 
 (defvar html2text-format-tag-list
   '(("b" 	  . html2text-clean-bold)
+    ("strong"     . html2text-clean-bold)
     ("u" 	  . html2text-clean-underline)
     ("i" 	  . html2text-clean-italic)
+    ("em"         . html2text-clean-italic)
     ("blockquote" . html2text-clean-blockquote)
     ("a"          . html2text-clean-anchor)
     ("ul"         . html2text-clean-ul)

Diff finished.  Sun Oct 31 19:23:56 2004


Thread overview: 11+ messages
2004-10-31 18:25 html2text Alfred M. Szmidt
2004-11-01 11:55 ` html2text Reiner Steib
2004-11-01 19:21   ` html2text Alfred M. Szmidt
2004-11-02  4:46   ` html2text Katsumi Yamaoka
2004-11-02  9:22     ` html2text Reiner Steib
2004-11-02 11:59       ` html2text Katsumi Yamaoka
2004-11-02 14:12         ` html2text Reiner Steib
     [not found] ` <mzxv9dyj.fsf@blue.sea.net>
2004-11-08 15:51   ` html2text Reiner Steib
2004-11-08 18:02     ` html2text David Kastrup
2004-11-09 22:44     ` html2text Reiner Steib
2004-11-15  8:31     ` html2text Jari Aalto
