unofficial mirror of emacs-devel@gnu.org 
From: Reiner Steib <reinersteib+gmane@imap.cc>
Cc: Emacs development <emacs-devel@gnu.org>
Subject: Re: html2text
Date: Mon, 08 Nov 2004 16:51:34 +0100
Message-ID: <v9bre8qqyh.fsf@marauder.physik.uni-ulm.de>
In-Reply-To: <mzxv9dyj.fsf@blue.sea.net> (Jari Aalto's message of "Sat, 06 Nov 2004 17:47:32 +0200")

On Sat, Nov 06 2004, Jari Aalto+mail.emacs wrote:

> This is your copy. Article has been posted to the newsgroup(s).

I didn't see your message on emacs-devel, see
<URL:http://thread.gmane.org/1099247139.071920.12084.nullmailer@Update.UU.SE>.

> * Sun 2004-10-31 Alfred Szmidt <ams AT kemisten.nu> gmane.emacs.devel
> * Message-Id: 1099247139.071920.12084.nullmailer AT Update.UU.SE
> | html2text is quite nice, but it doesn't strip all HTML files into
> | something that is readable.  The following patch makes it strip some
> | "newer" tags that have cropped up.
>
> There are more entities. This patch is against the Gnus CVS, but I
> assume it will work for Emacs as well. The entities are in
> alphabetical order.
>
> 2004-11-06 Sat  Jari Aalto  <jari dot aalto A T cante dot net>
>
>         * html2text.el (html2text-replace-list): Added more HTML 4.0
>         entities.

It seems you have signed papers for Emacs as you are listed in the
AUTHORS file.  But I can't check it myself.  Could you please confirm?

[ The suggested patch from Jari's original message was: ]

--8<---------------cut here---------------start------------->8---
--- html2text.el.7.10	2004-11-06 17:20:46.000000000 +0200
+++ html2text.el	2004-11-06 17:41:12.000000000 +0200
@@ -42,8 +42,42 @@
 (defvar html2text-format-single-element-list '(("hr" . html2text-clean-hr)))

 (defvar html2text-replace-list
-  '(("&nbsp;" . " ") ("&gt;" . ">") ("&lt;" . "<") ("&quot;" . "\"")
-    ("&amp;" . "&") ("&apos;" . "'"))
+  '(("&acute;" . "`")
+    ("&amp;" . "&")
+    ("&apos;" . "'")
+    ("&brvbar;" . "|")
+    ("&cent;" . "c")
+    ("&circ;" . "^")
+    ("&copy;" . "(C)")
+    ("&curren;" . "¤")
+    ("&deg;" . "degree")
+    ("&divide;" . "/")
+    ("&euro;" . "e")
+    ("&frac12;" . "½")
+    ("&gt;" . ">")
+    ("&iquest;" . "?")
+    ("&laquo;" . "<<")
+    ("&ldquo;" . "\"")
+    ("&lsaquo;" . "(")
+    ("&lsquo;" . "`")
+    ("&lt;" . "<")
+    ("&mdash;" . "--")
+    ("&nbsp;" . " ")
+    ("&ndash;" . "-")
+    ("&permil;" . "%%")
+    ("&plusmn;" . "+-")
+    ("&pound;" . "£")
+    ("&quot;" . "\"")
+    ("&raquo;" . ">>")
+    ("&rdquo;" . "\"")
+    ("&reg;" . "(R)")
+    ("&rsaquo;" . ")")
+    ("&rsquo;" . "'")
+    ("&sect;" . "§")
+    ("&sup1;" . "^1")
+    ("&sup2;" . "^2")
+    ("&sup3;" . "^3")
+    ("&tilde;" . "~"))
   "The map of entity to text.
--8<---------------cut here---------------end--------------->8---
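
[ For readers unfamiliar with this kind of table, here is a minimal sketch
  of how an entity-to-text alist could be applied, assuming the entries are
  processed one after another in list order.  The names `my-entity-map' and
  `my-replace-entities' are hypothetical and not part of html2text.el; the
  actual html2text code may consume the list differently. ]

```elisp
;; Hypothetical illustration only: neither name exists in html2text.el.
(defvar my-entity-map
  '(("&lt;" . "<")
    ("&gt;" . ">")
    ("&mdash;" . "--")
    ;; "&amp;" is deliberately last, see the note after the code.
    ("&amp;" . "&"))
  "Small sample alist mapping HTML entities to plain text.")

(defun my-replace-entities (string)
  "Return STRING with every entity in `my-entity-map' replaced."
  (with-temp-buffer
    (insert string)
    ;; Walk the alist in order; each entry gets one full pass over the text.
    (dolist (pair my-entity-map)
      (goto-char (point-min))
      ;; `search-forward' matches literally, so no regexp quoting is needed.
      (while (search-forward (car pair) nil t)
        ;; FIXEDCASE and LITERAL are both t: insert the text verbatim.
        (replace-match (cdr pair) t t)))
    (buffer-string)))

(my-replace-entities "1 &lt; 2 &amp;&amp; x &mdash; y")
;; => "1 < 2 && x -- y"
```

In this sketch "&amp;" comes last so that input such as "&amp;lt;" is
decoded only once, to "&lt;".  With a strictly alphabetical list and the
same one-pass-per-entry loop, "&amp;" would run before "&lt;" and such
input would end up as "<".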

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo---  |  PGP key available  |  http://rsteib.home.pages.de/

