From: Reiner Steib <4.uce.03.r.s@nurfuerspam.de>
Newsgroups: gmane.emacs.help
Subject: Re: Turning HTML character references into something readable?
Date: Mon, 28 Apr 2003 15:02:48 +0200
Organization: Dept. of Theoretical Physics, University of Ulm
Reply-To: reiner.steib@gmx.de
User-Agent: Gnus/5.09002 (Oort Gnus v0.20) Emacs/21.3 (gnu/linux)

On Sun, Apr 27 2003, Karl Eichwalder wrote:

> Benjamin Riefenstahl writes:
>
>> Actually that literal seems to be in some JIS encoding on my side,

Same here:

,----[ `C-u C-x =' ]
|   character: [ removed "mirrored `R'" ] (0151701, 54209, 0xd3c1)
|     charset: japanese-jisx0208 (JISX0208.1983/1990 Japanese Kanji: ISO-IR-87)
|  code point: 39 65
|      syntax: word
|    category: Y:Cyrillic characters of 2-byte character sets j:Japanese
|              |:While filling, we can break a line at this character.
| buffer code: 0x92 0xA7 0xC1
|   file code: ESC 24 42 27 41 (encoded by coding system iso-2022-jp-2)
|        font: -Misc-Fixed-Medium-R-Normal--14-130-75-75-C-140-JISX0208.1983-0
`----

What does `C-u C-x =' say on that character before sending?

>> while Я indicates Unicode.
>
> Gnus decided to turn it into JIS; initially it was Unicode/UTF-8.

I don't think that Gnus is able to convert UTF-8 to JIS.  Running
`find-coding-systems-region' in your message shows that Emacs 21.3
doesn't list any UTF coding-system.  This is basically what Gnus does
in the function `mm-find-mime-charset-region' in `mm-util.el'.

>> (char-to-string (decode-char 'ucs 1071))

When I insert this char into the buffer...

(insert (char-to-string (decode-char 'ucs 1071))); Я

... and use...

(setq mm-coding-system-priorities nil) ;; default

...
I get iso-8859-5.  With my setting of...

(setq mm-coding-system-priorities
      '(iso-latin-1 iso-latin-9 mule-utf-8))

... I get utf-8.

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo--- PGP key available via WWW   http://rsteib.home.pages.de/