From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Uwe Brauer Newsgroups: gmane.emacs.help Subject: Re: wikipedia and UTF8 coding (emacs21) Date: Fri, 04 Nov 2005 21:22:12 +0100 Message-ID: <87irv817uz.fsf@mat.ucm.es> References: <87oe51vlv7.fsf@mat.ucm.es> Reply-To: Uwe Brauer NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1131136596 6991 80.91.229.2 (4 Nov 2005 20:36:36 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Fri, 4 Nov 2005 20:36:36 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Nov 04 21:36:35 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EY8D8-00007c-7O for geh-help-gnu-emacs@m.gmane.org; Fri, 04 Nov 2005 21:30:54 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EY8D7-0006DZ-LJ for geh-help-gnu-emacs@m.gmane.org; Fri, 04 Nov 2005 15:30:53 -0500 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!headwall.stanford.edu!newsfeed.news2me.com!newsfeed.stueberl.de!newsgate.cistron.nl!news.tele.dk!news.tele.dk!small.news.tele.dk!uio.no!quimby.gnus.org!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 72 Original-NNTP-Posting-Host: maportatil10.quim.ucm.es Original-X-Trace: quimby.gnus.org 1131135734 15014 147.96.7.218 (4 Nov 2005 20:22:14 GMT) Original-X-Complaints-To: usenet@quimby.gnus.org Original-NNTP-Posting-Date: Fri, 4 Nov 2005 20:22:14 +0000 (UTC) User-Agent: Gnus/5.110003 (No Gnus v0.3) XEmacs/21.4.17 (linux) X-Hashcash: 1:20:051104:gnu.emacs.help::zne3QOvbTLLoMBWE:0002lVm Cancel-Lock: sha1:iOXHx1OV+2U+a7/iqrZNaV2Ce90= Original-Xref: shelby.stanford.edu gnu.emacs.help:135244 Original-To: help-gnu-emacs@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:30837 Archived-At: >>>>> "Reiner" == Reiner Steib writes: Reiner> On Thu, Nov 03 2005, Peter Dyballa wrote: >> Am 03.11.2005 um 15:37 schrieb Uwe Brauer: >> >>> Has anybody experience with wikipedia article using UTF8 >>> coding. >>> >> I'm not experienced with this, but I am sure that Reiner> ^^^^^^^^^^^^^^^ ^^^^^^^^^ Reiner> Convincing. :-) Right, logic rationale and all that >> you need at least two things: >> >> 1. fonts (in X11 or such) with these glyphs Reiner> ACK >> 2. GNU Emacs 23 from CVS Reiner> I'd guess that Uwe prevailingly needs Spanish and German Reiner> non-ASCII characters. Those are sufficiently supported by Reiner> already in Emacs 21 with (add-to-list Reiner> 'file-coding-system-alist '("/wikipedia/" utf-8)) as Reiner> suggested by Stefan. With this entry every file in Reiner> .../wikipedia/ is de- and encoded with UTF-8. Honestly I don't know enough of wiki(pedia) but I am oops sure/not sure that the coding is more general. I tried to convert the document using iconv to iso8859-1 and it failed. I have seen in those documents Hebrew, Cyrillic and other codings, so that might be "true" utf-8. Uwe Reiner> For example, I wrote the following in Emacs 21 and Reiner> everything is displayed correctly: Reiner> ÄÖÜ äöü ß „Hallo Welt und Grüß Gott“ Reiner> éíúáó ÉÍÚÁÓ ñ¡¿ ¡Hola! Reiner> “Hello world” ∑ α >> GNU Emacsen 21.4 and 22 (from CVS) fail to show a particular >> glyph when it's not in the font or fontset used. I check that, but brute force emacs -font "-adobe-courier-bold-r-normal--18-180-75-75-m-110-iso10646-1" -bg grey86 \!* & works so far, so that is fine for me, right now Reiner> Maybe there's something wrong in your fontset config or your system. Could be, I will try thanks anyhow Reiner> My Emacs 21, correctly uses Reiner> -Misc-Fixed-Medium-R-SemiCondensed--13-120-75-75-C-60-ISO8859-1 for “ü” and Reiner> -Misc-Fixed-Medium-R-SemiCondensed--13-120-75-75-C-60-ISO10646-1 for “α”. PS right now I send this using xemacs no mule, hope the message does get messed up if not, I might consider any sort of mule solution in the future Uwe