From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.help Subject: Re: wikipedia and UTF8 coding (emacs21) Date: Tue, 22 Nov 2005 14:40:26 -0500 Organization: Bell Sympatico Message-ID: <87mzjwfp92.fsf-monnier+gnu.emacs.help@gnu.org> References: <87oe51vlv7.fsf@mat.ucm.es> <87irv817uz.fsf@mat.ucm.es> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1132689893 15262 80.91.229.2 (22 Nov 2005 20:04:53 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 22 Nov 2005 20:04:53 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Nov 22 21:04:51 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EeeM9-0006BX-Qr for geh-help-gnu-emacs@m.gmane.org; Tue, 22 Nov 2005 21:03:10 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EeeM9-00007m-18 for geh-help-gnu-emacs@m.gmane.org; Tue, 22 Nov 2005 15:03:09 -0500 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!postnews.google.com!news4.google.com!border1.nntp.dca.giganews.com!nntp.giganews.com!wns14feed!worldnet.att.net!207.35.177.252!nf3.bellglobal.com!nf1.bellglobal.com!nf2.bellglobal.com!news20.bellglobal.com.POSTED!not-for-mail Original-Newsgroups: gnu.emacs.help User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (gnu/linux) Cancel-Lock: sha1:2Yx6pLCcFZfd706/sUuGIdEsKNc= Original-Lines: 21 Original-NNTP-Posting-Host: 67.71.33.38 Original-X-Complaints-To: abuse@sympatico.ca Original-X-Trace: news20.bellglobal.com 1132688426 67.71.33.38 (Tue, 22 Nov 2005 14:40:26 EST) Original-NNTP-Posting-Date: Tue, 22 Nov 2005 14:40:26 EST Original-Xref: shelby.stanford.edu gnu.emacs.help:135658 Original-To: help-gnu-emacs@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:31261 Archived-At: Reiner> I'd guess that Uwe prevailingly needs Spanish and German Reiner> non-ASCII characters. Those are sufficiently supported by Reiner> already in Emacs 21 with (add-to-list Reiner> 'file-coding-system-alist '("/wikipedia/" utf-8)) as Reiner> suggested by Stefan. With this entry every file in Reiner> .../wikipedia/ is de- and encoded with UTF-8. > Honestly I don't know enough of wiki(pedia) but I am oops sure/not > sure that the coding is more general. I tried to convert the document > using iconv to iso8859-1 and it failed. I have seen in those documents > Hebrew, Cyrillic and other codings, so that might be "true" utf-8. Emacs-21.4's handling of UTF-8 takes care of significantly more than just latin-1. What Reiner was saying is that you probably only really care about the latin-1 chars (which will assuredly be displayed properly) and don't mind if Emacs doesn't properly display some of the other characters, as long as it does preserve them across load&save (it will display them as an empty box if your unicode font doesn't have them). Stefan