From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.help Subject: Re: wikipedia and UTF8 coding (emacs21) Date: Tue, 22 Nov 2005 14:46:52 -0500 Organization: Bell Sympatico Message-ID: <87hda4fp0n.fsf-monnier+gnu.emacs.help@gnu.org> References: <87oe51vlv7.fsf@mat.ucm.es> <87br11y9ta.fsf-monnier+gnu.emacs.help@gnu.org> <87fyqdmuxo.fsf@mat.ucm.es> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1132690157 16248 80.91.229.2 (22 Nov 2005 20:09:17 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 22 Nov 2005 20:09:17 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Nov 22 21:09:09 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EeePD-0000G4-Cq for geh-help-gnu-emacs@m.gmane.org; Tue, 22 Nov 2005 21:06:19 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EeePC-0001u9-OL for geh-help-gnu-emacs@m.gmane.org; Tue, 22 Nov 2005 15:06:18 -0500 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!headwall.stanford.edu!newsfeed.news2me.com!nx01.iad01.newshosting.com!newshosting.com!novia!nf3.bellglobal.com!nf1.bellglobal.com!nf2.bellglobal.com!news20.bellglobal.com.POSTED!not-for-mail Original-Newsgroups: gnu.emacs.help User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (gnu/linux) Cancel-Lock: sha1:jjP5ufBrCIQnWnI4sdY/+jAiGGo= Original-Lines: 25 Original-NNTP-Posting-Host: 67.71.33.38 Original-X-Complaints-To: abuse@sympatico.ca Original-X-Trace: news20.bellglobal.com 1132688812 67.71.33.38 (Tue, 22 Nov 2005 14:46:52 EST) Original-NNTP-Posting-Date: Tue, 22 Nov 2005 14:46:52 EST Original-Xref: shelby.stanford.edu gnu.emacs.help:135659 Original-To: help-gnu-emacs@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:31262 Archived-At: > I presume you don't a difference in > Brüder and Brüder ^^^^^^^ This shows that the utf-8 format was not treated as such and instead Emacs read the file as if it were encoded in latin-1 or binary. > I did your C-u C-x = which tells me: > character: à (0303, 195, 0xc3) > charset: eight-bit-graphic (8-bit graphic char (0xA0..0xFF)) ^^^^^^^^^^^^^^^^^ This means "unknown char, thus treated as a byte". If you tell your Emacs to use utf-8 for this file (try C-x RET u utf-8 RET before C-x C-f, or set file-coding-system-alist so it's done automatically everytime), your char's charset should be either latin-8859-1 or mule-unicode-nnnn. All the above assumes you're not using Emacs in unibyte mode. Unibyte mode (which you sadly get if you have a lingering `special-display-european' in your .emacs) can't handle UTF-8. Stefan