From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Piet van Oostrum Newsgroups: gmane.emacs.devel Subject: Re: Problem with national characters in XHTML Date: Thu, 29 Sep 2005 15:34:05 +0200 Message-ID: References: <14e4cba14e7621.14e762114e4cba@net.lu.se> <433AA30F.8050203@student.lu.se> <433AEB2D.7070906@student.lu.se> <20050929084322.GA16219@www.trapp.net> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1128001639 15205 80.91.229.2 (29 Sep 2005 13:47:19 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 29 Sep 2005 13:47:19 +0000 (UTC) Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Sep 29 15:47:11 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EKyjn-0008Gn-BS for ged-emacs-devel@m.gmane.org; Thu, 29 Sep 2005 15:46:15 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EKyjm-0008Mx-Mc for ged-emacs-devel@m.gmane.org; Thu, 29 Sep 2005 09:46:14 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1EKycR-0005kV-Kd for emacs-devel@gnu.org; Thu, 29 Sep 2005 09:38:39 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1EKycM-0005ix-Qh for emacs-devel@gnu.org; Thu, 29 Sep 2005 09:38:38 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EKycM-0005fO-HF for emacs-devel@gnu.org; Thu, 29 Sep 2005 09:38:34 -0400 Original-Received: from [131.211.80.10] (helo=mail.cs.uu.nl) by monty-python.gnu.org with esmtp (Exim 4.34) id 1EKyY4-0008KU-7a for emacs-devel@gnu.org; Thu, 29 Sep 2005 09:34:08 -0400 Original-Received: from mail.cs.uu.nl (localhost.localdomain [127.0.0.1]) by mail.cs.uu.nl (Postfix) with ESMTP id 56B18A35F1 for ; Thu, 29 Sep 2005 15:34:07 +0200 (CEST) Original-Received: from ohana.local (ohana.cs.uu.nl [131.211.81.32]) by mail.cs.uu.nl (Postfix) with ESMTP id 3969DA35EE for ; Thu, 29 Sep 2005 15:34:07 +0200 (CEST) X-Mailer: emacs 22.0.50.3 (via feedmail 8 I) Original-To: emacs-devel@gnu.org In-Reply-To: <20050929084322.GA16219@www.trapp.net> (Tomas Zerolo's message of "Thu, 29 Sep 2005 10:43:22 +0200") User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (darwin) X-AV-Checked: ClamAV using ClamSMTP at cs.uu.nl X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:43355 Archived-At: >>>>> tomas@tuxteam.de (Tomas Zerolo) (TZ) wrote: >TZ> Ah. You have to distinguish between Emacs's internal representation >TZ> (that's possibly the 2276 you mention), which doesn't change (al least >TZ> unless you try hard ;) and what is in the file (how Emacs writes or >TZ> interprets what it reads). You can change those things changing the >TZ> coding system (look for something like `multilingual environment'). By default Emacs uses different internal representations for the "same" character in different coding systems. So a iso-8859-1 "=C3=A4" is a differ= ent thing than a utf-8 "=C3=A4". This difference will disappear when Emacs swit= ches to Unicode internally. For the time being the OP could use Unicode unification, if his Emacs version is young enough. I have used this for some years now without any problems. Maybe it solves the original problem. (require 'ucs-tables) (unify-8859-on-encoding-mode 1) (unify-8859-on-decoding-mode 1) --=20 Piet van Oostrum URL: http://www.cs.uu.nl/~piet [PGP 8DAE142BE17999C4] Private email: piet@vanoostrum.org