From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: tomas@tuxteam.de (Tomas Zerolo) Newsgroups: gmane.emacs.devel Subject: Re: Problem with national characters in XHTML Date: Wed, 28 Sep 2005 12:41:09 +0200 Message-ID: <20050928104109.GB8332@www.trapp.net> References: <14e4cba14e7621.14e762114e4cba@net.lu.se> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0266843128==" X-Trace: sea.gmane.org 1127910615 7646 80.91.229.2 (28 Sep 2005 12:30:15 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 28 Sep 2005 12:30:15 +0000 (UTC) Cc: "emacs-devel@gnu.org" Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Sep 28 14:30:12 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EKb3g-00013J-Nt for ged-emacs-devel@m.gmane.org; Wed, 28 Sep 2005 14:29:13 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EKb3e-00028Q-Gq for ged-emacs-devel@m.gmane.org; Wed, 28 Sep 2005 08:29:10 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1EKZXL-00007V-6U for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:51:43 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1EKZXE-000053-Nv for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:51:37 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EKZUv-0007ft-E0 for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:49:13 -0400 Original-Received: from [217.22.192.104] (helo=www.elogos.de) by monty-python.gnu.org with esmtp (Exim 4.34) id 1EKZPa-0005Y2-Fx for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:43:42 -0400 Original-Received: from www.elogos.de (localhost [127.0.0.1]) by www.elogos.de (Postfix) with ESMTP id D5D17DB046; Wed, 28 Sep 2005 12:41:09 +0200 (CEST) Original-Received: by www.elogos.de (Postfix, from userid 4000) id C260CDB047; Wed, 28 Sep 2005 12:41:09 +0200 (CEST) Original-To: LENNART BORGMAN In-Reply-To: <14e4cba14e7621.14e762114e4cba@net.lu.se> User-Agent: Mutt/1.5.6+20040907i X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:43323 Archived-At: --===============0266843128== Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="H1spWtNR+x+ondvy" Content-Disposition: inline --H1spWtNR+x+ondvy Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Wed, Sep 28, 2005 at 10:29:21AM +0200, LENNART BORGMAN wrote: > I have run into a problem with swedish national characters in an XHTML do= cument. The header of the document is like this: >=20 > > "http://www.w3.org/TR/REC-html40/loose.dtd"> > Hm. Note that the header says of itself that it's encoded in utf-8. I don't know whether it's relevant. > The swedish character =E4 looks like \344 in CVS Emacs (2005-09-23). If Emacs honors the header above, then this won't work: Octal 344 is an a-with-dieresis, but in iso 8859-1 encoding, not utf-8. > It looks ok in Internet Explorer, but not in Firefox. I'd say Firefox is right on this one ;-) Seriously: you can force the browser to assume an encoding, so what the browser shows depends on settings which may vary from time to time. On Firefox, it's under View -> Character Encoding. No idea about IE (and I'm glad not to know ;-). > Looking at the > file with Notepad also shows the swedish characters as expected. Notepad uses whatever encoding its font has; i guess an 8-bit fixed encoding. > I would be glad for some hints and pointers! I am using nxml-mode if > that matters here. You may try two things: changing the utf-8 in the header to iso-8859-1 or (better) insert your a-dieresis as an utf8-encoded char. Regards -- tom=E1s --H1spWtNR+x+ondvy Content-Type: application/pgp-signature; name="signature.asc" Content-Description: Digital signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.5 (GNU/Linux) iD8DBQFDOnNFBcgs9XrR2kYRAj7zAJ96HrQpTqYafdwTbTGjLsznhbzLogCfYpjO 2g1uq67k912Fby51F5+mgkQ= =rPLl -----END PGP SIGNATURE----- --H1spWtNR+x+ondvy-- --===============0266843128== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Emacs-devel mailing list Emacs-devel@gnu.org http://lists.gnu.org/mailman/listinfo/emacs-devel --===============0266843128==--