unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
* Problem with national characters in XHTML
@ 2005-09-28  8:29 LENNART BORGMAN
  2005-09-28 10:19 ` Jason Rumney
                   ` (5 more replies)
  0 siblings, 6 replies; 20+ messages in thread
From: LENNART BORGMAN @ 2005-09-28  8:29 UTC (permalink / raw)


I have run into a problem with swedish national characters in an XHTML document. The header of the document is like this:

  <?xml version="1.0" encoding="utf-8"?>
  <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
            "http://www.w3.org/TR/REC-html40/loose.dtd">
  <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">

The swedish character ä looks like \344 in CVS Emacs (2005-09-23). It looks ok in Internet Explorer, but not in Firefox. Looking at the file with Notepad also shows the swedish characters as expected.

I would be glad for some hints and pointers! I am using nxml-mode if that matters here.

^ permalink raw reply	[flat|nested] 20+ messages in thread
* Re: Problem with national characters in XHTML
@ 2005-09-28 11:08 LENNART BORGMAN
  0 siblings, 0 replies; 20+ messages in thread
From: LENNART BORGMAN @ 2005-09-28 11:08 UTC (permalink / raw)


Ok, thanks for help to all that replied. I tried to learn a bit;-)

Putting iso-8859-1 in the header instead of utf-8 as Tomas Zerolo suggested solved the problem.


----- Original Message -----
From: Juanma Barranquero <lekktu@gmail.com>
Date: Wednesday, September 28, 2005 12:44 pm
Subject: Re: Problem with national characters in XHTML

> On 9/28/05, LENNART BORGMAN <lennart.borgman.073@student.lu.se> wrote:
> 
> > I have run into a problem with swedish national characters in an 
> XHTML document. The header of the document is like this:
> >
> >   <?xml version="1.0" encoding="utf-8"?>
> >   <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
> >             "http://www.w3.org/TR/REC-html40/loose.dtd">
> >   <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
> >
> > The swedish character ä looks like \344 in CVS Emacs (2005-09-23).
> 
> Hmm. An XHTML document with encoding="utf-8" should not have "swedish
> national characters" in it, should it? Upon reading the file, Emacs
> will set its coding system to mule-utf-8, so it's no surprise than
> high-bit, non-valid utf8 byte sequences appear as \xxx...
> 
> I've created a document with your header, and put an "É" in it with
> notepad. Emacs shows this char as \311. I would not consider this an
> error :)
> 
> --
>                    /L/e/k/t/u
>

^ permalink raw reply	[flat|nested] 20+ messages in thread

end of thread, other threads:[~2005-10-01 11:22 UTC | newest]

Thread overview: 20+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-09-28  8:29 Problem with national characters in XHTML LENNART BORGMAN
2005-09-28 10:19 ` Jason Rumney
2005-09-28 10:22 ` David Hansen
2005-09-28 10:22 ` Paul Pogonyshev
2005-09-28 10:41 ` Tomas Zerolo
2005-09-28 10:44 ` Juanma Barranquero
2005-09-29 11:11   ` Mathias Dahl
2005-09-29 13:28     ` Piet van Oostrum
2005-09-29 13:52       ` Lennart Borgman
2005-09-28 11:09 ` Kenichi Handa
2005-09-28 14:05   ` Lennart Borgman
2005-09-28 19:12     ` Lennart Borgman
2005-09-29  8:43       ` Tomas Zerolo
2005-09-29 13:34         ` Piet van Oostrum
2005-09-29 14:02           ` Lennart Borgman
2005-09-30 22:15             ` Piet van Oostrum
2005-09-30 23:02               ` Lennart Borgman
2005-10-01  4:29                 ` Tomas Zerolo
2005-10-01 11:22                 ` Piet van Oostrum
  -- strict thread matches above, loose matches on Subject: below --
2005-09-28 11:08 LENNART BORGMAN

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).