From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Juanma Barranquero Newsgroups: gmane.emacs.devel Subject: Re: Problem with national characters in XHTML Date: Wed, 28 Sep 2005 12:44:59 +0200 Message-ID: References: <14e4cba14e7621.14e762114e4cba@net.lu.se> Reply-To: Juanma Barranquero NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1127910246 6495 80.91.229.2 (28 Sep 2005 12:24:06 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 28 Sep 2005 12:24:06 +0000 (UTC) Cc: "emacs-devel@gnu.org" Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Sep 28 14:24:03 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EKaxw-0007io-Up for ged-emacs-devel@m.gmane.org; Wed, 28 Sep 2005 14:23:17 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EKaxw-0001CT-2B for ged-emacs-devel@m.gmane.org; Wed, 28 Sep 2005 08:23:16 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1EKZUY-0007gy-2G for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:48:50 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1EKZUW-0007g2-4z for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:48:48 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1EKZUT-0007ft-BT for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:48:47 -0400 Original-Received: from [64.233.182.193] (helo=nproxy.gmail.com) by monty-python.gnu.org with esmtp (Exim 4.34) id 1EKZQr-0005kF-H2 for emacs-devel@gnu.org; Wed, 28 Sep 2005 06:45:01 -0400 Original-Received: by nproxy.gmail.com with SMTP id x37so506334nfc for ; Wed, 28 Sep 2005 03:44:59 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=uD9MmSr+ZYcWPEyt+w5HOsRiUPRvZwuow3NSg8h3YZUcMcuaMQjjU+ntdDFEFAW3w+eOsy28zsxCxalmA59KwplDp3avS8nq4VUIYGgVG7JD8BW1LSUnO++qPjkrDiLW7WMhgbEsF/ZxlKN3cbODmF9LQmHpHJrAbwB0uDykd0M= Original-Received: by 10.48.3.12 with SMTP id 12mr596610nfc; Wed, 28 Sep 2005 03:44:59 -0700 (PDT) Original-Received: by 10.48.157.10 with HTTP; Wed, 28 Sep 2005 03:44:59 -0700 (PDT) Original-To: LENNART BORGMAN In-Reply-To: <14e4cba14e7621.14e762114e4cba@net.lu.se> Content-Disposition: inline X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:43322 Archived-At: On 9/28/05, LENNART BORGMAN wrote: > I have run into a problem with swedish national characters in an XHTML do= cument. The header of the document is like this: > > > "http://www.w3.org/TR/REC-html40/loose.dtd"> > > > The swedish character =E4 looks like \344 in CVS Emacs (2005-09-23). Hmm. An XHTML document with encoding=3D"utf-8" should not have "swedish national characters" in it, should it? Upon reading the file, Emacs will set its coding system to mule-utf-8, so it's no surprise than high-bit, non-valid utf8 byte sequences appear as \xxx... I've created a document with your header, and put an "=C9" in it with notepad. Emacs shows this char as \311. I would not consider this an error :) -- /L/e/k/t/u