From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Jason Rumney Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs Subject: Re: 23.0.60; [nxml] BOM and utf-8 Date: Sun, 18 May 2008 09:56:18 +0100 Message-ID: <482FEF32.2080300@gnu.org> References: <87od75kt78.fsf@pdrechsler.de> <87mymofip6.fsf@uwakimon.sk.tsukuba.ac.jp> <878wy8ny36.fsf@catnip.gol.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1211101018 15013 80.91.229.12 (18 May 2008 08:56:58 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 18 May 2008 08:56:58 +0000 (UTC) Cc: emacs-pretest-bug@gnu.org, stephen@xemacs.org, patrick@pdrechsler.de, Miles Bader To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun May 18 10:57:34 2008 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1Jxehv-00045N-V3 for ged-emacs-devel@m.gmane.org; Sun, 18 May 2008 10:57:32 +0200 Original-Received: from localhost ([127.0.0.1]:53460 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JxehC-0000NI-7V for ged-emacs-devel@m.gmane.org; Sun, 18 May 2008 04:56:46 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1Jxeh5-0000Mv-Rp for emacs-devel@gnu.org; Sun, 18 May 2008 04:56:39 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1Jxeh3-0000M8-3A for emacs-devel@gnu.org; Sun, 18 May 2008 04:56:38 -0400 Original-Received: from [199.232.76.173] (port=56763 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Jxeh2-0000Lx-Mi for emacs-devel@gnu.org; Sun, 18 May 2008 04:56:36 -0400 Original-Received: from fencepost.gnu.org ([140.186.70.10]:49543) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1Jxeh2-00007X-D2 for emacs-devel@gnu.org; Sun, 18 May 2008 04:56:36 -0400 Original-Received: from mx10.gnu.org ([199.232.76.166]:47537) by fencepost.gnu.org with esmtp (Exim 4.67) (envelope-from ) id 1Jxefs-0006S8-Tm for emacs-pretest-bug@gnu.org; Sun, 18 May 2008 04:55:24 -0400 Original-Received: from Debian-exim by monty-python.gnu.org with spam-scanned (Exim 4.60) (envelope-from ) id 1Jxegy-00006o-JN for emacs-pretest-bug@gnu.org; Sun, 18 May 2008 04:56:36 -0400 Original-Received: from mk-outboundfilter-4.mail.uk.tiscali.com ([212.74.114.32]:21290) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1Jxegt-00006D-Q0; Sun, 18 May 2008 04:56:27 -0400 Original-X-Trace: 82306179/mk-outboundfilter-2.mail.uk.tiscali.com/F2S/$ACCEPTED/freedom2Surf-customers/83.67.23.108 X-SBRS: None X-RemoteIP: 83.67.23.108 X-IP-MAIL-FROM: jasonr@gnu.org X-IP-BHB: Once X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApsEAP6LL0hTQxds/2dsb2JhbACBVaoO X-IronPort-AV: E=Sophos;i="4.27,503,1204502400"; d="scan'208";a="82306179" X-IP-Direction: IN Original-Received: from i-83-67-23-108.freedom2surf.net (HELO wanchan.jasonrumney.net) ([83.67.23.108]) by smtp.f2s.tiscali.co.uk with ESMTP; 18 May 2008 09:56:26 +0100 Original-Received: from [192.168.249.27] (chiko.jasonrumney.net [192.168.249.27]) by wanchan.jasonrumney.net (Postfix) with ESMTP id DCB6715B9; Sun, 18 May 2008 09:56:25 +0100 (BST) User-Agent: Thunderbird 2.0.0.14 (Windows/20080421) In-Reply-To: X-Enigmail-Version: 0.95.6 OpenPGP: id=8086879D X-detected-kernel: by monty-python.gnu.org: Genre and OS details not recognized. X-detected-kernel: by monty-python.gnu.org: Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:97354 gmane.emacs.pretest.bugs:22372 Archived-At: Eli Zaretskii wrote: > I'm not sure you are barking the right tree. AFAIK, Microsoft doesn't > use UTF-8 at all, they use UTF-16 (where a BOM is generally > necessary). > What Miles is talking about is certain Microsoft software (including their XML library), which when saving to UTF-8 writes a UTF-8 encoded 0xFEFF at the start of the file. Its probably caused by first encoding in UTF-16 then transcoding to UTF-8.