From: Patrick Drechsler
Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs
Subject: Re: 23.0.60; [nxml] BOM and utf-8
Date: Sun, 18 May 2008 13:00:50 +0200
Message-ID: <87prrj98rh.fsf@pdrechsler.de>
References: <87od75kt78.fsf@pdrechsler.de> <87mymofip6.fsf@uwakimon.sk.tsukuba.ac.jp> <878wy8ny36.fsf@catnip.gol.com> <482FEF32.2080300@gnu.org>
To: emacs-devel@gnu.org
Cc: emacs-pretest-bug@gnu.org

Jason Rumney writes:

> Eli Zaretskii wrote:
>> I'm not sure you are barking up the right tree. AFAIK, Microsoft
>> doesn't use UTF-8 at all, they use UTF-16 (where a BOM is generally
>> necessary).
>
> What Miles is talking about is certain Microsoft software (including
> their XML library), which when saving to UTF-8 writes a UTF-8 encoded
> 0xFEFF at the start of the file. It's probably caused by first encoding
> in UTF-16 then transcoding to UTF-8.

This seems to be the problem in my case: a Microsoft .NET application
adds the BOM to the XML file, which is encoded as UTF-8.

Thanks to all for the instructive feedback on this issue!

I second the opinion that it would be nice to have the option to hide
(or remove) the BOM while editing the file in Emacs and to reinsert it
(if it was removed) when done editing. Otherwise one cannot validate
the rest of the XML file using nxml.

Cheers,

Patrick
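
To make the "remove while editing, reinsert when done" idea concrete, here
is a minimal sketch of that round trip in Python. It is only an
illustration of the byte-level behaviour, not how Emacs or nxml handles
coding systems; the helper names split_bom and join_bom are made up for
this example.

```python
import codecs
from typing import Tuple

# b"\xef\xbb\xbf": U+FEFF encoded as UTF-8
BOM = codecs.BOM_UTF8


def split_bom(raw: bytes) -> Tuple[bool, str]:
    """Decode UTF-8 bytes; return (had_bom, text) with a leading BOM removed."""
    had_bom = raw.startswith(BOM)
    if had_bom:
        raw = raw[len(BOM):]
    return had_bom, raw.decode("utf-8")


def join_bom(had_bom: bool, text: str) -> bytes:
    """Encode text as UTF-8, restoring the BOM only if the input had one."""
    data = text.encode("utf-8")
    return BOM + data if had_bom else data


if __name__ == "__main__":
    # Hypothetical file content as written by a BOM-emitting application.
    original = BOM + b'<?xml version="1.0" encoding="utf-8"?><doc/>'
    had_bom, text = split_bom(original)
    # The editable text now starts with the XML declaration, not U+FEFF.
    edited = text.replace("<doc/>", "<doc>edited</doc>")
    saved = join_bom(had_bom, edited)
    print(saved.startswith(BOM))
```

Remembering whether the BOM was present, rather than always adding or
always dropping it, means the application that wrote the file gets it
back in the form it produced.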