From: Jason Rumney
Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs
Subject: Re: 23.0.60; [nxml] BOM and utf-8
Date: Thu, 22 May 2008 09:28:32 +0100
Message-ID: <48352EB0.5060805@gnu.org>
To: Miles Bader
Cc: emacs-pretest-bug@gnu.org, Patrick Drechsler, tomas@tuxteam.de, emacs-devel@gnu.org

Miles Bader wrote:
> The encoding of BOM (incidentally, isn't this name for it obsolete?)

The Unicode Consortium seems very confused over this, probably because of the format they chose for the character info tables they publish, which they don't want to change now because software relies on it. The only way of specifying an alternate name seems to be to demote the old name to the "1.0 name" field and use the new name as the preferred name.

The character was renamed to ZWNBSP in Unicode 2.0 to reflect its dual purpose as BOM and zero width no-break space. Then, in a later version of Unicode, its use as ZWNBSP was deprecated, but the official name was not changed back. (Swapping the ZWNBSP and BOM names would not be strictly correct, as ZWNBSP was not its name in Unicode 1.0.)
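
To make the "preferred name" versus "1.0 name" point concrete, here is a rough Python sketch. It assumes the published table is the semicolon-separated UnicodeData.txt, with the preferred name in the second field and the Unicode 1.0 name in the eleventh. The sample row is my transcription of the U+FEFF entry, and `split_names` is a hypothetical helper, so treat both as illustrative rather than authoritative.

```python
import codecs
import unicodedata

# Assumption: this row mirrors the U+FEFF entry in UnicodeData.txt
# (15 semicolon-separated fields; transcribed by hand for illustration).
SAMPLE_ROW = "FEFF;ZERO WIDTH NO-BREAK SPACE;Cf;0;BN;;;;;N;BYTE ORDER MARK;;;;"


def split_names(row):
    """Hypothetical helper: pull the code point and both names from one row."""
    fields = row.split(";")
    return {
        "code_point": int(fields[0], 16),  # field 0: code point in hex
        "name": fields[1],                 # field 1: preferred (current) name
        "unicode_1_name": fields[10],      # field 10: the old "1.0 name"
    }


entry = split_names(SAMPLE_ROW)

# The standard library only exposes the preferred name, not the 1.0 name.
current_name = unicodedata.name(chr(entry["code_point"]))

# Encoded as UTF-8, the character is the three-byte sequence used as a BOM.
utf8_bytes = chr(entry["code_point"]).encode("utf-8")

print(entry["name"], "/", entry["unicode_1_name"])
print(current_name, utf8_bytes == codecs.BOM_UTF8)
```

The old name survives only in the secondary field, which is why software that reads just the preferred name never sees "BYTE ORDER MARK".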