From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: rusi Newsgroups: gmane.emacs.help Subject: Re: those funny non-ASCII characters Date: Fri, 1 Jun 2012 20:17:35 -0700 (PDT) Organization: http://groups.google.com Message-ID: <202f4594-9462-48dc-954d-8cf9ac6a581e@s6g2000pbi.googlegroups.com> References: <731567ba-000c-4643-9eff-0237129b90c7@oe8g2000pbb.googlegroups.com> <5cba8baa-c270-4985-aec8-4468aaa9ed05@wp3g2000pbc.googlegroups.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1338607220 9668 80.91.229.3 (2 Jun 2012 03:20:20 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Sat, 2 Jun 2012 03:20:20 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Sat Jun 02 05:20:18 2012 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Saesv-00020P-CD for geh-help-gnu-emacs@m.gmane.org; Sat, 02 Jun 2012 05:20:13 +0200 Original-Received: from localhost ([::1]:59359 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Saesv-0000bM-1S for geh-help-gnu-emacs@m.gmane.org; Fri, 01 Jun 2012 23:20:13 -0400 Original-Path: usenet.stanford.edu!postnews.google.com!s6g2000pbi.googlegroups.com!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 44 Original-NNTP-Posting-Host: 116.74.133.115 Original-X-Trace: posting.google.com 1338607055 27096 127.0.0.1 (2 Jun 2012 03:17:35 GMT) Original-X-Complaints-To: groups-abuse@google.com Original-NNTP-Posting-Date: Sat, 2 Jun 2012 03:17:35 +0000 (UTC) Complaints-To: groups-abuse@google.com Injection-Info: s6g2000pbi.googlegroups.com; posting-host=116.74.133.115; posting-account=mBpa7woAAAAGLEWUUKpmbxm-Quu5D8ui User-Agent: G2/1.0 X-HTTP-UserAgent: Mozilla/5.0 (X11; Linux i686; rv:12.0) Gecko/20100101 Firefox/12.0,gzip(gfe) Original-Xref: usenet.stanford.edu gnu.emacs.help:192674 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:85079 Archived-At: On Jun 2, 2:06=C2=A0am, Xah Lee wrote: > Xah wrote > > > > =E3=80=88Unicode BOM Byte Order Mark Hack=E3=80=89http://xahlee.org/c= omp/unicode_BOM_byte_orde_mark.html > > > >http://www.unicode.org/faq/utf_bom.html#bom1 > > On Jun 1, 9:26=C2=A0am, rusi wrote: > > > Seehttp://www.unicode.org/versions/Unicode5.0.0/ch02.pdf > > (pg 36) "Use of a BOM is neither required nor recommended for UTF-8, > > but may > > be encountered in contexts where UTF-8 data is converted from other > > encoding forms..." > > > More specifically the non-recommendation of bom:http://www.unicode.org/= faq/utf_bom.html > > "Note that some recipients of UTF-8 encoded data do not expect a BOM. > > Where UTF-8 is used transparently in 8-bit environments, the use of a > > BOM will interfere with any protocol or file format that expects > > specific ASCII characters at the beginning, such as the use of "#!" of > > at the beginning of Unix shell scripts. " > > didn't i mention these 2 points exactly in the link i gave?? Yeah your own link says this: (as you know I often use and quote your unicode pages :-) ) - In unix-like OSes, BOM for utf-8 conflicts with the Shebang (Unix) hack. - Many Window software add BOM to utf-8 files, e.g. Notepad. But you also say > If your lang spec says unicode, you have to support BOM mark So I am not clear whats ur stand... Let me make my own position clear: The de jure unicode standard is set by the unicode consortium (or whatever its called) The de facto standard is set by microsoft and java The two conflict