From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Xah Lee Newsgroups: gmane.emacs.help Subject: Re: those funny non-ASCII characters Date: Fri, 1 Jun 2012 14:06:33 -0700 (PDT) Organization: http://groups.google.com Message-ID: <5cba8baa-c270-4985-aec8-4468aaa9ed05@wp3g2000pbc.googlegroups.com> References: <731567ba-000c-4643-9eff-0237129b90c7@oe8g2000pbb.googlegroups.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1338585015 7308 80.91.229.3 (1 Jun 2012 21:10:15 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Fri, 1 Jun 2012 21:10:15 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Jun 01 23:10:12 2012 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1SaZ6q-0007z4-CZ for geh-help-gnu-emacs@m.gmane.org; Fri, 01 Jun 2012 23:10:12 +0200 Original-Received: from localhost ([::1]:48768 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SaZ6q-0004xI-2O for geh-help-gnu-emacs@m.gmane.org; Fri, 01 Jun 2012 17:10:12 -0400 Original-Path: usenet.stanford.edu!postnews.google.com!wp3g2000pbc.googlegroups.com!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 25 Original-NNTP-Posting-Host: 76.126.112.84 Original-X-Trace: posting.google.com 1338584908 24977 127.0.0.1 (1 Jun 2012 21:08:28 GMT) Original-X-Complaints-To: groups-abuse@google.com Original-NNTP-Posting-Date: Fri, 1 Jun 2012 21:08:28 +0000 (UTC) Complaints-To: groups-abuse@google.com Injection-Info: wp3g2000pbc.googlegroups.com; posting-host=76.126.112.84; posting-account=bRPKjQoAAACxZsR8_VPXCX27T2YcsyMA User-Agent: G2/1.0 X-HTTP-UserAgent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.5 (KHTML, like Gecko) Chrome/19.0.1084.52 Safari/536.5,gzip(gfe) Original-Xref: usenet.stanford.edu gnu.emacs.help:192673 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:85078 Archived-At: Xah wrote > > =E3=80=88Unicode BOM Byte Order Mark Hack=E3=80=89 http://xahlee.org/co= mp/unicode_BOM_byte_orde_mark.html > > > http://www.unicode.org/faq/utf_bom.html#bom1 On Jun 1, 9:26=C2=A0am, rusi wrote: > See http://www.unicode.org/versions/Unicode5.0.0/ch02.pdf > (pg 36) "Use of a BOM is neither required nor recommended for UTF-8, > but may > be encountered in contexts where UTF-8 data is converted from other > encoding forms..." > > More specifically the non-recommendation of bom: http://www.unicode.org/f= aq/utf_bom.html > "Note that some recipients of UTF-8 encoded data do not expect a BOM. > Where UTF-8 is used transparently in 8-bit environments, the use of a > BOM will interfere with any protocol or file format that expects > specific ASCII characters at the beginning, such as the use of "#!" of > at the beginning of Unix shell scripts. " didn't i mention these 2 points exactly in the link i gave?? Xah