From: Xah Lee
Newsgroups: gmane.emacs.help
To: help-gnu-emacs@gnu.org
Subject: Re: those funny non-ASCII characters
Date: Fri, 1 Jun 2012 00:03:12 -0700 (PDT)
References: <731567ba-000c-4643-9eff-0237129b90c7@oe8g2000pbb.googlegroups.com>

On May 31, 10:43 pm, rusi wrote:
> On Jun 1, 9:23 am, Jason Rumney wrote:
>
> > On Thursday, 31 May 2012 01:15:11 UTC+8, Buchs, Kevin wrote:
> > > Xah suggested I embrace Unicode. So I could use (prefer-coding-system
> > > 'utf-8) or the file variable: -*- coding: utf-8 -*-. Are there drawbacks
> > > to the former? What about opening an ASCII coded file? Can emacs
> > > properly detect it or does it come up as UTF-8?
>
> > ASCII is a subset of UTF-8, so the problem you are imagining does not exist.
>
> This does not exactly work that way on windows.
> eg recently saw a description of how notepad put a BOM mark in a
> haskell-script which made the haskell scripts unrunnable

The Haskell compiler probably should bear the blame. Last I read (~4 years ago), the language spec said source code should be Unicode (I forget whether it specified an encoding); however, no Haskell compiler at the time supported it.

If your language spec says Unicode, you have to support the BOM.

〈Unicode BOM Byte Order Mark Hack〉
http://xahlee.org/comp/unicode_BOM_byte_orde_mark.html

http://www.unicode.org/faq/utf_bom.html#bom1

Xah
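
For anyone who wants to see the two points from this thread side by side, here is a minimal Python sketch. It assumes a file that is pure ASCII except for a UTF-8 BOM (bytes EF BB BF) prepended by an editor. The names strip_utf8_bom, ascii_source and with_bom are made up for illustration, and the Haskell line is only sample file content; nothing here claims to show how any particular Haskell compiler reads its input.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# Pure ASCII bytes are already valid UTF-8 and decode to the same text,
# which is why opening an ASCII file as UTF-8 is harmless.
ascii_source = b'main = putStrLn "hi"\n'
assert ascii_source.decode("ascii") == ascii_source.decode("utf-8")

# The same file as an editor that prepends a BOM would save it.
with_bom = codecs.BOM_UTF8 + ascii_source

# The plain "utf-8" codec keeps the BOM as the character U+FEFF at the
# start of the text; a tool that does not expect it may reject the file.
print(with_bom.decode("utf-8").startswith("\ufeff"))

# The "utf-8-sig" codec, or stripping the three bytes by hand, removes it.
print(with_bom.decode("utf-8-sig") == ascii_source.decode("utf-8"))
print(strip_utf8_bom(with_bom) == ascii_source)
```

A reader that wants to accept both kinds of file can decode with "utf-8-sig", which treats the BOM as optional, so BOM-less ASCII and UTF-8 files still load unchanged.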