From: Kevin Rodgers
Newsgroups: gmane.emacs.help
To: help-gnu-emacs@gnu.org
Subject: Re: To get Unicode file (UTF8) with a leading byte-order-marker characters?
Date: Fri, 11 Jun 2010 23:56:41 -0600

Paul Chany wrote:
> Eli Zaretskii writes:
>
>>> From: Paul Chany
>>>
>>> The file should be ASCII (using \U escapes for unicode characters) or
>>> Unicode (UTF16 or UTF8) with a leading byte-order-marker.
>>>
>>> How can I get this file using Emacs?
>> C-x RET f utf-8-with-signature RET
>
> Well, when I did so I get the following message:
>
> NAME:NSGenericException REASON:Parse failed at
> line 2 (char 41) - unexpected character (wanted ';') INFO:(nil)
>
> and the translation don't works. :(
> What's wrong with it?
>
> I attach the file here (it's small):

I saved the file and visited it in Emacs 23.2.1 (i386-apple-darwin8.11.1,
NS apple-appkit-824.48), and no errors or warnings were reported.

`C-h C RET' displays:

Coding system for saving this buffer:
  U -- utf-8-with-signature-unix
...
and with the cursor over character 41 (the second character on line 2),
`C-u C-x =' displays:

        character: ó (243, #o363, #xf3)
preferred charset: unicode (Unicode (ISO10646))
       code point: 0xF3
           syntax: w    which means: word
         category: .:Base, c:Chinese, j:Japanese, l:Latin, v:Viet
      buffer code: #xC3 #xB3
        file code: #xEF #xBB #xBF #xC3 #xB3 (encoded by coding system utf-8-with-signature-unix)
          display: by this font (glyph code)
    nil:-apple-Monaco-medium-normal-normal-*-12-*-*-*-m-0-iso10646-1 (#x79)

Character code properties: customize what to show
  name: LATIN SMALL LETTER O WITH ACUTE
  old-name: LATIN SMALL LETTER O ACUTE
  general-category: Ll (Letter, Lowercase)
  decomposition: (111 769) ('o' '́')

-- 
Kevin Rodgers
Denver, Colorado, USA
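
P.S. For anyone who wants to check the bytes outside Emacs, here is a small Python sketch (not part of the original exchange). It assumes Python's standard "utf-8-sig" codec as a stand-in for Emacs's utf-8-with-signature, and the helper name starts_with_utf8_bom is made up for illustration. It shows that EF BB BF in the "file code" line is the signature and that C3 B3 is the UTF-8 encoding of ó.

```python
import codecs


def starts_with_utf8_bom(data: bytes) -> bool:
    """Return True if the bytes begin with the UTF-8 signature EF BB BF.

    Hypothetical helper, named here only for illustration.
    """
    return data.startswith(codecs.BOM_UTF8)


# Encoding with the signature-writing codec prepends the three BOM bytes;
# plain UTF-8 produces only the two bytes for the character itself.
with_signature = "ó".encode("utf-8-sig")
plain = "ó".encode("utf-8")

print(" ".join(f"#x{b:02X}" for b in with_signature))
print(" ".join(f"#x{b:02X}" for b in plain))
print(starts_with_utf8_bom(with_signature), starts_with_utf8_bom(plain))
```

The same helper could be applied to the first few bytes of the saved file (read in binary mode) to confirm that the signature was actually written.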