From: Charles Muller
Newsgroups: gmane.emacs.help
Subject: Re: How to make emacs auto-recognize utf-8 encoded files upon visiting
Date: Wed, 25 Sep 2002 18:21:23 +0900 (JST)
Message-ID: <20020925.182123.74749739.acmuller@gol.com>
References: <20020925.154533.74753345.acmuller@gol.com>
Original-To: eliz@is.elta.co.il
Cc: help-gnu-emacs@gnu.org
X-Mailer: Mew version 2.2 on Emacs 21.2 / Mule 5.0 (SAKAKI)

After Eli wrote:

> Yes. Don't you see that when you visit etc/HELLO?

I spent quite a bit of time trying to visit utf-8 files on my hard
drives in Emacs 21.2 without the TEI package loaded, and could not get
the Chinese characters to display in any of them. This is odd, given
that I can see them in the HELLO file.

Then I tried C-u C-x = on some of the characters in HELLO, and got this
interesting piece of information (for example):

  file code: ESC 24 28 43 31 5B (encoded by coding system iso-2022-7bit-unix)

Unless I am completely misunderstanding something (and I may well be,
for I am not a programmer), if this file is encoded as iso-2022-7bit,
it seems that we should not be using it as a test example of utf-8
functionality. Am I right?

I have written up a small test file in utf-8 that contains just one
line each of Korean, Japanese, and Chinese, in case anyone is
interested in trying it. It displays fine for me with TEI installed,
but as gibberish without it.

http://www.acmuller.net/test.txt

Chuck

---------------------------
Charles Muller
Faculty of Humanities, Toyo Gakuen University
Digital Dictionary of Buddhism and CJKV-English Dictionary
[http://www.acmuller.net]
Mobile Phone: 090-9310-1787
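
For anyone who wants to check the "file code" reported above outside of Emacs, here is a rough Python sketch. It is only an illustration under stated assumptions: the function names are made up for this example, the two "looks like" checks are crude heuristics and not how Emacs detects coding systems, and the decoder assumes that ESC 24 28 43 (that is, ESC $ ( C) designates the Korean KS C 5601 set and that setting the high bit on each following byte gives the EUC-KR form of the same character.

```python
ESC = 0x1B

# The "file code" reported by C-u C-x = in etc/HELLO, copied from the message.
hello_bytes = bytes.fromhex("1B 24 28 43 31 5B")


def looks_like_iso2022_7bit(data: bytes) -> bool:
    """Heuristic: pure 7-bit data that contains at least one ESC byte."""
    return ESC in data and all(b < 0x80 for b in data)


def looks_like_utf8(data: bytes) -> bool:
    """Heuristic: has 8-bit bytes and decodes as strict UTF-8."""
    if all(b < 0x80 for b in data):
        return False  # plain ASCII or 7-bit ISO-2022: no evidence of UTF-8
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def decode_ksc5601_g0(data: bytes) -> str:
    """Decode 'ESC $ ( C' followed by 7-bit byte pairs (assumed KS C 5601).

    Assumption: OR-ing 0x80 into each payload byte yields the EUC-KR
    encoding of the same characters, which Python can decode directly.
    """
    prefix = b"\x1b$(C"
    if not data.startswith(prefix):
        raise ValueError("expected the ESC $ ( C designation")
    payload = data[len(prefix):]
    return bytes(b | 0x80 for b in payload).decode("euc_kr")


char = decode_ksc5601_g0(hello_bytes)
print("ISO-2022 7-bit?", looks_like_iso2022_7bit(hello_bytes))
print("UTF-8?", looks_like_utf8(hello_bytes))
print("Decoded character:", char)
print("Same character as UTF-8 bytes:", char.encode("utf-8").hex(" "))
```

The sample bytes are all below 0x80 and start with an escape sequence, which fits the iso-2022-7bit label Emacs reported, whereas the UTF-8 form of the same character consists entirely of bytes with the high bit set. That is consistent with the suspicion that HELLO being readable says nothing about whether utf-8 files will display.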