From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Charles Muller Newsgroups: gmane.emacs.help Subject: Re: auto-recognize utf-8 encoded files upon visiting: solved (sort of...) Date: Tue, 24 Sep 2002 21:39:38 +0900 (JST) Sender: help-gnu-emacs-admin@gnu.org Message-ID: <20020924.213938.41631245.acmuller@gol.com> References: NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Trace: main.gmane.org 1032871111 15044 127.0.0.1 (24 Sep 2002 12:38:31 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Tue, 24 Sep 2002 12:38:31 +0000 (UTC) Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17toxP-0003u7-00 for ; Tue, 24 Sep 2002 14:38:28 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10) id 17toxS-0003m1-00; Tue, 24 Sep 2002 08:38:30 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10) id 17towS-0003P5-00 for help-gnu-emacs@gnu.org; Tue, 24 Sep 2002 08:37:28 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10) id 17towP-0003OI-00 for help-gnu-emacs@gnu.org; Tue, 24 Sep 2002 08:37:27 -0400 Original-Received: from smtp01.fields.gol.com ([203.216.5.131]) by monty-python.gnu.org with esmtp (Exim 4.10) id 17towO-0003Ni-00 for help-gnu-emacs@gnu.org; Tue, 24 Sep 2002 08:37:24 -0400 Original-Received: from 203-216-96-025.dsl.gol.ne.jp ([203.216.96.25] helo=localhost) by smtp01.fields.gol.com with esmtp (Magnetic Fields) id 17towN-00038w-00 for ; Tue, 24 Sep 2002 21:37:23 +0900 Original-To: help-gnu-emacs@gnu.org Original-Newsgroups: gnu.emacs.help In-Reply-To: X-Mailer: Mew version 2.2 on Emacs 21.2 / Mule 5.0 (SAKAKI) X-Abuse-Complaints: abuse@gol.com Errors-To: help-gnu-emacs-admin@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.0.11 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.help:1784 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:1784 Gerald wrote: > Charles, thanks for your hint on TEI; I gave TEI a long try many years ago, > when SGML came up, they provided very good introductory material to the > whole issue. But I didn't know of their work on emacs. You say that the > unicode stuff didn't work right on emacs 21.2. It worked OK for me for European languages, and I can do CJK input/output in Chinese, Japanese, Korean, either in the national encodings or in utf-8. But regular Emacs 21.2, although it seems to be able to recognize the encodings of files, does not seem to be able to apply the proper fonts very well--especially East Asian fonts, when they are mixed in together with Latin. > I compiled an emacs version > from the CVS sources (http://savannah.gnu.org/projects/emacs/) and there > unicode integration seems to be already more evolved than in the official > distribution. Almost everything works very well. Perhaps you should give it > a try. TEI-Emacs applies a "fontifying" process wherein the proper fonts are applied to all of the different mixed-language codepoints that I am using. I wonder if this recent package you mention can do that? In any case, what you have told me here is certainly good news, and I'll keep it in mind. As you have guessed, I do not use Emacs for programming, but as an infinitely customizable text-editing environment to carry out humanities research projects. The main reason I use TEI-Emacs is not for the Unicode handling, but because I am running a number of research projects that are structured by TEI-XML. I'm using their DTD's, style sheets, and everything, so it's a complete package for me, that takes care of almost everything. Nonetheless, like you, I'm looking forward to the continued development of the Emacsens toward Unicode. I know that Xemacs 21.5 will set UTF-8 as the default encoding. I hope the developers at GNU Emacs have similar intentions in mind. Regards, Chuck --------------------------- Charles Muller Faculty of Humanities, Toyo Gakuen University Digital Dictionary of Buddhism and CJKV-English Dictionary [http://www.acmuller.net] Mobile Phone: 090-9310-1787