From: Charles Muller <acmuller@gol.com>
Cc: help-gnu-emacs@gnu.org
Subject: Re: How to make emacs auto-recognize utf-8 encoded files upon visiting
Date: Fri, 27 Sep 2002 23:10:07 +0900 (JST)
Message-ID: <20020927.231007.59465175.acmuller@gol.com>
In-Reply-To: <vaf7kh7zlwd.fsf@lucy.cs.uni-dortmund.de>

Kai wrote:

> Maybe it is sufficient to install Mule-UCS?  I guess that TEI Emacs
> is Emacs with Mule-UCS pre-installed (plus some other packages).

TEI-Emacs does install Mule-UCS, but its ability to do what it does must
come from more than that, because I always install Mule-UCS with my Emacs,
and it never renders CJK fonts in Unicode until I install the TEI package.
Since all of my
internet publication and data compilation has to be done in Unicode, that's
always the first thing I check. But I don't know enough about Lisp
programming to tell you exactly *what* the TEI package does to get this
working. No doubt Sebastian Rahtz, Christian Wittern, and some of the others
who wrote the package would be happy to tell you what the key routines are.

The first priority of the TEI people is to make sure that the SGML/XML/HTML
modes work more precisely and comprehensively than in the standard Emacs
package, since the target audience is mainly humanities scholars who are
using TEI-XML to mark up literary texts. For example, the way PSGML is set
up in the standard package, it is hard to get it to tell the difference
between XML and SGML. They have also added a whole array of DTDs for
various purposes, including distinctions in XHTML/strict/transitional. There
is also an added XSLT mode that allows for adjustments and debugging.

Then, on top of that, because there are so many of us working with mixed
international scripts (including CJK), apparently someone decided to figure
out how to get all the fonts properly recognized.

I am guessing that part of the problem facing the standard installation of
Emacs is that with any other traditional encoding outside of Unicode, such
as Big5, JIS, or KSC, you always have at least one full font set that is
traditionally mapped to the encoding. With Unicode, I don't know of a font
that is designed to work readily in Linux/Emacs, that covers all codepoints
(the way MS Arial Unicode does in Windows, for example). So a function needs
to be added which goes through the document and properly plugs in fonts for
each given codepoint. I am not well-enough versed at the technical end to be
able to explain how they have accomplished this over in Oxford.
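
To make that guess concrete, here is a minimal sketch of the general idea of
picking a font per codepoint. It is written in Python purely for
illustration and is not the TEI-Emacs code, which I have not read. The
block ranges are real Unicode blocks, but the font names and the helper
names (FONT_RANGES, font_for, font_runs) are placeholders I made up.

```python
# Hypothetical font table: the ranges are real Unicode blocks, but the font
# names are placeholders, not what TEI-Emacs actually uses.
FONT_RANGES = [
    (0x0000, 0x024F, "latin-font"),     # Basic Latin through Latin Extended-B
    (0x3040, 0x30FF, "japanese-font"),  # Hiragana and Katakana
    (0x4E00, 0x9FFF, "han-font"),       # CJK Unified Ideographs
    (0xAC00, 0xD7AF, "korean-font"),    # Hangul Syllables
]
FALLBACK = "fallback-font"


def font_for(ch):
    """Return the font assigned to the block containing character ch."""
    cp = ord(ch)
    for lo, hi, font in FONT_RANGES:
        if lo <= cp <= hi:
            return font
    return FALLBACK


def font_runs(text):
    """Split text into (font, substring) runs of consecutive same-font characters."""
    runs = []
    for ch in text:
        font = font_for(ch)
        if runs and runs[-1][0] == font:
            # Same font as the previous character: extend the current run.
            runs[-1] = (font, runs[-1][1] + ch)
        else:
            runs.append((font, ch))
    return runs
```

A mixed Latin/CJK line passed to font_runs comes back as a list of runs,
each tagged with the font that covers it, and anything outside the table
falls through to the fallback font.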

I noticed a good bit of negative reaction toward TEI-Emacs when I first
mentioned it, where people expressed alarm about the TEI people not caring
about the GPL and not reporting to the GNU development team. I
think that these concerns come as a result of people not really checking
into what the package is, and what it does. It is not a new version of
Emacs, such as XEmacs. It is simply an add-on, that contains mode
enhancements, and some of its own new modes--just the way people are
accustomed to adding on calendar modes, e-mail packages, or whatever.

I know many of the dedicated people in the Text Encoding Initiative very
well, and there is no group around that is more concerned about free
software and donating code. But when you really need to get a certain type
of application set up for a certain use, I don't think you can just write to
the GNU development team and then wait for a future version for it to be
implemented. And after all, the Lisp code for TEI-Emacs is just as openly
available as any other development based on Emacs. And they have also made a
concerted effort to have their add-on support GNU Emacs, rather than XEmacs,
so I really see it as a very positive development that should be learned
from, rather than disparaged.

Chuck

---------------------------
Charles Muller  <acmuller@gol.com>
Faculty of Humanities,  Toyo Gakuen University
Digital Dictionary of Buddhism and CJKV-English Dictionary 
[http://www.acmuller.net]
Mobile Phone: 090-9310-1787
