unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
From: Kenichi Handa <handa@m17n.org>
Cc: mariano@gnome.org, alexander.winston@comcast.net,
	emacs-devel@gnu.org, danilo@gnome.org, monnier@iro.umontreal.ca,
	miles@gnu.org
Subject: Re: Possible UTF-8 CJK Regressions in Terminal Emulators
Date: Wed, 9 Jun 2004 16:24:13 +0900 (JST)	[thread overview]
Message-ID: <200406090724.QAA11057@etlken.m17n.org> (raw)
In-Reply-To: <rzqd64agch6.fsf@albion.dl.ac.uk> (message from Dave Love on Tue,  08 Jun 2004 18:56:37 +0100)

In article <rzqd64agch6.fsf@albion.dl.ac.uk>, Dave Love <d.love@dl.ac.uk> writes:

> Kenichi Handa <handa@m17n.org> writes:
> > I think I succeeded in loading subst-*.el not at the time of
> > customizing utf-translate-cjk-mode to t but only when it is
> > found that loading them is necessary on decoding or encoding
> > utf-8, or on running decode/encode-char.  This means that we
> > can make the default value of utf-translate-cjk-mode to t
> > without loading subst-*.el at building time.

> It doesn't fix the potential effects on non-CJK users if decoding a
> bit of Unicode text containing such a character will load the large
> tables even if they're useless to the user.  Maybe there aren't many
> people now with 48MB P133s or old SPARCs like me, in which case it's a
> reasonable default, but I suggest an entry in NEWS/PROBLEMS about it.

I'm going to modify the current entry in NEWS as below.

** The utf-8/16 coding systems have been enhanced.
By default, untranslatable utf-8 sequences are simply composed into
single quasi-characters.  User option `utf-translate-cjk-mode' (it is
turned on by default) arranges to translate many utf-8 CJK character
sequences into real Emacs characters in a similar way to the Mule-UCS
system.  As this loads a fairly big data on demand, people who are not
interested in CJK characters may want to customize it to nil.  You can
augment/amend the CJK translation via hash tables
`ucs-mule-cjk-to-unicode' and `ucs-unicode-to-mule-cjk'.  The utf-8
coding systems now also encodes characters from most of Emacs's
one-dimensional internal charsets, specifically the ISO-8859 ones.
The utf-16 coding system is affected similarly.

> > I think it's a big improvement especially for CJK users,

> I agree it should be on for CJK users anyway.  (I thought it was now
> conditional on the language environment.)

It's not.  I think we had better avoid turning on/off a user
option depending on language environment.

---
Ken'ichi HANDA
handa@m17n.org

  reply	other threads:[~2004-06-09  7:24 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <1077557604.1632.26.camel@duende>
     [not found] ` <jwv3c90bgi7.fsf-monnier+emacs/pretest@asado.iro.umontreal.ca>
     [not found]   ` <1077643915.12919.2.camel@duende>
     [not found]     ` <jwvwu6c6yca.fsf-monnier+emacs/pretest@asado.iro.umontreal.ca>
     [not found]       ` <1077682436.28482.9.camel@duende>
     [not found]         ` <buoeksjyee3.fsf@mcspd15.ucom.lsi.nec.co.jp>
     [not found]           ` <200403010815.RAA14365@etlken.m17n.org>
2004-03-18 15:34             ` Possible UTF-8 CJK Regressions in Terminal Emulators Dave Love
2004-04-07 12:30               ` Kenichi Handa
2004-04-08 11:27                 ` Dave Love
2004-04-09 11:28                   ` Kenichi Handa
2004-06-07 12:27                     ` Kenichi Handa
2004-06-07 12:36                       ` Miles Bader
2004-06-07 13:00                         ` Kenichi Handa
2004-06-08 18:02                           ` Dave Love
2004-06-09  7:37                             ` Kenichi Handa
2004-06-09  9:38                               ` Stefan Monnier
2004-06-10  0:20                                 ` Kenichi Handa
2004-06-08 17:56                       ` Dave Love
2004-06-09  7:24                         ` Kenichi Handa [this message]
2004-06-12  2:41                       ` Kenichi Handa
2004-06-12 13:46                         ` Juanma Barranquero
2004-06-13  8:42                           ` Kenichi Handa
2004-06-13 11:36                             ` Juanma Barranquero
2004-06-13 13:18                             ` Andreas Schwab
2004-06-14  1:05                               ` Kenichi Handa
2004-06-13 20:39                             ` Luc Teirlinck

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200406090724.QAA11057@etlken.m17n.org \
    --to=handa@m17n.org \
    --cc=alexander.winston@comcast.net \
    --cc=danilo@gnome.org \
    --cc=emacs-devel@gnu.org \
    --cc=mariano@gnome.org \
    --cc=miles@gnu.org \
    --cc=monnier@iro.umontreal.ca \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).