unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
From: Kevin Rodgers <ihs_4664@yahoo.com>
Cc: Shaddy.Baddah@msa.hinet.net
Subject: Re: More unicode blocks?
Date: Wed, 28 Sep 2005 12:22:35 -0600	[thread overview]
Message-ID: <433ADF6B.1070109@yahoo.com> (raw)
In-Reply-To: <43359AB0.1050300@msa.hinet.net>

Shaddy Baddah wrote:
 > Today, I finally did what I had resolved to do some time ago. I delved
 > into emacs's unicode support facilities.
 >
 > I am a little disappointed, because it has become apparent that the
 > unicode character set support is limited to 3 specific blocks of the
 > full unicode character set, those being the blocks that start and end at
 > the indexes expressed in mule-unicode-0100-24ff, mule-unicode-2500-33ff
 > and mule-unicode-e000-ffff.
 >
 > The blocks that I am interested in are the CJK Unified Ideographs blocks
 > , that start at unicode index 0x4E00. Specifically, the characters that
 > are shared by the character set encoded via the big5 encoding scheme.

Perhaps you should try Emacs 22 (aka CVS Emacs).  Here are some items
from its etc/NEWS file:

---
*** The utf-8/16 coding systems have been enhanced.
By default, untranslatable utf-8 sequences are simply composed into
single quasi-characters.  User option `utf-translate-cjk-mode' (it is
turned on by default) arranges to translate many utf-8 CJK character
sequences into real Emacs characters in a similar way to the Mule-UCS
system.  As this loads a fairly big data on demand, people who are not
interested in CJK characters may want to customize it to nil.
You can augment/amend the CJK translation via hash tables
`ucs-mule-cjk-to-unicode' and `ucs-unicode-to-mule-cjk'.  The utf-8
coding system now also encodes characters from most of Emacs's
one-dimensional internal charsets, specifically the ISO-8859 ones.
The utf-16 coding system is affected similarly.

---
*** A new coding system `euc-tw' has been added for traditional Chinese
in CNS encoding; it accepts both Big 5 and CNS as input; on saving,
Big 5 is then converted to CNS.

---
*** New variable `utf-translate-cjk-unicode-range' controls which
Unicode characters to translate in `utf-translate-cjk-mode'.

---
*** iso-10646-1 (`Unicode') fonts can be used to display any range of
characters encodable by the utf-8 coding system.  Just specify the
fontset appropriately.

 > I have no problems displaying and editing these characters under the
 > big5 coding scheme, so they are obviously well supported by emacs (and
 > it's internal coding scheme, right?).
 >
 > So, what is the impediment, or perhaps rationale, behind the lack of
 > support for the additional unicode blocks at this stage of Emacs
 > development?
 >
 > Is it simply to do with someone having to implement some type of
 > character translation tables, or is there/how much more is there to it?

Sorry, I don't know the answers to those questions.

-- 
Kevin Rodgers

      parent reply	other threads:[~2005-09-28 18:22 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-09-24 18:28 More unicode blocks? Shaddy Baddah
2005-09-28 14:59 ` Shaddy Baddah
     [not found] ` <mailman.8992.1127919972.20277.help-gnu-emacs@gnu.org>
2005-09-28 15:31   ` Charles philip Chan
2005-09-28 15:53   ` Thien-Thi Nguyen
2005-09-28 18:22 ` Kevin Rodgers [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=433ADF6B.1070109@yahoo.com \
    --to=ihs_4664@yahoo.com \
    --cc=Shaddy.Baddah@msa.hinet.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).