From: Kevin Rodgers <ihs_4664@yahoo.com>
Cc: Shaddy.Baddah@msa.hinet.net
Subject: Re: More unicode blocks?
Date: Wed, 28 Sep 2005 12:22:35 -0600 [thread overview]
Message-ID: <433ADF6B.1070109@yahoo.com> (raw)
In-Reply-To: <43359AB0.1050300@msa.hinet.net>
Shaddy Baddah wrote:
> Today, I finally did what I had resolved to do some time ago. I delved
> into emacs's unicode support facilities.
>
> I am a little disappointed, because it has become apparent that the
> unicode character set support is limited to 3 specific blocks of the
> full unicode character set, those being the blocks that start and end at
> the indexes expressed in mule-unicode-0100-24ff, mule-unicode-2500-33ff
> and mule-unicode-e000-ffff.
>
> The blocks that I am interested in are the CJK Unified Ideographs blocks
> , that start at unicode index 0x4E00. Specifically, the characters that
> are shared by the character set encoded via the big5 encoding scheme.
Perhaps you should try Emacs 22 (aka CVS Emacs). Here are some items
from its etc/NEWS file:
---
*** The utf-8/16 coding systems have been enhanced.
By default, untranslatable utf-8 sequences are simply composed into
single quasi-characters. User option `utf-translate-cjk-mode' (it is
turned on by default) arranges to translate many utf-8 CJK character
sequences into real Emacs characters in a similar way to the Mule-UCS
system. As this loads a fairly big data on demand, people who are not
interested in CJK characters may want to customize it to nil.
You can augment/amend the CJK translation via hash tables
`ucs-mule-cjk-to-unicode' and `ucs-unicode-to-mule-cjk'. The utf-8
coding system now also encodes characters from most of Emacs's
one-dimensional internal charsets, specifically the ISO-8859 ones.
The utf-16 coding system is affected similarly.
---
*** A new coding system `euc-tw' has been added for traditional Chinese
in CNS encoding; it accepts both Big 5 and CNS as input; on saving,
Big 5 is then converted to CNS.
---
*** New variable `utf-translate-cjk-unicode-range' controls which
Unicode characters to translate in `utf-translate-cjk-mode'.
---
*** iso-10646-1 (`Unicode') fonts can be used to display any range of
characters encodable by the utf-8 coding system. Just specify the
fontset appropriately.
> I have no problems displaying and editing these characters under the
> big5 coding scheme, so they are obviously well supported by emacs (and
> it's internal coding scheme, right?).
>
> So, what is the impediment, or perhaps rationale, behind the lack of
> support for the additional unicode blocks at this stage of Emacs
> development?
>
> Is it simply to do with someone having to implement some type of
> character translation tables, or is there/how much more is there to it?
Sorry, I don't know the answers to those questions.
--
Kevin Rodgers
prev parent reply other threads:[~2005-09-28 18:22 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-09-24 18:28 More unicode blocks? Shaddy Baddah
2005-09-28 14:59 ` Shaddy Baddah
[not found] ` <mailman.8992.1127919972.20277.help-gnu-emacs@gnu.org>
2005-09-28 15:31 ` Charles philip Chan
2005-09-28 15:53 ` Thien-Thi Nguyen
2005-09-28 18:22 ` Kevin Rodgers [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/emacs/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=433ADF6B.1070109@yahoo.com \
--to=ihs_4664@yahoo.com \
--cc=Shaddy.Baddah@msa.hinet.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).