unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
From: "Stephen J. Turnbull" <stephen@xemacs.org>
To: Eli Zaretskii <eliz@gnu.org>
Cc: tomas@tuxteam.de, rms@gnu.org, emacs-devel@gnu.org
Subject: Re: Input method or help feature needed
Date: Mon, 21 Feb 2011 11:53:20 +0900	[thread overview]
Message-ID: <87k4gunlnj.fsf@uwakimon.sk.tsukuba.ac.jp> (raw)
In-Reply-To: <83hbbytmvl.fsf@gnu.org>

Eli Zaretskii writes:

 > > (Excluding Korean and Han characters and whatever else ought to be
 > > excluded).
 > 
 > Why exclude them?

Because there are 11000 of the former and 21000 (and counting) of the
latter.  The Korean Hangul are precomposed in an algorithmic fashion
from about 70 components called "jamo".  It makes very little sense to
just have many pages when you can look up the jamo in smaller lists,
and drill down to exactly the Hangul you want.  Just as it should be
possible to type "i" and get a page of all characters related to "i"
including the Turkish dotless "i" and Greek iota, etc.

Similarly, the Han characters are organized by radical and stroke
count, and it should be possible to look at the (relatively) short
list of 214 radicals, then drill down to an approximate stroke count,
and then page up and down the stroke count.  There are non-radical
components as well, many of which even total Han illiterates would be
likely to recognize.  I don't know if these are listed in the Unicode
tables, but if so they could be combined with the radical and
(optionally) approximate stroke count to drastically prune the search
tree in 90% or more of practical cases.

However a simple list of Hangul or Hanzi would be rather painful to
use, not to mention that if you don't know how to say it (every Hangul
has an algorithmically constructed pronunciation), you're probably not
fluent enough in the language to easily pick the right character out
of an array of say 400 (20 x 20 seems like a reasonable size for a
"page" of characters).  The real differences are often subtle, not to
mention that many characters have several variant glyphs, and these
variations tend to confuse the non-native speaker.

A pure list in Unicode order for these characters is better than
*nothing*, true, but it's not really an acceptable answer to Richard's
requirement.




  reply	other threads:[~2011-02-21  2:53 UTC|newest]

Thread overview: 69+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-02-17 19:14 Input method or help feature needed Richard Stallman
2011-02-17 19:27 ` Eli Zaretskii
2011-02-17 19:52   ` Stephen Berman
2011-02-17 20:24     ` Harald Hanche-Olsen
2011-02-18 10:53       ` Eli Zaretskii
2011-02-18 15:46         ` Eli Zaretskii
2011-02-18 20:00           ` Ted Zlatanov
2011-02-17 22:05     ` Stefan Monnier
2011-02-18 21:24     ` Richard Stallman
2011-02-19  7:49       ` Eli Zaretskii
2011-02-19  8:01         ` David Kastrup
2011-02-19  8:37           ` Miles Bader
2011-02-20  0:30           ` Richard Stallman
2011-02-20  0:29         ` Richard Stallman
2011-02-20  3:59           ` Eli Zaretskii
2011-02-20 21:01             ` Richard Stallman
2011-02-18 21:24   ` Richard Stallman
2011-02-17 19:31 ` Justin Lilly
2011-02-17 19:41 ` Tassilo Horn
2011-02-18 21:24   ` Richard Stallman
2011-02-19  7:30     ` Eli Zaretskii
2011-02-19  8:18       ` Stephen J. Turnbull
2011-02-19  8:33         ` Miles Bader
2011-03-04  9:10     ` Kevin Rodgers
2011-02-17 20:26 ` Paul Eggert
2011-02-17 22:50 ` Andreas Schwab
2011-02-18  0:09   ` Miles Bader
2011-02-18  5:13     ` Werner LEMBERG
2011-02-18  8:37     ` tomas
2011-02-18  8:41       ` Miles Bader
2011-02-18 11:27         ` Kenichi Handa
2011-02-20  8:27         ` tomas
2011-02-20 10:41           ` Eli Zaretskii
2011-02-20 11:16             ` David Kastrup
2011-02-20 21:01             ` Richard Stallman
2011-02-20 21:30               ` Eli Zaretskii
2011-02-21  2:53                 ` Stephen J. Turnbull [this message]
2011-02-21 22:35                 ` Richard Stallman
2011-02-21  0:59               ` Kenichi Handa
2011-02-21  7:02                 ` Eli Zaretskii
2011-02-21  7:47                   ` Kenichi Handa
2011-02-21  8:25                     ` Miles Bader
2011-02-21  8:29                     ` Eli Zaretskii
2011-02-21 11:14                       ` Kenichi Handa
2011-02-21 12:25                         ` Eli Zaretskii
2011-02-22  0:55                           ` Kenichi Handa
2011-02-22  1:23                             ` Miles Bader
2011-02-21 22:36                         ` Richard Stallman
2011-02-21 22:36                 ` Richard Stallman
2011-02-18  8:43       ` David Kastrup
2011-02-20  8:30         ` tomas
2011-02-20 10:45           ` Eli Zaretskii
2011-02-18 21:25   ` Richard Stallman
2011-02-19  7:52     ` Eli Zaretskii
2011-02-20  0:29       ` Richard Stallman
2011-02-20  7:43         ` James Cloos
2011-02-20 21:01           ` Richard Stallman
2011-02-20 22:45             ` Harald Hanche-Olsen
2011-02-21 22:35               ` Richard Stallman
2011-02-21 22:35               ` Richard Stallman
2011-02-22  6:05                 ` Harald Hanche-Olsen
2011-02-22 20:25                   ` Richard Stallman
2011-02-19 19:26     ` James Cloos
2011-02-20 21:00       ` Richard Stallman
2011-02-20 21:33         ` Drew Adams
2011-02-21 22:36           ` Richard Stallman
2011-02-21 22:51             ` Drew Adams
2011-02-22 20:25               ` Richard Stallman
  -- strict thread matches above, loose matches on Subject: below --
2011-02-18 10:39 Андрей Парамонов

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87k4gunlnj.fsf@uwakimon.sk.tsukuba.ac.jp \
    --to=stephen@xemacs.org \
    --cc=eliz@gnu.org \
    --cc=emacs-devel@gnu.org \
    --cc=rms@gnu.org \
    --cc=tomas@tuxteam.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).