unofficial mirror of emacs-devel@gnu.org 
* describe-char and unicode data
From: James H. Cloos Jr. @ 2003-05-09 18:31 UTC (permalink / raw)


describe-char shows the Unicode hex value of the character in question
if it exists (some characters do not translate to Unicode).

Would a patch that expands that to also show the relevant data from
UnicodeData.txt be accepted?

Step one would be code to convert UnicodeData.txt to a suitable elisp
structure, generating a unicodedata.el file.  Given that, the
additional logic in describe-char is trivial.

To give an idea of the amount of data available, UnicodeData.txt is a
semicolon-separated text database with 15 fields per record, and
currently has 15100 records, so loading it may be an issue.  The related
Unihan.txt has up to 78 possible entries for each of 71098 characters.
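
A minimal sketch of the conversion step, assuming the 15-field
semicolon-separated layout described above.  The converter is written in
Python purely for illustration; the names parse_record, to_elisp and the
variable unicodedata-names are hypothetical, and an alist of
(codepoint . name) pairs is only one candidate structure, not a
recommendation.

```python
# Sketch: turn UnicodeData.txt-style records into an elisp alist form.
# All function and variable names here are illustrative assumptions.

SAMPLE = """\
0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;
0061;LATIN SMALL LETTER A;Ll;0;L;;;;;N;;;0041;;0041
"""


def parse_record(line):
    """Split one record into (codepoint, list of all 15 fields)."""
    fields = line.rstrip("\n").split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return int(fields[0], 16), fields


def elisp_string(text):
    """Quote text as an elisp string literal."""
    return '"' + text.replace("\\", "\\\\").replace('"', '\\"') + '"'


def to_elisp(lines, var="unicodedata-names"):
    """Emit a defconst holding (codepoint . name) pairs."""
    entries = []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        codepoint, fields = parse_record(line)
        entries.append(f"(#x{codepoint:04X} . {elisp_string(fields[1])})")
    body = "\n    ".join(entries)
    return f"(defconst {var}\n  '({body}))\n"


if __name__ == "__main__":
    print(to_elisp(SAMPLE.splitlines()), end="")
```

Writing the returned text to unicodedata.el would give a file that can
be loaded on demand, which keeps the cost of the full table out of
startup.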

The name entry from UnicodeData.txt and probably the kDefinition
entries from Unihan.txt would be the useful additions for
describe-char.  The rest of the data may however be useful elsewhere.
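
For the Unihan side, a rough sketch of pulling out only the kDefinition
entries, assuming Unihan.txt's tab-separated layout of code point, field
tag and value, with comment lines starting with "#".  The function name
parse_unihan and the sample lines are illustrative only.

```python
# Sketch: collect selected fields (by default kDefinition) per character.
# parse_unihan and SAMPLE are hypothetical names used for illustration.

SAMPLE = (
    "# comment lines start with a hash\n"
    "U+4E00\tkDefinition\tone; a, an; alone\n"
    "U+4E00\tkMandarin\tyi1\n"
    "U+4E8C\tkDefinition\ttwo; twice\n"
)


def parse_unihan(lines, wanted=("kDefinition",)):
    """Return {codepoint: {tag: value}} for the wanted tags only."""
    result = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blanks and comments
        code, tag, value = line.split("\t", 2)
        if tag in wanted:
            # code looks like "U+4E00"; drop the "U+" prefix
            result.setdefault(int(code[2:], 16), {})[tag] = value
    return result


if __name__ == "__main__":
    for codepoint, entry in parse_unihan(SAMPLE.splitlines()).items():
        print(f"U+{codepoint:04X}: {entry['kDefinition']}")
```

Filtering at conversion time means the other Unihan entries never need
to be loaded unless something else wants them.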

What, therefore, is the best structure to use for this data?

-JimC


end of thread, other threads:[~2003-05-23 12:04 UTC | newest]

Thread overview: 22+ messages
2003-05-09 18:31 describe-char and unicode data James H. Cloos Jr.
2003-05-10 10:06 ` Eli Zaretskii
2003-05-10 16:23   ` James H. Cloos Jr.
2003-05-10 16:23 ` Florian Weimer
2003-05-10 16:39   ` James H. Cloos Jr.
2003-05-11 12:56     ` Richard Stallman
2003-05-11 13:04       ` Florian Weimer
2003-05-10 18:52   ` Simon Josefsson
2003-05-11 13:05     ` Florian Weimer
2003-05-11 14:34       ` Simon Josefsson
2003-05-12  7:38     ` Richard Stallman
2003-05-12 11:24       ` Simon Josefsson
2003-05-13  6:07       ` Simon Josefsson
2003-05-15  4:54         ` Richard Stallman
2003-05-11 12:55 ` Richard Stallman
2003-05-11 17:24   ` Stephen J. Turnbull
2003-05-12 11:22   ` Kenichi Handa
2003-05-14 13:49     ` Richard Stallman
2003-05-21 21:52   ` James H. Cloos Jr.
2003-05-22 15:29     ` Kevin Rodgers
2003-05-22 19:25       ` James H. Cloos Jr.
2003-05-23 12:04     ` Richard Stallman
