all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Simon Josefsson <jas@extundo.com>
Subject: Re: More Cyrillic vs UTF-8
Date: Sat, 26 Apr 2003 23:47:21 +0200	[thread overview]
Message-ID: <ilu7k9hrkwm.fsf@latte.josefsson.org> (raw)
In-Reply-To: <841xzphrr4.fsf@lucy.is.informatik.uni-duisburg.de> (Kai Großjohann's message of "Sat, 26 Apr 2003 23:29:35 +0200")

kai.grossjohann@gmx.net (Kai Großjohann) writes:

> Simon Josefsson <jas@extundo.com> writes:
>
>> kai.grossjohann@gmx.net (Kai Großjohann) writes:
>>
>>> Simon Josefsson <jas@extundo.com> writes:
>>>
>>>> Richard Stallman <rms@gnu.org> writes:
>>>>
>>>>> Mentioning this in PROBLEMS seems like a good idea to me, but a useful
>>>>> entry needs to be stated in terms of what behavior the user sees.
>>>>> This text doesn't explain the practical consequences; a user would say
>>>>> "so what does that mean for me?"
>>>>
>>>> Is this better?
>>>
>>> Can you say what characters you're talking about, instead of just the
>>> code points?  I guess that most people haven't memorized the Unicode
>>> table (your truly included ;-).
>>
>> I agree, but I don't know which they are, and maybe the range includes
>> very many different kind of characters.  And as new characters are
>> added all the time, I fear that both the list of supported characters
>> and the list of unsupported characters would be too long to be useful.
>> Hm.
>
> Well, isn't Unicode divided into blocks so that one can list the
> blocks?  Hm.  Oh!  See http://www.unicode.org/charts/ -- looks quite
> promising.  Searching for the code blocks there and then giving the
> names ought to be useful.  WDYT?

The compiled list is below.  Does it really help anyone to list all of
them?

Supported:

Basic Latin  	Optical Character Recognition
Latin-1 Supplement 	Enclosed Alphanumerics
Latin Extended-A 	Box Drawing
Latin Extended-B 	Block Elements
IPA Extensions 	Geometric Shapes
Spacing Modifier Letters 	Miscellaneous Symbols
Combining Diacritical Marks 	Dingbats
Greek 	Miscellaneous Mathematical Symbols-A
Cyrillic 	Supplemental Arrows-A
Cyrillic Supplement 	Braille Patterns
Armenian 	Supplemental Arrows-B
Hebrew 	Miscellaneous Mathematical Symbols-B
Arabic 	Supplemental Mathematical Operators
Syriac 	CJK Radicals Supplement
Thaana 	Kangxi Radicals
Devanagari 	Ideographic Description Characters
Bengali 	CJK Symbols and Punctuation
Gurmukhi 	Hiragana
Gujarati 	Katakana
Oriya 	Bopomofo
Tamil 	Hangul Compatibility Jamo
Telugu 	Kanbun
Kannada 	Bopomofo Extended
Malayalam 	Enclosed CJK Letters and Months
Sinhala 	CJK Compatibility
Thai 	
Lao 	
Tibetan 	
Myanmar 	
Georgian 	
Hangul Jamo 	
Ethiopic 	
Cherokee 	Private Use Area
Unified Canadian Aboriginal Syllabic 	CJK Compatibility Ideographs
Ogham 	Alphabetic Presentation Forms
Runic 	Arabic Presentation Forms-A
Tagalog 	Variation Selectors
Hanunoo 	Combining Half Marks
Buhid 	CJK Compatibility Forms
Tagbanwa 	Small Form Variants
Khmer 	Arabic Presentation Forms-B
Mongolian 	Halfwidth and Fullwidth Forms
Latin Extended Additional 	Specials
Greek Extended 	
General Punctuation 	
Superscripts and Subscripts 	
Currency Symbols 	
Combining Marks for Symbols 	
Letterlike Symbols 	
Number Forms 	
Arrows 	
Mathematical Operators 	
Miscellaneous Technical 	
Control Pictures 	

Unsupported:

CJK Unified Ideographs Extension A (1.5MB)
CJK Unified Ideographs (5MB)
Yi Syllables
Yi Radicals
Hangul Syllables (7MB)
High Surrogates
Low Surrogates
Old Italic
Gothic
Deseret
Byzantine Musical Symbols
Musical Symbols
Mathematical Alphanumeric Symbols
CJK Unified Ideographs Extension B (13MB)
CJK Compatibility Ideographs Supplement
Tags
Supplementary Private Use Area-A
Supplementary Private Use Area-B

  reply	other threads:[~2003-04-26 21:47 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-04-25 16:35 More Cyrillic vs UTF-8 Simon Josefsson
2003-04-25 22:42 ` Eli Zaretskii
2003-04-26  0:26   ` Simon Josefsson
2003-04-26 13:45     ` Richard Stallman
2003-04-26 14:15       ` Simon Josefsson
2003-04-26 20:19         ` Kai Großjohann
2003-04-26 21:16           ` Simon Josefsson
2003-04-26 21:29             ` Kai Großjohann
2003-04-26 21:47               ` Simon Josefsson [this message]
2003-04-27  8:37                 ` Kai Großjohann
2003-04-28 12:35                   ` Kenichi Handa
2003-04-28 23:08                     ` Simon Josefsson
2003-04-29 16:51                       ` Kai Großjohann
2003-04-29 20:00                         ` Robert J. Chassell
2003-04-29  5:39                     ` Richard Stallman
2003-04-29 13:36                       ` Simon Josefsson
     [not found]                     ` <87llxusaj9.fsf@gnu.org>
2003-05-01 11:27                       ` Kenichi Handa
2003-04-28 23:38                   ` Richard Stallman
2003-04-29 16:17                     ` Benjamin Riefenstahl
2003-04-30  5:43                       ` Richard Stallman
2003-04-30  8:01                         ` Kai Großjohann
2003-04-28  4:37                 ` Richard Stallman
2003-04-28  4:37         ` Richard Stallman
2003-04-26  7:52 ` Kenichi Handa
2003-04-26 11:54   ` Simon Josefsson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ilu7k9hrkwm.fsf@latte.josefsson.org \
    --to=jas@extundo.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.