* bug#23698: 24.5; broken character name
@ 2016-06-05 13:06 ynyaaa
2016-06-05 15:44 ` Eli Zaretskii
2016-06-06 17:49 ` Paul Eggert
0 siblings, 2 replies; 7+ messages in thread
From: ynyaaa @ 2016-06-05 13:06 UTC (permalink / raw)
To: 23698
Just after starting emacs -Q, this form returns correct value.
(get-char-code-property #xE01 'name)
=>"THAI CHARACTER KO KAI"
Then display THAI characters by typing this.
M-x list-charset-chars RET thai-iso8859-11 RET
After THAI characters are displayed, the form returns wrong value.
(get-char-code-property #xE01 'name)
=>"LETTER KO KAI"
For other THAI characters, get-char-code-property returns wrong names.
UCS correct name wrong name
E01 THAI CHARACTER KO KAI LETTER KO KAI
E02 THAI CHARACTER KHO KHAI LETTER KHO KHAI
E03 THAI CHARACTER KHO KHUAT LETTER KHO KHUAT
E04 THAI CHARACTER KHO KHWAI LETTER KHO KHWAI
E05 THAI CHARACTER KHO KHON LETTER KHO KHON
E06 THAI CHARACTER KHO RAKHANG LETTER KHO RAKHANG
E07 THAI CHARACTER NGO NGU LETTER NGO NGU
E08 THAI CHARACTER CHO CHAN LETTER CHO CHAN
E09 THAI CHARACTER CHO CHING LETTER CHO CHING
E0A THAI CHARACTER CHO CHANG LETTER CHO CHANG
E0B THAI CHARACTER SO SO LETTER SO SO
E0C THAI CHARACTER CHO CHOE LETTER CHO CHOE
E0D THAI CHARACTER YO YING LETTER YO YING
E0E THAI CHARACTER DO CHADA LETTER DO CHADA
E0F THAI CHARACTER TO PATAK LETTER TO PATAK
E10 THAI CHARACTER THO THAN LETTER THO THAN
E11 THAI CHARACTER THO NANGMONTHO LETTER THO NANGMONTHO
E12 THAI CHARACTER THO PHUTHAO LETTER THO PHUTHAO
E13 THAI CHARACTER NO NEN LETTER NO NEN
E14 THAI CHARACTER DO DEK LETTER DO DEK
E15 THAI CHARACTER TO TAO LETTER TO TAO
E16 THAI CHARACTER THO THUNG LETTER THO THUNG
E17 THAI CHARACTER THO THAHAN LETTER THO THAHAN
E18 THAI CHARACTER THO THONG LETTER THO THONG
E19 THAI CHARACTER NO NU LETTER NO NU
E1A THAI CHARACTER BO BAIMAI LETTER BO BAIMAI
E1B THAI CHARACTER PO PLA LETTER PO PLA
E1C THAI CHARACTER PHO PHUNG LETTER PHO PHUNG
E1D THAI CHARACTER FO FA LETTER FO FA
E1E THAI CHARACTER PHO PHAN LETTER PHO PHAN
E1F THAI CHARACTER FO FAN LETTER FO FAN
E20 THAI CHARACTER PHO SAMPHAO LETTER PHO SAMPHAO
E21 THAI CHARACTER MO MA LETTER MO MA
E22 THAI CHARACTER YO YAK LETTER YO YAK
E23 THAI CHARACTER RO RUA LETTER RO RUA
E24 THAI CHARACTER RU LETTER RU (Pali vowel letter)
E25 THAI CHARACTER LO LING LETTER LO LING
E26 THAI CHARACTER LU LETTER LU (Pali vowel letter)
E27 THAI CHARACTER WO WAEN LETTER WO WAEN
E28 THAI CHARACTER SO SALA LETTER SO SALA
E29 THAI CHARACTER SO RUSI LETTER SO RUSI
E2A THAI CHARACTER SO SUA LETTER SO SUA
E2B THAI CHARACTER HO HIP LETTER HO HIP
E2C THAI CHARACTER LO CHULA LETTER LO CHULA
E2D THAI CHARACTER O ANG LETTER O ANG
E2E THAI CHARACTER HO NOKHUK LETTER HO NOK HUK
E2F THAI CHARACTER PAIYANNOI PAI YAN NOI (abbreviation)
E30 THAI CHARACTER SARA A VOWEL SIGN SARA A
E31 THAI CHARACTER MAI HAN-AKAT VOWEL SIGN MAI HAN-AKAT N/S-T
E32 THAI CHARACTER SARA AA VOWEL SIGN SARA AA
E33 THAI CHARACTER SARA AM VOWEL SIGN SARA AM
E34 THAI CHARACTER SARA I VOWEL SIGN SARA I N/S-T
E35 THAI CHARACTER SARA II VOWEL SIGN SARA II N/S-T
E36 THAI CHARACTER SARA UE VOWEL SIGN SARA UE N/S-T
E37 THAI CHARACTER SARA UEE VOWEL SIGN SARA UEE N/S-T
E38 THAI CHARACTER SARA U VOWEL SIGN SARA U N/S-B
E39 THAI CHARACTER SARA UU VOWEL SIGN SARA UU N/S-B
E3A THAI CHARACTER PHINTHU VOWEL SIGN PHINTHU N/S-B (Pali virama)
E3F THAI CURRENCY SYMBOL BAHT BAHT SIGN (currency symbol)
E40 THAI CHARACTER SARA E VOWEL SIGN SARA E
E41 THAI CHARACTER SARA AE VOWEL SIGN SARA AE
E42 THAI CHARACTER SARA O VOWEL SIGN SARA O
E43 THAI CHARACTER SARA AI MAIMUAN VOWEL SIGN SARA MAI MUAN
E44 THAI CHARACTER SARA AI MAIMALAI VOWEL SIGN SARA MAI MALAI
E45 THAI CHARACTER LAKKHANGYAO LAK KHANG YAO
E46 THAI CHARACTER MAIYAMOK MAI YAMOK (repetition)
E47 THAI CHARACTER MAITAIKHU VOWEL SIGN MAI TAI KHU N/S-T
E48 THAI CHARACTER MAI EK TONE MAI EK N/S-T
E49 THAI CHARACTER MAI THO TONE MAI THO N/S-T
E4A THAI CHARACTER MAI TRI TONE MAI TRI N/S-T
E4B THAI CHARACTER MAI CHATTAWA TONE MAI CHATTAWA N/S-T
E4C THAI CHARACTER THANTHAKHAT THANTHAKHAT N/S-T (cancellation mark)
E4D THAI CHARACTER NIKHAHIT NIKKHAHIT N/S-T (final nasal)
E4E THAI CHARACTER YAMAKKAN YAMAKKAN N/S-T
E4F THAI CHARACTER FONGMAN FONRMAN
E50 THAI DIGIT ZERO DIGIT ZERO
E51 THAI DIGIT ONE DIGIT ONE
E52 THAI DIGIT TWO DIGIT TWO
E53 THAI DIGIT THREE DIGIT THREE
E54 THAI DIGIT FOUR DIGIT FOUR
E55 THAI DIGIT FIVE DIGIT FIVE
E56 THAI DIGIT SIX DIGIT SIX
E57 THAI DIGIT SEVEN DIGIT SEVEN
E58 THAI DIGIT EIGHT DIGIT EIGHT
E59 THAI DIGIT NINE DIGIT NINE
E5A THAI CHARACTER ANGKHANKHU ANGKHANKHU (ellipsis)
E5B THAI CHARACTER KHOMUT KHOMUT (beginning of religious texts)
In GNU Emacs 24.5.1 (i686-pc-mingw32)
of 2015-04-11 on LEG570
Windowing system distributor `Microsoft Corp.', version 6.0.6002
Configured using:
`configure --prefix=/c/usr --host=i686-pc-mingw32'
Important settings:
value of $LANG: JPN
locale-coding-system: cp932
Major mode: Lisp Interaction
Minor modes in effect:
tooltip-mode: t
electric-indent-mode: t
mouse-wheel-mode: t
tool-bar-mode: t
menu-bar-mode: t
file-name-shadow-mode: t
global-font-lock-mode: t
font-lock-mode: t
blink-cursor-mode: t
auto-composition-mode: t
auto-encryption-mode: t
auto-compression-mode: t
line-number-mode: t
transient-mark-mode: t
Recent messages:
Load-path shadows:
None found.
Features:
(shadow sort gnus-util mail-extr emacsbug message format-spec rfc822 mml
mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev
gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util
mail-prsvr mail-utils thai-util thai-word mule-util info mule-diag
help-mode easymenu advice help-fns time-date japan-util tooltip electric
uniquify ediff-hook vc-hooks lisp-float-type mwheel dos-w32 ls-lisp
w32-common-fns disp-table w32-win w32-vars tool-bar dnd fontset image
regexp-opt fringe tabulated-list newcomment lisp-mode prog-mode register
page menu-bar rfn-eshadow timer select scroll-bar mouse jit-lock
font-lock syntax facemenu font-core frame cham georgian utf-8-lang
misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew
greek romanian slovak czech european ethiopic indian cyrillic chinese
case-table epa-hook jka-cmpr-hook help simple abbrev minibuffer nadvice
loaddefs button faces cus-face macroexp files text-properties overlay
sha1 md5 base64 format env code-pages mule custom widget
hashtable-print-readable backquote make-network-process w32notify w32
multi-tty emacs)
Memory information:
((conses 8 164471 6457)
(symbols 32 27891 0)
(miscs 32 55 271)
(strings 16 24013 5282)
(string-bytes 1 514589)
(vectors 8 11535)
(vector-slots 4 515427 5354)
(floats 8 65 359)
(intervals 28 317 19)
(buffers 508 16))
^ permalink raw reply [flat|nested] 7+ messages in thread
* bug#23698: 24.5; broken character name
2016-06-05 13:06 bug#23698: 24.5; broken character name ynyaaa
@ 2016-06-05 15:44 ` Eli Zaretskii
2016-06-05 16:44 ` Eli Zaretskii
2016-06-09 14:32 ` handa
2016-06-06 17:49 ` Paul Eggert
1 sibling, 2 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-05 15:44 UTC (permalink / raw)
To: ynyaaa, Kenichi Handa; +Cc: 23698
> From: ynyaaa@gmail.com
> Date: Sun, 05 Jun 2016 22:06:23 +0900
>
>
> Just after starting emacs -Q, this form returns correct value.
>
> (get-char-code-property #xE01 'name)
> =>"THAI CHARACTER KO KAI"
>
> Then display THAI characters by typing this.
> M-x list-charset-chars RET thai-iso8859-11 RET
>
> After THAI characters are displayed, the form returns wrong value.
>
> (get-char-code-property #xE01 'name)
> =>"LETTER KO KAI"
Displaying the list of Thai characters loads thai-util.el, which
deliberately overwrites the names derived from the Unicode Character
Database with its own variants. I'm CC'ing Handa-san, who added that
code back in 2008, in the hope that he could tell why are we doing
that, and whether this is still needed nowadays.
Thanks.
^ permalink raw reply [flat|nested] 7+ messages in thread
* bug#23698: 24.5; broken character name
2016-06-05 15:44 ` Eli Zaretskii
@ 2016-06-05 16:44 ` Eli Zaretskii
2016-06-09 14:32 ` handa
1 sibling, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-05 16:44 UTC (permalink / raw)
To: Kenichi Handa; +Cc: 23698, ynyaaa
> Date: Sun, 05 Jun 2016 18:44:56 +0300
> From: Eli Zaretskii <eliz@gnu.org>
> Cc: 23698@debbugs.gnu.org
>
> Displaying the list of Thai characters loads thai-util.el, which
> deliberately overwrites the names derived from the Unicode Character
> Database with its own variants. I'm CC'ing Handa-san, who added that
> code back in 2008, in the hope that he could tell why are we doing
> that, and whether this is still needed nowadays.
Actually, this code is much older: we have it since 1997, i.e. before
we started using the UCD for these purposes. So I think we can either
remove it or use a property that doesn't clash with the Unicode
standard properties.
Btw, the same problem exists with Lao (see lao-util.el).
^ permalink raw reply [flat|nested] 7+ messages in thread
* bug#23698: 24.5; broken character name
2016-06-05 15:44 ` Eli Zaretskii
2016-06-05 16:44 ` Eli Zaretskii
@ 2016-06-09 14:32 ` handa
2016-06-09 14:37 ` Eli Zaretskii
1 sibling, 1 reply; 7+ messages in thread
From: handa @ 2016-06-09 14:32 UTC (permalink / raw)
To: Eli Zaretskii; +Cc: 23698, ynyaaa
In article <83y46jtzbb.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:
> Displaying the list of Thai characters loads thai-util.el, which
> deliberately overwrites the names derived from the Unicode Character
> Database with its own variants. I'm CC'ing Handa-san, who added that
> code back in 2008, in the hope that he could tell why are we doing
> that, and whether this is still needed nowadays.
Long ago, I discussed with Thai/Lao people about how to support their
languages in Mule. At that time, as I can't speak those languages, we
refer each character by name in the discussion. And, for debugging the
support code, name property was very useful. That's the reason for
those properties. Now, I think we can get rid them.
---
K. Handa
handa@gnu.org
^ permalink raw reply [flat|nested] 7+ messages in thread
* bug#23698: 24.5; broken character name
2016-06-09 14:32 ` handa
@ 2016-06-09 14:37 ` Eli Zaretskii
0 siblings, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-09 14:37 UTC (permalink / raw)
To: handa; +Cc: ynyaaa, 23698-done
> From: handa <handa@gnu.org>
> Cc: ynyaaa@gmail.com, 23698@debbugs.gnu.org
> Date: Thu, 09 Jun 2016 23:32:21 +0900
>
> In article <83y46jtzbb.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:
>
> > Displaying the list of Thai characters loads thai-util.el, which
> > deliberately overwrites the names derived from the Unicode Character
> > Database with its own variants. I'm CC'ing Handa-san, who added that
> > code back in 2008, in the hope that he could tell why are we doing
> > that, and whether this is still needed nowadays.
>
> Long ago, I discussed with Thai/Lao people about how to support their
> languages in Mule. At that time, as I can't speak those languages, we
> refer each character by name in the discussion. And, for debugging the
> support code, name property was very useful. That's the reason for
> those properties. Now, I think we can get rid them.
OK, thanks. So I guess Paul's change did TRT, and we can close this
bug report.
^ permalink raw reply [flat|nested] 7+ messages in thread
* bug#23698: 24.5; broken character name
2016-06-05 13:06 bug#23698: 24.5; broken character name ynyaaa
2016-06-05 15:44 ` Eli Zaretskii
@ 2016-06-06 17:49 ` Paul Eggert
2016-06-06 19:00 ` Eli Zaretskii
1 sibling, 1 reply; 7+ messages in thread
From: Paul Eggert @ 2016-06-06 17:49 UTC (permalink / raw)
To: ynyaaa; +Cc: 23698-done
[-- Attachment #1: Type: text/plain, Size: 399 bytes --]
> we have it since 1997, i.e. before
> we started using the UCD for these purposes. So I think we can either
> remove it or use a property that doesn't clash with the Unicode
> standard properties.
It appears to be unnecessary these days, so I removed it in the attached
patch to master and am marking the bug as done. If I'm wrong and we need
it we can use a different property as you suggest.
[-- Attachment #2: 0001-Use-standard-Unicode-names-for-Thai-Lao.txt --]
[-- Type: text/plain, Size: 15518 bytes --]
From 130aa7166b940fef59e832c68e76eef87cdc96e5 Mon Sep 17 00:00:00 2001
From: Paul Eggert <eggert@cs.ucla.edu>
Date: Mon, 6 Jun 2016 10:43:10 -0700
Subject: [PATCH] Use standard Unicode names for Thai, Lao
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
* lisp/language/lao-util.el, lisp/language/thai-util.el:
Don’t override standard Unicode character name (Bug#23698).
---
lisp/language/lao-util.el | 191 ++++++++++++++++++++++-----------------------
lisp/language/thai-util.el | 191 ++++++++++++++++++++++-----------------------
2 files changed, 190 insertions(+), 192 deletions(-)
diff --git a/lisp/language/lao-util.el b/lisp/language/lao-util.el
index 6103860..a42904f 100644
--- a/lisp/language/lao-util.el
+++ b/lisp/language/lao-util.el
@@ -38,100 +38,100 @@ ?s
(define-category ?v "Lao upper/lower vowel" lao-category-table)
(define-category ?t "Lao tone" lao-category-table)
-(let ((l '((?ກ consonant "LETTER KOR KAI'" "CHICKEN")
- (?ຂ consonant "LETTER KHOR KHAI'" "EGG")
- (? invalid nil)
- (?ຄ consonant "LETTER QHOR QHWARGN" "BUFFALO")
- (? invalid nil)
- (? invalid nil)
- (?ງ consonant "LETTER NGOR NGUU" "SNAKE")
- (?ຈ consonant "LETTER JOR JUA" "BUDDHIST NOVICE")
- (?ຉ invalid nil)
- (?ຊ consonant "LETTER XOR X\"ARNG" "ELEPHANT")
- (? invalid nil)
- (?ຌ invalid nil)
- (?ຍ consonant "LETTER YOR YUNG" "MOSQUITO")
- (?ຎ invalid nil)
- (?ຎ invalid nil)
- (?ຎ invalid nil)
- (?ຎ invalid nil)
- (?ຎ invalid nil)
- (?ຎ invalid nil)
- (?ດ consonant "LETTER DOR DANG" "NOSE")
- (?ຕ consonant "LETTER TOR TAR" "EYE")
- (?ຖ consonant "LETTER THOR THUNG" "TO ASK,QUESTION")
- (?ທ consonant "LETTER DHOR DHARM" "FLAG")
- (?ຘ invalid nil)
- (?ນ consonant "LETTER NOR NOK" "BIRD")
- (?ບ consonant "LETTER BOR BED" "FISHHOOK")
- (?ປ consonant "LETTER POR PAR" "FISH")
- (?ຜ consonant "LETTER HPOR HPER\"" "BEE")
- (?ຝ consonant "LETTER FHOR FHAR" "WALL")
- (?ພ consonant "LETTER PHOR PHUU" "MOUNTAIN")
- (?ຟ consonant "LETTER FOR FAI" "FIRE")
- (?ຠ invalid nil)
- (?ມ consonant "LETTER MOR MAR\"" "HORSE")
- (?ຢ consonant "LETTER GNOR GNAR" "MEDICINE")
- (?ຣ consonant "LETTER ROR ROD" "CAR")
- (? invalid nil)
- (?ລ consonant "LETTER LOR LIING" "MONKEY")
- (? invalid nil)
- (?ວ consonant "LETTER WOR WII" "HAND FAN")
- (?ຨ invalid nil)
- (?ຩ invalid nil)
- (?ສ consonant "LETTER SOR SEA" "TIGER")
- (?ຫ consonant "LETTER HHOR HHAI" "JAR")
- (?ຬ invalid nil)
- (?ອ consonant "LETTER OR OOW" "TAKE")
- (?ຮ consonant "LETTER HOR HEA" "BOAT")
- (?ຯ special "ELLIPSIS")
- (?ະ vowel-base "VOWEL SIGN SARA A")
- (?ັ vowel-upper "VOWEL SIGN MAI KAN")
- (?າ vowel-base "VOWEL SIGN SARA AR")
- (?ຳ vowel-base "VOWEL SIGN SARA AM")
- (?ິ vowel-upper "VOWEL SIGN SARA I")
- (?ີ vowel-upper "VOWEL SIGN SARA II")
- (?ຶ vowel-upper "VOWEL SIGN SARA EU")
- (?ື vowel-upper "VOWEL SIGN SARA UR")
- (?ຸ vowel-lower "VOWEL SIGN SARA U")
- (?ູ vowel-lower "VOWEL SIGN SARA UU")
- (?຺ invalid nil)
- (?ົ vowel-upper "VOWEL SIGN MAI KONG")
- (?ຼ semivowel-lower "SEMIVOWEL SIGN LO")
- (?ຽ vowel-base "SEMIVOWEL SIGN SARA IA")
- (? invalid nil)
- (? invalid nil)
- (?ເ vowel-base "VOWEL SIGN SARA EE")
- (?ແ vowel-base "VOWEL SIGN SARA AA")
- (?ໂ vowel-base "VOWEL SIGN SARA OO")
- (?ໃ vowel-base "VOWEL SIGN SARA EI MAI MUAN\"")
- (?ໄ vowel-base "VOWEL SIGN SARA AI MAI MAY")
- (? invalid nil)
- (?ໆ special "KO LA (REPETITION)")
- (? invalid nil)
- (?່ tone "TONE MAI EK")
- (?້ tone "TONE MAI THO")
- (?໊ tone "TONE MAI TI")
- (?໋ tone "TONE MAI JADTAWAR")
- (?໌ tone "CANCELLATION MARK")
- (?ໍ vowel-upper "VOWEL SIGN SARA OR")
- (?໎ invalid nil)
- (? invalid nil)
- (?໐ special "DIGIT ZERO")
- (?໑ special "DIGIT ONE")
- (?໒ special "DIGIT TWO")
- (?໓ special "DIGIT THREE")
- (?໔ special "DIGIT FOUR")
- (?໕ special "DIGIT FIVE")
- (?໖ special "DIGIT SIX")
- (?໗ special "DIGIT SEVEN")
- (?໘ special "DIGIT EIGHT")
- (?໙ special "DIGIT NINE")
- (? invalid nil)
- (? invalid nil)
- (?ໜ consonant "LETTER NHOR NHUU" "MOUSE")
- (?ໝ consonant "LETTER MHOR MHAR" "DOG")
- (?ໞ invalid nil)))
+(let ((l '((?ກ consonant "CHICKEN")
+ (?ຂ consonant "EGG")
+ (? invalid)
+ (?ຄ consonant "BUFFALO")
+ (? invalid)
+ (? invalid)
+ (?ງ consonant "SNAKE")
+ (?ຈ consonant "BUDDHIST NOVICE")
+ (?ຉ invalid)
+ (?ຊ consonant "ELEPHANT")
+ (? invalid)
+ (?ຌ invalid)
+ (?ຍ consonant "MOSQUITO")
+ (?ຎ invalid)
+ (?ຎ invalid)
+ (?ຎ invalid)
+ (?ຎ invalid)
+ (?ຎ invalid)
+ (?ຎ invalid)
+ (?ດ consonant "NOSE")
+ (?ຕ consonant "EYE")
+ (?ຖ consonant "TO ASK,QUESTION")
+ (?ທ consonant "FLAG")
+ (?ຘ invalid)
+ (?ນ consonant "BIRD")
+ (?ບ consonant "FISHHOOK")
+ (?ປ consonant "FISH")
+ (?ຜ consonant "BEE")
+ (?ຝ consonant "WALL")
+ (?ພ consonant "MOUNTAIN")
+ (?ຟ consonant "FIRE")
+ (?ຠ invalid)
+ (?ມ consonant "HORSE")
+ (?ຢ consonant "MEDICINE")
+ (?ຣ consonant "CAR")
+ (? invalid)
+ (?ລ consonant "MONKEY")
+ (? invalid)
+ (?ວ consonant "HAND FAN")
+ (?ຨ invalid)
+ (?ຩ invalid)
+ (?ສ consonant "TIGER")
+ (?ຫ consonant "JAR")
+ (?ຬ invalid)
+ (?ອ consonant "TAKE")
+ (?ຮ consonant "BOAT")
+ (?ຯ special)
+ (?ະ vowel-base)
+ (?ັ vowel-upper)
+ (?າ vowel-base)
+ (?ຳ vowel-base)
+ (?ິ vowel-upper)
+ (?ີ vowel-upper)
+ (?ຶ vowel-upper)
+ (?ື vowel-upper)
+ (?ຸ vowel-lower)
+ (?ູ vowel-lower)
+ (?຺ invalid)
+ (?ົ vowel-upper)
+ (?ຼ semivowel-lower)
+ (?ຽ vowel-base)
+ (? invalid)
+ (? invalid)
+ (?ເ vowel-base)
+ (?ແ vowel-base)
+ (?ໂ vowel-base)
+ (?ໃ vowel-base)
+ (?ໄ vowel-base)
+ (? invalid)
+ (?ໆ special)
+ (? invalid)
+ (?່ tone)
+ (?້ tone)
+ (?໊ tone)
+ (?໋ tone)
+ (?໌ tone)
+ (?ໍ vowel-upper)
+ (?໎ invalid)
+ (? invalid)
+ (?໐ special)
+ (?໑ special)
+ (?໒ special)
+ (?໓ special)
+ (?໔ special)
+ (?໕ special)
+ (?໖ special)
+ (?໗ special)
+ (?໘ special)
+ (?໙ special)
+ (? invalid)
+ (? invalid)
+ (?ໜ consonant "MOUSE")
+ (?ໝ consonant "DOG")
+ (?ໞ invalid)))
elm)
(while l
(setq elm (car l) l (cdr l))
@@ -146,8 +146,7 @@ ?t
((eq ptype 'tone)
(modify-category-entry char ?t lao-category-table)))
(put-char-code-property char 'phonetic-type ptype)
- (put-char-code-property char 'name (nth 2 elm))
- (put-char-code-property char 'meaning (nth 3 elm)))))
+ (put-char-code-property char 'meaning (nth 2 elm)))))
;; The general composing rules are as follows:
;;
diff --git a/lisp/language/thai-util.el b/lisp/language/thai-util.el
index a9af5cf..618af57 100644
--- a/lisp/language/thai-util.el
+++ b/lisp/language/thai-util.el
@@ -55,100 +55,100 @@ thai-composition-pattern
"\\cc\\(\\cu\\|\\cI\\cU\\|\\cv\\ct?\\)\\|\\cv\\ct\\|\\cI\\cU"
"Regular expression matching a Thai composite sequence.")
-(let ((l '((?ก consonant "LETTER KO KAI") ; 0xA1
- (?ข consonant "LETTER KHO KHAI") ; 0xA2
- (?ฃ consonant "LETTER KHO KHUAT") ; 0xA3
- (?ค consonant "LETTER KHO KHWAI") ; 0xA4
- (?ฅ consonant "LETTER KHO KHON") ; 0xA5
- (?ฆ consonant "LETTER KHO RAKHANG") ; 0xA6
- (?ง consonant "LETTER NGO NGU") ; 0xA7
- (?จ consonant "LETTER CHO CHAN") ; 0xA8
- (?ฉ consonant "LETTER CHO CHING") ; 0xA9
- (?ช consonant "LETTER CHO CHANG") ; 0xAA
- (?ซ consonant "LETTER SO SO") ; 0xAB
- (?ฌ consonant "LETTER CHO CHOE") ; 0xAC
- (?ญ consonant "LETTER YO YING") ; 0xAD
- (?ฎ consonant "LETTER DO CHADA") ; 0xAE
- (?ฏ consonant "LETTER TO PATAK") ; 0xAF
- (?ฐ consonant "LETTER THO THAN") ; 0xB0
- (?ฑ consonant "LETTER THO NANGMONTHO") ; 0xB1
- (?ฒ consonant "LETTER THO PHUTHAO") ; 0xB2
- (?ณ consonant "LETTER NO NEN") ; 0xB3
- (?ด consonant "LETTER DO DEK") ; 0xB4
- (?ต consonant "LETTER TO TAO") ; 0xB5
- (?ถ consonant "LETTER THO THUNG") ; 0xB6
- (?ท consonant "LETTER THO THAHAN") ; 0xB7
- (?ธ consonant "LETTER THO THONG") ; 0xB8
- (?น consonant "LETTER NO NU") ; 0xB9
- (?บ consonant "LETTER BO BAIMAI") ; 0xBA
- (?ป consonant "LETTER PO PLA") ; 0xBB
- (?ผ consonant "LETTER PHO PHUNG") ; 0xBC
- (?ฝ consonant "LETTER FO FA") ; 0xBD
- (?พ consonant "LETTER PHO PHAN") ; 0xBE
- (?ฟ consonant "LETTER FO FAN") ; 0xBF
- (?ภ consonant "LETTER PHO SAMPHAO") ; 0xC0
- (?ม consonant "LETTER MO MA") ; 0xC1
- (?ย consonant "LETTER YO YAK") ; 0xC2
- (?ร consonant "LETTER RO RUA") ; 0xC3
- (?ฤ vowel-base "LETTER RU (Pali vowel letter)") ; 0xC4
- (?ล consonant "LETTER LO LING") ; 0xC5
- (?ฦ vowel-base "LETTER LU (Pali vowel letter)") ; 0xC6
- (?ว consonant "LETTER WO WAEN") ; 0xC7
- (?ศ consonant "LETTER SO SALA") ; 0xC8
- (?ษ consonant "LETTER SO RUSI") ; 0xC9
- (?ส consonant "LETTER SO SUA") ; 0xCA
- (?ห consonant "LETTER HO HIP") ; 0xCB
- (?ฬ consonant "LETTER LO CHULA") ; 0xCC
- (?อ consonant "LETTER O ANG") ; 0xCD
- (?ฮ consonant "LETTER HO NOK HUK") ; 0xCE
- (?ฯ special "PAI YAN NOI (abbreviation)") ; 0xCF
- (?ะ vowel-base "VOWEL SIGN SARA A") ; 0xD0
- (?ั vowel-upper "VOWEL SIGN MAI HAN-AKAT N/S-T") ; 0xD1
- (?า vowel-base "VOWEL SIGN SARA AA") ; 0xD2
- (?ำ vowel-base "VOWEL SIGN SARA AM") ; 0xD3
- (?ิ vowel-upper "VOWEL SIGN SARA I N/S-T") ; 0xD4
- (?ี vowel-upper "VOWEL SIGN SARA II N/S-T") ; 0xD5
- (?ึ vowel-upper "VOWEL SIGN SARA UE N/S-T") ; 0xD6
- (?ื vowel-upper "VOWEL SIGN SARA UEE N/S-T") ; 0xD7
- (?ุ vowel-lower "VOWEL SIGN SARA U N/S-B") ; 0xD8
- (?ู vowel-lower "VOWEL SIGN SARA UU N/S-B") ; 0xD9
- (?ฺ vowel-lower "VOWEL SIGN PHINTHU N/S-B (Pali virama)") ; 0xDA
- (? invalid nil) ; 0xDA
- (? invalid nil) ; 0xDC
- (? invalid nil) ; 0xDC
- (? invalid nil) ; 0xDC
- (?฿ special "BAHT SIGN (currency symbol)") ; 0xDF
- (?เ vowel-base "VOWEL SIGN SARA E") ; 0xE0
- (?แ vowel-base "VOWEL SIGN SARA AE") ; 0xE1
- (?โ vowel-base "VOWEL SIGN SARA O") ; 0xE2
- (?ใ vowel-base "VOWEL SIGN SARA MAI MUAN") ; 0xE3
- (?ไ vowel-base "VOWEL SIGN SARA MAI MALAI") ; 0xE4
- (?ๅ vowel-base "LAK KHANG YAO") ; 0xE5
- (?ๆ special "MAI YAMOK (repetition)") ; 0xE6
- (?็ sign-upper "VOWEL SIGN MAI TAI KHU N/S-T") ; 0xE7
- (?่ tone "TONE MAI EK N/S-T") ; 0xE8
- (?้ tone "TONE MAI THO N/S-T") ; 0xE9
- (?๊ tone "TONE MAI TRI N/S-T") ; 0xEA
- (?๋ tone "TONE MAI CHATTAWA N/S-T") ; 0xEB
- (?์ sign-upper "THANTHAKHAT N/S-T (cancellation mark)") ; 0xEC
- (?ํ sign-upper "NIKKHAHIT N/S-T (final nasal)") ; 0xED
- (?๎ sign-upper "YAMAKKAN N/S-T") ; 0xEE
- (?๏ special "FONRMAN") ; 0xEF
- (?๐ special "DIGIT ZERO") ; 0xF0
- (?๑ special "DIGIT ONE") ; 0xF1
- (?๒ special "DIGIT TWO") ; 0xF2
- (?๓ special "DIGIT THREE") ; 0xF3
- (?๔ special "DIGIT FOUR") ; 0xF4
- (?๕ special "DIGIT FIVE") ; 0xF5
- (?๖ special "DIGIT SIX") ; 0xF6
- (?๗ special "DIGIT SEVEN") ; 0xF7
- (?๘ special "DIGIT EIGHT") ; 0xF8
- (?๙ special "DIGIT NINE") ; 0xF9
- (?๚ special "ANGKHANKHU (ellipsis)") ; 0xFA
- (?๛ special "KHOMUT (beginning of religious texts)") ; 0xFB
- (? invalid nil) ; 0xFC
- (? invalid nil) ; 0xFD
- (? invalid nil) ; 0xFE
+(let ((l '((?ก consonant) ; 0xA1
+ (?ข consonant) ; 0xA2
+ (?ฃ consonant) ; 0xA3
+ (?ค consonant) ; 0xA4
+ (?ฅ consonant) ; 0xA5
+ (?ฆ consonant) ; 0xA6
+ (?ง consonant) ; 0xA7
+ (?จ consonant) ; 0xA8
+ (?ฉ consonant) ; 0xA9
+ (?ช consonant) ; 0xAA
+ (?ซ consonant) ; 0xAB
+ (?ฌ consonant) ; 0xAC
+ (?ญ consonant) ; 0xAD
+ (?ฎ consonant) ; 0xAE
+ (?ฏ consonant) ; 0xAF
+ (?ฐ consonant) ; 0xB0
+ (?ฑ consonant) ; 0xB1
+ (?ฒ consonant) ; 0xB2
+ (?ณ consonant) ; 0xB3
+ (?ด consonant) ; 0xB4
+ (?ต consonant) ; 0xB5
+ (?ถ consonant) ; 0xB6
+ (?ท consonant) ; 0xB7
+ (?ธ consonant) ; 0xB8
+ (?น consonant) ; 0xB9
+ (?บ consonant) ; 0xBA
+ (?ป consonant) ; 0xBB
+ (?ผ consonant) ; 0xBC
+ (?ฝ consonant) ; 0xBD
+ (?พ consonant) ; 0xBE
+ (?ฟ consonant) ; 0xBF
+ (?ภ consonant) ; 0xC0
+ (?ม consonant) ; 0xC1
+ (?ย consonant) ; 0xC2
+ (?ร consonant) ; 0xC3
+ (?ฤ vowel-base) ; 0xC4
+ (?ล consonant) ; 0xC5
+ (?ฦ vowel-base) ; 0xC6
+ (?ว consonant) ; 0xC7
+ (?ศ consonant) ; 0xC8
+ (?ษ consonant) ; 0xC9
+ (?ส consonant) ; 0xCA
+ (?ห consonant) ; 0xCB
+ (?ฬ consonant) ; 0xCC
+ (?อ consonant) ; 0xCD
+ (?ฮ consonant) ; 0xCE
+ (?ฯ special) ; 0xCF
+ (?ะ vowel-base) ; 0xD0
+ (?ั vowel-upper) ; 0xD1
+ (?า vowel-base) ; 0xD2
+ (?ำ vowel-base) ; 0xD3
+ (?ิ vowel-upper) ; 0xD4
+ (?ี vowel-upper) ; 0xD5
+ (?ึ vowel-upper) ; 0xD6
+ (?ื vowel-upper) ; 0xD7
+ (?ุ vowel-lower) ; 0xD8
+ (?ู vowel-lower) ; 0xD9
+ (?ฺ vowel-lower) ; 0xDA
+ (? invalid) ; 0xDA
+ (? invalid) ; 0xDC
+ (? invalid) ; 0xDC
+ (? invalid) ; 0xDC
+ (?฿ special) ; 0xDF
+ (?เ vowel-base) ; 0xE0
+ (?แ vowel-base) ; 0xE1
+ (?โ vowel-base) ; 0xE2
+ (?ใ vowel-base) ; 0xE3
+ (?ไ vowel-base) ; 0xE4
+ (?ๅ vowel-base) ; 0xE5
+ (?ๆ special) ; 0xE6
+ (?็ sign-upper) ; 0xE7
+ (?่ tone) ; 0xE8
+ (?้ tone) ; 0xE9
+ (?๊ tone) ; 0xEA
+ (?๋ tone) ; 0xEB
+ (?์ sign-upper) ; 0xEC
+ (?ํ sign-upper) ; 0xED
+ (?๎ sign-upper) ; 0xEE
+ (?๏ special) ; 0xEF
+ (?๐ special) ; 0xF0
+ (?๑ special) ; 0xF1
+ (?๒ special) ; 0xF2
+ (?๓ special) ; 0xF3
+ (?๔ special) ; 0xF4
+ (?๕ special) ; 0xF5
+ (?๖ special) ; 0xF6
+ (?๗ special) ; 0xF7
+ (?๘ special) ; 0xF8
+ (?๙ special) ; 0xF9
+ (?๚ special) ; 0xFA
+ (?๛ special) ; 0xFB
+ (? invalid) ; 0xFC
+ (? invalid) ; 0xFD
+ (? invalid) ; 0xFE
))
elm)
(while l
@@ -170,8 +170,7 @@ thai-composition-pattern
(modify-category-entry char ?u thai-category-table)
(if (= char ?์)
;; Give category `U' to "THANTHAKHAT".
- (modify-category-entry char ?U thai-category-table))))
- (put-char-code-property char 'name (nth 2 elm)))))
+ (modify-category-entry char ?U thai-category-table)))))))
(defun thai-compose-syllable (beg end &optional category-set string)
(or category-set
--
2.5.5
^ permalink raw reply related [flat|nested] 7+ messages in thread
* bug#23698: 24.5; broken character name
2016-06-06 17:49 ` Paul Eggert
@ 2016-06-06 19:00 ` Eli Zaretskii
0 siblings, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-06 19:00 UTC (permalink / raw)
To: Paul Eggert; +Cc: 23698, ynyaaa
> Cc: 23698-done@debbugs.gnu.org, Eli Zaretskii <eliz@gnu.org>,
> Kenichi Handa <handa@gnu.org>
> From: Paul Eggert <eggert@cs.ucla.edu>
> Date: Mon, 6 Jun 2016 10:49:12 -0700
>
> > we have it since 1997, i.e. before
> > we started using the UCD for these purposes. So I think we can either
> > remove it or use a property that doesn't clash with the Unicode
> > standard properties.
> It appears to be unnecessary these days, so I removed it in the attached
> patch to master and am marking the bug as done. If I'm wrong and we need
> it we can use a different property as you suggest.
I'd still like to hear Handa-san's opinions on this.
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2016-06-09 14:37 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-06-05 13:06 bug#23698: 24.5; broken character name ynyaaa
2016-06-05 15:44 ` Eli Zaretskii
2016-06-05 16:44 ` Eli Zaretskii
2016-06-09 14:32 ` handa
2016-06-09 14:37 ` Eli Zaretskii
2016-06-06 17:49 ` Paul Eggert
2016-06-06 19:00 ` Eli Zaretskii
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/emacs.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).