all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
* bug#23698: 24.5; broken character name
@ 2016-06-05 13:06 ynyaaa
  2016-06-05 15:44 ` Eli Zaretskii
  2016-06-06 17:49 ` Paul Eggert
  0 siblings, 2 replies; 7+ messages in thread
From: ynyaaa @ 2016-06-05 13:06 UTC (permalink / raw)
  To: 23698


Just after starting emacs -Q, this form returns correct value.

(get-char-code-property #xE01 'name)
=>"THAI CHARACTER KO KAI"

Then display THAI characters by typing this.
  M-x list-charset-chars RET thai-iso8859-11 RET

After THAI characters are displayed, the form returns wrong value.

(get-char-code-property #xE01 'name)
=>"LETTER KO KAI"

For other THAI characters, get-char-code-property returns wrong names.

UCS  correct name                    wrong name
E01  THAI CHARACTER KO KAI           LETTER KO KAI
E02  THAI CHARACTER KHO KHAI         LETTER KHO KHAI
E03  THAI CHARACTER KHO KHUAT        LETTER KHO KHUAT
E04  THAI CHARACTER KHO KHWAI        LETTER KHO KHWAI
E05  THAI CHARACTER KHO KHON         LETTER KHO KHON
E06  THAI CHARACTER KHO RAKHANG      LETTER KHO RAKHANG
E07  THAI CHARACTER NGO NGU          LETTER NGO NGU
E08  THAI CHARACTER CHO CHAN         LETTER CHO CHAN
E09  THAI CHARACTER CHO CHING        LETTER CHO CHING
E0A  THAI CHARACTER CHO CHANG        LETTER CHO CHANG
E0B  THAI CHARACTER SO SO            LETTER SO SO
E0C  THAI CHARACTER CHO CHOE         LETTER CHO CHOE
E0D  THAI CHARACTER YO YING          LETTER YO YING
E0E  THAI CHARACTER DO CHADA         LETTER DO CHADA
E0F  THAI CHARACTER TO PATAK         LETTER TO PATAK
E10  THAI CHARACTER THO THAN         LETTER THO THAN
E11  THAI CHARACTER THO NANGMONTHO   LETTER THO NANGMONTHO
E12  THAI CHARACTER THO PHUTHAO      LETTER THO PHUTHAO
E13  THAI CHARACTER NO NEN           LETTER NO NEN
E14  THAI CHARACTER DO DEK           LETTER DO DEK
E15  THAI CHARACTER TO TAO           LETTER TO TAO
E16  THAI CHARACTER THO THUNG        LETTER THO THUNG
E17  THAI CHARACTER THO THAHAN       LETTER THO THAHAN
E18  THAI CHARACTER THO THONG        LETTER THO THONG
E19  THAI CHARACTER NO NU            LETTER NO NU
E1A  THAI CHARACTER BO BAIMAI        LETTER BO BAIMAI
E1B  THAI CHARACTER PO PLA           LETTER PO PLA
E1C  THAI CHARACTER PHO PHUNG        LETTER PHO PHUNG
E1D  THAI CHARACTER FO FA            LETTER FO FA
E1E  THAI CHARACTER PHO PHAN         LETTER PHO PHAN
E1F  THAI CHARACTER FO FAN           LETTER FO FAN
E20  THAI CHARACTER PHO SAMPHAO      LETTER PHO SAMPHAO
E21  THAI CHARACTER MO MA            LETTER MO MA
E22  THAI CHARACTER YO YAK           LETTER YO YAK
E23  THAI CHARACTER RO RUA           LETTER RO RUA
E24  THAI CHARACTER RU               LETTER RU (Pali vowel letter)
E25  THAI CHARACTER LO LING          LETTER LO LING
E26  THAI CHARACTER LU               LETTER LU (Pali vowel letter)
E27  THAI CHARACTER WO WAEN          LETTER WO WAEN
E28  THAI CHARACTER SO SALA          LETTER SO SALA
E29  THAI CHARACTER SO RUSI          LETTER SO RUSI
E2A  THAI CHARACTER SO SUA           LETTER SO SUA
E2B  THAI CHARACTER HO HIP           LETTER HO HIP
E2C  THAI CHARACTER LO CHULA         LETTER LO CHULA
E2D  THAI CHARACTER O ANG            LETTER O ANG
E2E  THAI CHARACTER HO NOKHUK        LETTER HO NOK HUK
E2F  THAI CHARACTER PAIYANNOI        PAI YAN NOI (abbreviation)
E30  THAI CHARACTER SARA A           VOWEL SIGN SARA A
E31  THAI CHARACTER MAI HAN-AKAT     VOWEL SIGN MAI HAN-AKAT N/S-T
E32  THAI CHARACTER SARA AA          VOWEL SIGN SARA AA
E33  THAI CHARACTER SARA AM          VOWEL SIGN SARA AM
E34  THAI CHARACTER SARA I           VOWEL SIGN SARA I N/S-T
E35  THAI CHARACTER SARA II          VOWEL SIGN SARA II N/S-T
E36  THAI CHARACTER SARA UE          VOWEL SIGN SARA UE N/S-T
E37  THAI CHARACTER SARA UEE         VOWEL SIGN SARA UEE N/S-T
E38  THAI CHARACTER SARA U           VOWEL SIGN SARA U N/S-B
E39  THAI CHARACTER SARA UU          VOWEL SIGN SARA UU N/S-B
E3A  THAI CHARACTER PHINTHU          VOWEL SIGN PHINTHU N/S-B (Pali virama)
E3F  THAI CURRENCY SYMBOL BAHT       BAHT SIGN (currency symbol)
E40  THAI CHARACTER SARA E           VOWEL SIGN SARA E
E41  THAI CHARACTER SARA AE          VOWEL SIGN SARA AE
E42  THAI CHARACTER SARA O           VOWEL SIGN SARA O
E43  THAI CHARACTER SARA AI MAIMUAN  VOWEL SIGN SARA MAI MUAN
E44  THAI CHARACTER SARA AI MAIMALAI VOWEL SIGN SARA MAI MALAI
E45  THAI CHARACTER LAKKHANGYAO      LAK KHANG YAO
E46  THAI CHARACTER MAIYAMOK         MAI YAMOK (repetition)
E47  THAI CHARACTER MAITAIKHU        VOWEL SIGN MAI TAI KHU N/S-T
E48  THAI CHARACTER MAI EK           TONE MAI EK N/S-T
E49  THAI CHARACTER MAI THO          TONE MAI THO N/S-T
E4A  THAI CHARACTER MAI TRI          TONE MAI TRI N/S-T
E4B  THAI CHARACTER MAI CHATTAWA     TONE MAI CHATTAWA N/S-T
E4C  THAI CHARACTER THANTHAKHAT      THANTHAKHAT N/S-T (cancellation mark)
E4D  THAI CHARACTER NIKHAHIT         NIKKHAHIT N/S-T (final nasal)
E4E  THAI CHARACTER YAMAKKAN         YAMAKKAN N/S-T
E4F  THAI CHARACTER FONGMAN          FONRMAN
E50  THAI DIGIT ZERO                 DIGIT ZERO
E51  THAI DIGIT ONE                  DIGIT ONE
E52  THAI DIGIT TWO                  DIGIT TWO
E53  THAI DIGIT THREE                DIGIT THREE
E54  THAI DIGIT FOUR                 DIGIT FOUR
E55  THAI DIGIT FIVE                 DIGIT FIVE
E56  THAI DIGIT SIX                  DIGIT SIX
E57  THAI DIGIT SEVEN                DIGIT SEVEN
E58  THAI DIGIT EIGHT                DIGIT EIGHT
E59  THAI DIGIT NINE                 DIGIT NINE
E5A  THAI CHARACTER ANGKHANKHU       ANGKHANKHU (ellipsis)
E5B  THAI CHARACTER KHOMUT           KHOMUT (beginning of religious texts)




In GNU Emacs 24.5.1 (i686-pc-mingw32)
 of 2015-04-11 on LEG570
Windowing system distributor `Microsoft Corp.', version 6.0.6002
Configured using:
 `configure --prefix=/c/usr --host=i686-pc-mingw32'

Important settings:
  value of $LANG: JPN
  locale-coding-system: cp932

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Recent messages:

Load-path shadows:
None found.

Features:
(shadow sort gnus-util mail-extr emacsbug message format-spec rfc822 mml
mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev
gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util
mail-prsvr mail-utils thai-util thai-word mule-util info mule-diag
help-mode easymenu advice help-fns time-date japan-util tooltip electric
uniquify ediff-hook vc-hooks lisp-float-type mwheel dos-w32 ls-lisp
w32-common-fns disp-table w32-win w32-vars tool-bar dnd fontset image
regexp-opt fringe tabulated-list newcomment lisp-mode prog-mode register
page menu-bar rfn-eshadow timer select scroll-bar mouse jit-lock
font-lock syntax facemenu font-core frame cham georgian utf-8-lang
misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew
greek romanian slovak czech european ethiopic indian cyrillic chinese
case-table epa-hook jka-cmpr-hook help simple abbrev minibuffer nadvice
loaddefs button faces cus-face macroexp files text-properties overlay
sha1 md5 base64 format env code-pages mule custom widget
hashtable-print-readable backquote make-network-process w32notify w32
multi-tty emacs)

Memory information:
((conses 8 164471 6457)
 (symbols 32 27891 0)
 (miscs 32 55 271)
 (strings 16 24013 5282)
 (string-bytes 1 514589)
 (vectors 8 11535)
 (vector-slots 4 515427 5354)
 (floats 8 65 359)
 (intervals 28 317 19)
 (buffers 508 16))





^ permalink raw reply	[flat|nested] 7+ messages in thread

* bug#23698: 24.5; broken character name
  2016-06-05 13:06 bug#23698: 24.5; broken character name ynyaaa
@ 2016-06-05 15:44 ` Eli Zaretskii
  2016-06-05 16:44   ` Eli Zaretskii
  2016-06-09 14:32   ` handa
  2016-06-06 17:49 ` Paul Eggert
  1 sibling, 2 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-05 15:44 UTC (permalink / raw)
  To: ynyaaa, Kenichi Handa; +Cc: 23698

> From: ynyaaa@gmail.com
> Date: Sun, 05 Jun 2016 22:06:23 +0900
> 
> 
> Just after starting emacs -Q, this form returns correct value.
> 
> (get-char-code-property #xE01 'name)
> =>"THAI CHARACTER KO KAI"
> 
> Then display THAI characters by typing this.
>   M-x list-charset-chars RET thai-iso8859-11 RET
> 
> After THAI characters are displayed, the form returns wrong value.
> 
> (get-char-code-property #xE01 'name)
> =>"LETTER KO KAI"

Displaying the list of Thai characters loads thai-util.el, which
deliberately overwrites the names derived from the Unicode Character
Database with its own variants.  I'm CC'ing Handa-san, who added that
code back in 2008, in the hope that he could tell why are we doing
that, and whether this is still needed nowadays.

Thanks.





^ permalink raw reply	[flat|nested] 7+ messages in thread

* bug#23698: 24.5; broken character name
  2016-06-05 15:44 ` Eli Zaretskii
@ 2016-06-05 16:44   ` Eli Zaretskii
  2016-06-09 14:32   ` handa
  1 sibling, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-05 16:44 UTC (permalink / raw)
  To: Kenichi Handa; +Cc: 23698, ynyaaa

> Date: Sun, 05 Jun 2016 18:44:56 +0300
> From: Eli Zaretskii <eliz@gnu.org>
> Cc: 23698@debbugs.gnu.org
> 
> Displaying the list of Thai characters loads thai-util.el, which
> deliberately overwrites the names derived from the Unicode Character
> Database with its own variants.  I'm CC'ing Handa-san, who added that
> code back in 2008, in the hope that he could tell why are we doing
> that, and whether this is still needed nowadays.

Actually, this code is much older: we have it since 1997, i.e. before
we started using the UCD for these purposes.  So I think we can either
remove it or use a property that doesn't clash with the Unicode
standard properties.

Btw, the same problem exists with Lao (see lao-util.el).





^ permalink raw reply	[flat|nested] 7+ messages in thread

* bug#23698: 24.5; broken character name
  2016-06-05 13:06 bug#23698: 24.5; broken character name ynyaaa
  2016-06-05 15:44 ` Eli Zaretskii
@ 2016-06-06 17:49 ` Paul Eggert
  2016-06-06 19:00   ` Eli Zaretskii
  1 sibling, 1 reply; 7+ messages in thread
From: Paul Eggert @ 2016-06-06 17:49 UTC (permalink / raw)
  To: ynyaaa; +Cc: 23698-done

[-- Attachment #1: Type: text/plain, Size: 399 bytes --]

> we have it since 1997, i.e. before
> we started using the UCD for these purposes.  So I think we can either
> remove it or use a property that doesn't clash with the Unicode
> standard properties.
It appears to be unnecessary these days, so I removed it in the attached 
patch to master and am marking the bug as done. If I'm wrong and we need 
it we can use a different property as you suggest.


[-- Attachment #2: 0001-Use-standard-Unicode-names-for-Thai-Lao.txt --]
[-- Type: text/plain, Size: 15518 bytes --]

From 130aa7166b940fef59e832c68e76eef87cdc96e5 Mon Sep 17 00:00:00 2001
From: Paul Eggert <eggert@cs.ucla.edu>
Date: Mon, 6 Jun 2016 10:43:10 -0700
Subject: [PATCH] Use standard Unicode names for Thai, Lao
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

* lisp/language/lao-util.el, lisp/language/thai-util.el:
Don’t override standard Unicode character name (Bug#23698).
---
 lisp/language/lao-util.el  | 191 ++++++++++++++++++++++-----------------------
 lisp/language/thai-util.el | 191 ++++++++++++++++++++++-----------------------
 2 files changed, 190 insertions(+), 192 deletions(-)

diff --git a/lisp/language/lao-util.el b/lisp/language/lao-util.el
index 6103860..a42904f 100644
--- a/lisp/language/lao-util.el
+++ b/lisp/language/lao-util.el
@@ -38,100 +38,100 @@ ?s
 (define-category ?v "Lao upper/lower vowel" lao-category-table)
 (define-category ?t "Lao tone" lao-category-table)
 
-(let ((l '((?ກ consonant "LETTER KOR  KAI'" "CHICKEN")
-	   (?ຂ consonant "LETTER KHOR KHAI'" "EGG")
-	   (?຃ invalid nil)
-	   (?ຄ consonant "LETTER QHOR QHWARGN" "BUFFALO")
-	   (?຅ invalid nil)
-	   (?  invalid nil)
-	   (?ງ consonant "LETTER NGOR NGUU" "SNAKE")
-	   (?ຈ consonant "LETTER JOR JUA" "BUDDHIST NOVICE")
-	   (?ຉ invalid nil)
-	   (?ຊ consonant "LETTER XOR X\"ARNG" "ELEPHANT")
-	   (?຋ invalid nil)
-	   (?ຌ invalid nil)
-	   (?ຍ consonant "LETTER YOR YUNG" "MOSQUITO")
-	   (?ຎ invalid nil)
-	   (?ຎ invalid nil)
-	   (?ຎ invalid nil)
-	   (?ຎ invalid nil)
-	   (?ຎ invalid nil)
-	   (?ຎ invalid nil)
-	   (?ດ consonant "LETTER DOR DANG" "NOSE")
-	   (?ຕ consonant "LETTER TOR TAR" "EYE")
-	   (?ຖ consonant "LETTER THOR THUNG" "TO ASK,QUESTION")
-	   (?ທ consonant "LETTER DHOR DHARM" "FLAG")
-	   (?ຘ invalid nil)
-	   (?ນ consonant "LETTER NOR NOK" "BIRD")
-	   (?ບ consonant "LETTER BOR BED" "FISHHOOK")
-	   (?ປ consonant "LETTER POR PAR" "FISH")
-	   (?ຜ consonant "LETTER HPOR HPER\"" "BEE")
-	   (?ຝ consonant "LETTER FHOR FHAR" "WALL")
-	   (?ພ consonant "LETTER PHOR PHUU" "MOUNTAIN")
-	   (?ຟ consonant "LETTER FOR FAI" "FIRE")
-	   (?ຠ invalid nil)
-	   (?ມ consonant "LETTER MOR MAR\"" "HORSE")
-	   (?ຢ consonant "LETTER GNOR GNAR" "MEDICINE")
-	   (?ຣ consonant "LETTER ROR ROD" "CAR")
-	   (?຤ invalid nil)
-	   (?ລ consonant "LETTER LOR LIING" "MONKEY")
-	   (?຦ invalid nil)
-	   (?ວ consonant "LETTER WOR WII" "HAND FAN")
-	   (?ຨ invalid nil)
-	   (?ຩ invalid nil)
-	   (?ສ consonant "LETTER SOR SEA" "TIGER")
-	   (?ຫ consonant "LETTER HHOR HHAI" "JAR")
-	   (?ຬ invalid nil)
-	   (?ອ consonant "LETTER OR OOW" "TAKE")
-	   (?ຮ consonant "LETTER HOR HEA" "BOAT")
-	   (?ຯ special "ELLIPSIS")
-	   (?ະ vowel-base "VOWEL SIGN SARA A")
-	   (?ັ vowel-upper "VOWEL SIGN MAI KAN")
-	   (?າ vowel-base "VOWEL SIGN SARA AR")
-	   (?ຳ vowel-base "VOWEL SIGN SARA AM")
-	   (?ິ vowel-upper "VOWEL SIGN SARA I")
-	   (?ີ vowel-upper "VOWEL SIGN SARA II")
-	   (?ຶ vowel-upper "VOWEL SIGN SARA EU")
-	   (?ື vowel-upper "VOWEL SIGN SARA UR")
-	   (?ຸ vowel-lower "VOWEL SIGN SARA U")
-	   (?ູ vowel-lower "VOWEL SIGN SARA UU")
-	   (?຺ invalid nil)
-	   (?ົ vowel-upper "VOWEL SIGN MAI KONG")
-	   (?ຼ semivowel-lower "SEMIVOWEL SIGN LO")
-	   (?ຽ vowel-base "SEMIVOWEL SIGN SARA IA")
-	   (?຾ invalid nil)
-	   (?຿ invalid nil)
-	   (?ເ vowel-base "VOWEL SIGN SARA EE")
-	   (?ແ vowel-base "VOWEL SIGN SARA AA")
-	   (?ໂ vowel-base "VOWEL SIGN SARA OO")
-	   (?ໃ vowel-base "VOWEL SIGN SARA EI MAI MUAN\"")
-	   (?ໄ vowel-base "VOWEL SIGN SARA AI MAI MAY")
-	   (?໅ invalid nil)
-	   (?ໆ special "KO LA (REPETITION)")
-	   (?໇ invalid nil)
-	   (?່ tone "TONE MAI EK")
-	   (?້ tone "TONE MAI THO")
-	   (?໊ tone "TONE MAI TI")
-	   (?໋ tone "TONE MAI JADTAWAR")
-	   (?໌ tone "CANCELLATION MARK")
-	   (?ໍ vowel-upper "VOWEL SIGN SARA OR")
-	   (?໎ invalid nil)
-	   (?໏ invalid nil)
-	   (?໐ special "DIGIT ZERO")
-	   (?໑ special "DIGIT ONE")
-	   (?໒ special "DIGIT TWO")
-	   (?໓ special "DIGIT THREE")
-	   (?໔ special "DIGIT FOUR")
-	   (?໕ special "DIGIT FIVE")
-	   (?໖ special "DIGIT SIX")
-	   (?໗ special "DIGIT SEVEN")
-	   (?໘ special "DIGIT EIGHT")
-	   (?໙ special "DIGIT NINE")
-	   (?໚ invalid nil)
-	   (?໛ invalid nil)
-	   (?ໜ consonant "LETTER NHOR NHUU" "MOUSE")
-	   (?ໝ consonant "LETTER MHOR MHAR" "DOG")
-	   (?ໞ invalid nil)))
+(let ((l '((?ກ consonant "CHICKEN")
+	   (?ຂ consonant "EGG")
+	   (?຃ invalid)
+	   (?ຄ consonant "BUFFALO")
+	   (?຅ invalid)
+	   (?  invalid)
+	   (?ງ consonant "SNAKE")
+	   (?ຈ consonant "BUDDHIST NOVICE")
+	   (?ຉ invalid)
+	   (?ຊ consonant "ELEPHANT")
+	   (?຋ invalid)
+	   (?ຌ invalid)
+	   (?ຍ consonant "MOSQUITO")
+	   (?ຎ invalid)
+	   (?ຎ invalid)
+	   (?ຎ invalid)
+	   (?ຎ invalid)
+	   (?ຎ invalid)
+	   (?ຎ invalid)
+	   (?ດ consonant "NOSE")
+	   (?ຕ consonant "EYE")
+	   (?ຖ consonant "TO ASK,QUESTION")
+	   (?ທ consonant "FLAG")
+	   (?ຘ invalid)
+	   (?ນ consonant "BIRD")
+	   (?ບ consonant "FISHHOOK")
+	   (?ປ consonant "FISH")
+	   (?ຜ consonant "BEE")
+	   (?ຝ consonant "WALL")
+	   (?ພ consonant "MOUNTAIN")
+	   (?ຟ consonant "FIRE")
+	   (?ຠ invalid)
+	   (?ມ consonant "HORSE")
+	   (?ຢ consonant "MEDICINE")
+	   (?ຣ consonant "CAR")
+	   (?຤ invalid)
+	   (?ລ consonant "MONKEY")
+	   (?຦ invalid)
+	   (?ວ consonant "HAND FAN")
+	   (?ຨ invalid)
+	   (?ຩ invalid)
+	   (?ສ consonant "TIGER")
+	   (?ຫ consonant "JAR")
+	   (?ຬ invalid)
+	   (?ອ consonant "TAKE")
+	   (?ຮ consonant "BOAT")
+	   (?ຯ special)
+	   (?ະ vowel-base)
+	   (?ັ vowel-upper)
+	   (?າ vowel-base)
+	   (?ຳ vowel-base)
+	   (?ິ vowel-upper)
+	   (?ີ vowel-upper)
+	   (?ຶ vowel-upper)
+	   (?ື vowel-upper)
+	   (?ຸ vowel-lower)
+	   (?ູ vowel-lower)
+	   (?຺ invalid)
+	   (?ົ vowel-upper)
+	   (?ຼ semivowel-lower)
+	   (?ຽ vowel-base)
+	   (?຾ invalid)
+	   (?຿ invalid)
+	   (?ເ vowel-base)
+	   (?ແ vowel-base)
+	   (?ໂ vowel-base)
+	   (?ໃ vowel-base)
+	   (?ໄ vowel-base)
+	   (?໅ invalid)
+	   (?ໆ special)
+	   (?໇ invalid)
+	   (?່ tone)
+	   (?້ tone)
+	   (?໊ tone)
+	   (?໋ tone)
+	   (?໌ tone)
+	   (?ໍ vowel-upper)
+	   (?໎ invalid)
+	   (?໏ invalid)
+	   (?໐ special)
+	   (?໑ special)
+	   (?໒ special)
+	   (?໓ special)
+	   (?໔ special)
+	   (?໕ special)
+	   (?໖ special)
+	   (?໗ special)
+	   (?໘ special)
+	   (?໙ special)
+	   (?໚ invalid)
+	   (?໛ invalid)
+	   (?ໜ consonant "MOUSE")
+	   (?ໝ consonant "DOG")
+	   (?ໞ invalid)))
       elm)
   (while l
     (setq elm (car l) l (cdr l))
@@ -146,8 +146,7 @@ ?t
 	    ((eq ptype 'tone)
 	     (modify-category-entry char ?t lao-category-table)))
       (put-char-code-property char 'phonetic-type ptype)
-      (put-char-code-property char 'name (nth 2 elm))
-      (put-char-code-property char 'meaning (nth 3 elm)))))
+      (put-char-code-property char 'meaning (nth 2 elm)))))
 
 ;; The general composing rules are as follows:
 ;;
diff --git a/lisp/language/thai-util.el b/lisp/language/thai-util.el
index a9af5cf..618af57 100644
--- a/lisp/language/thai-util.el
+++ b/lisp/language/thai-util.el
@@ -55,100 +55,100 @@ thai-composition-pattern
   "\\cc\\(\\cu\\|\\cI\\cU\\|\\cv\\ct?\\)\\|\\cv\\ct\\|\\cI\\cU"
   "Regular expression matching a Thai composite sequence.")
 
-(let ((l '((?ก consonant "LETTER KO KAI")				; 0xA1
-	   (?ข consonant "LETTER KHO KHAI")				; 0xA2
-	   (?ฃ consonant "LETTER KHO KHUAT")				; 0xA3
-	   (?ค consonant "LETTER KHO KHWAI")				; 0xA4
-	   (?ฅ consonant "LETTER KHO KHON")				; 0xA5
-	   (?ฆ consonant "LETTER KHO RAKHANG")				; 0xA6
-	   (?ง consonant "LETTER NGO NGU")				; 0xA7
-	   (?จ consonant "LETTER CHO CHAN")				; 0xA8
-	   (?ฉ consonant "LETTER CHO CHING")				; 0xA9
-	   (?ช consonant "LETTER CHO CHANG")				; 0xAA
-	   (?ซ consonant "LETTER SO SO")				; 0xAB
-	   (?ฌ consonant "LETTER CHO CHOE")				; 0xAC
-	   (?ญ consonant "LETTER YO YING")				; 0xAD
-	   (?ฎ consonant "LETTER DO CHADA")				; 0xAE
-	   (?ฏ consonant "LETTER TO PATAK")				; 0xAF
-	   (?ฐ consonant "LETTER THO THAN")				; 0xB0
-	   (?ฑ consonant "LETTER THO NANGMONTHO")			; 0xB1
-	   (?ฒ consonant "LETTER THO PHUTHAO")				; 0xB2
-	   (?ณ consonant "LETTER NO NEN")				; 0xB3
-	   (?ด consonant "LETTER DO DEK")				; 0xB4
-	   (?ต consonant "LETTER TO TAO")				; 0xB5
-	   (?ถ consonant "LETTER THO THUNG")				; 0xB6
-	   (?ท consonant "LETTER THO THAHAN")				; 0xB7
-	   (?ธ consonant "LETTER THO THONG")				; 0xB8
-	   (?น consonant "LETTER NO NU")				; 0xB9
-	   (?บ consonant "LETTER BO BAIMAI")				; 0xBA
-	   (?ป consonant "LETTER PO PLA")				; 0xBB
-	   (?ผ consonant "LETTER PHO PHUNG")				; 0xBC
-	   (?ฝ consonant "LETTER FO FA")				; 0xBD
-	   (?พ consonant "LETTER PHO PHAN")				; 0xBE
-	   (?ฟ consonant "LETTER FO FAN")				; 0xBF
-	   (?ภ consonant "LETTER PHO SAMPHAO")				; 0xC0
-	   (?ม consonant "LETTER MO MA")				; 0xC1
-	   (?ย consonant "LETTER YO YAK")				; 0xC2
-	   (?ร consonant "LETTER RO RUA")				; 0xC3
-	   (?ฤ vowel-base "LETTER RU (Pali vowel letter)")		; 0xC4
-	   (?ล consonant "LETTER LO LING")				; 0xC5
-	   (?ฦ vowel-base "LETTER LU (Pali vowel letter)")		; 0xC6
-	   (?ว consonant "LETTER WO WAEN")				; 0xC7
-	   (?ศ consonant "LETTER SO SALA")				; 0xC8
-	   (?ษ consonant "LETTER SO RUSI")				; 0xC9
-	   (?ส consonant "LETTER SO SUA")				; 0xCA
-	   (?ห consonant "LETTER HO HIP")				; 0xCB
-	   (?ฬ consonant "LETTER LO CHULA")				; 0xCC
-	   (?อ consonant "LETTER O ANG")				; 0xCD
-	   (?ฮ consonant "LETTER HO NOK HUK")				; 0xCE
-	   (?ฯ special "PAI YAN NOI (abbreviation)")			; 0xCF
-	   (?ะ vowel-base "VOWEL SIGN SARA A")				; 0xD0
-	   (?ั vowel-upper "VOWEL SIGN MAI HAN-AKAT N/S-T")		; 0xD1
-	   (?า vowel-base "VOWEL SIGN SARA AA")				; 0xD2
-	   (?ำ vowel-base "VOWEL SIGN SARA AM")				; 0xD3
-	   (?ิ vowel-upper "VOWEL SIGN SARA I N/S-T")			; 0xD4
-	   (?ี vowel-upper "VOWEL SIGN SARA II N/S-T")			; 0xD5
-	   (?ึ vowel-upper "VOWEL SIGN SARA UE N/S-T")			; 0xD6
-	   (?ื vowel-upper "VOWEL SIGN SARA UEE N/S-T")			; 0xD7
-	   (?ุ vowel-lower "VOWEL SIGN SARA U N/S-B")			; 0xD8
-	   (?ู vowel-lower "VOWEL SIGN SARA UU N/S-B")			; 0xD9
-	   (?ฺ vowel-lower "VOWEL SIGN PHINTHU N/S-B (Pali virama)")	; 0xDA
-	   (?฻ invalid nil)						; 0xDA
-	   (?฼ invalid nil)						; 0xDC
-	   (?฽ invalid nil)						; 0xDC
-	   (?฾ invalid nil)						; 0xDC
-	   (?฿ special "BAHT SIGN (currency symbol)")			; 0xDF
-	   (?เ vowel-base "VOWEL SIGN SARA E")				; 0xE0
-	   (?แ vowel-base "VOWEL SIGN SARA AE")				; 0xE1
-	   (?โ vowel-base "VOWEL SIGN SARA O")				; 0xE2
-	   (?ใ vowel-base "VOWEL SIGN SARA MAI MUAN")			; 0xE3
-	   (?ไ vowel-base "VOWEL SIGN SARA MAI MALAI")			; 0xE4
-	   (?ๅ vowel-base "LAK KHANG YAO")				; 0xE5
-	   (?ๆ special "MAI YAMOK (repetition)")			; 0xE6
-	   (?็ sign-upper "VOWEL SIGN MAI TAI KHU N/S-T")		; 0xE7
-	   (?่ tone "TONE MAI EK N/S-T")				; 0xE8
-	   (?้ tone "TONE MAI THO N/S-T")				; 0xE9
-	   (?๊ tone "TONE MAI TRI N/S-T")				; 0xEA
-	   (?๋ tone "TONE MAI CHATTAWA N/S-T")				; 0xEB
-	   (?์ sign-upper "THANTHAKHAT N/S-T (cancellation mark)")	; 0xEC
-	   (?ํ sign-upper "NIKKHAHIT N/S-T (final nasal)")		; 0xED
-	   (?๎ sign-upper "YAMAKKAN N/S-T")				; 0xEE
-	   (?๏ special "FONRMAN")					; 0xEF
-	   (?๐ special "DIGIT ZERO")					; 0xF0
-	   (?๑ special "DIGIT ONE")					; 0xF1
-	   (?๒ special "DIGIT TWO")					; 0xF2
-	   (?๓ special "DIGIT THREE")					; 0xF3
-	   (?๔ special "DIGIT FOUR")					; 0xF4
-	   (?๕ special "DIGIT FIVE")					; 0xF5
-	   (?๖ special "DIGIT SIX")					; 0xF6
-	   (?๗ special "DIGIT SEVEN")					; 0xF7
-	   (?๘ special "DIGIT EIGHT")					; 0xF8
-	   (?๙ special "DIGIT NINE")					; 0xF9
-	   (?๚ special "ANGKHANKHU (ellipsis)")				; 0xFA
-	   (?๛ special "KHOMUT (beginning of religious texts)")		; 0xFB
-	   (?๜ invalid nil)						; 0xFC
-	   (?๝ invalid nil)						; 0xFD
-	   (?๞ invalid nil)						; 0xFE
+(let ((l '((?ก consonant)	; 0xA1
+	   (?ข consonant)	; 0xA2
+	   (?ฃ consonant)	; 0xA3
+	   (?ค consonant)	; 0xA4
+	   (?ฅ consonant)	; 0xA5
+	   (?ฆ consonant)	; 0xA6
+	   (?ง consonant)	; 0xA7
+	   (?จ consonant)	; 0xA8
+	   (?ฉ consonant)	; 0xA9
+	   (?ช consonant)	; 0xAA
+	   (?ซ consonant)	; 0xAB
+	   (?ฌ consonant)	; 0xAC
+	   (?ญ consonant)	; 0xAD
+	   (?ฎ consonant)	; 0xAE
+	   (?ฏ consonant)	; 0xAF
+	   (?ฐ consonant)	; 0xB0
+	   (?ฑ consonant)	; 0xB1
+	   (?ฒ consonant)	; 0xB2
+	   (?ณ consonant)	; 0xB3
+	   (?ด consonant)	; 0xB4
+	   (?ต consonant)	; 0xB5
+	   (?ถ consonant)	; 0xB6
+	   (?ท consonant)	; 0xB7
+	   (?ธ consonant)	; 0xB8
+	   (?น consonant)	; 0xB9
+	   (?บ consonant)	; 0xBA
+	   (?ป consonant)	; 0xBB
+	   (?ผ consonant)	; 0xBC
+	   (?ฝ consonant)	; 0xBD
+	   (?พ consonant)	; 0xBE
+	   (?ฟ consonant)	; 0xBF
+	   (?ภ consonant)	; 0xC0
+	   (?ม consonant)	; 0xC1
+	   (?ย consonant)	; 0xC2
+	   (?ร consonant)	; 0xC3
+	   (?ฤ vowel-base)	; 0xC4
+	   (?ล consonant)	; 0xC5
+	   (?ฦ vowel-base)	; 0xC6
+	   (?ว consonant)	; 0xC7
+	   (?ศ consonant)	; 0xC8
+	   (?ษ consonant)	; 0xC9
+	   (?ส consonant)	; 0xCA
+	   (?ห consonant)	; 0xCB
+	   (?ฬ consonant)	; 0xCC
+	   (?อ consonant)	; 0xCD
+	   (?ฮ consonant)	; 0xCE
+	   (?ฯ special)	; 0xCF
+	   (?ะ vowel-base)	; 0xD0
+	   (?ั vowel-upper)	; 0xD1
+	   (?า vowel-base)	; 0xD2
+	   (?ำ vowel-base)	; 0xD3
+	   (?ิ vowel-upper)	; 0xD4
+	   (?ี vowel-upper)	; 0xD5
+	   (?ึ vowel-upper)	; 0xD6
+	   (?ื vowel-upper)	; 0xD7
+	   (?ุ vowel-lower)	; 0xD8
+	   (?ู vowel-lower)	; 0xD9
+	   (?ฺ vowel-lower)	; 0xDA
+	   (?฻ invalid)	; 0xDA
+	   (?฼ invalid)	; 0xDC
+	   (?฽ invalid)	; 0xDC
+	   (?฾ invalid)	; 0xDC
+	   (?฿ special)	; 0xDF
+	   (?เ vowel-base)	; 0xE0
+	   (?แ vowel-base)	; 0xE1
+	   (?โ vowel-base)	; 0xE2
+	   (?ใ vowel-base)	; 0xE3
+	   (?ไ vowel-base)	; 0xE4
+	   (?ๅ vowel-base)	; 0xE5
+	   (?ๆ special)	; 0xE6
+	   (?็ sign-upper)	; 0xE7
+	   (?่ tone)		; 0xE8
+	   (?้ tone)		; 0xE9
+	   (?๊ tone)		; 0xEA
+	   (?๋ tone)		; 0xEB
+	   (?์ sign-upper)	; 0xEC
+	   (?ํ sign-upper)	; 0xED
+	   (?๎ sign-upper)	; 0xEE
+	   (?๏ special)	; 0xEF
+	   (?๐ special)	; 0xF0
+	   (?๑ special)	; 0xF1
+	   (?๒ special)	; 0xF2
+	   (?๓ special)	; 0xF3
+	   (?๔ special)	; 0xF4
+	   (?๕ special)	; 0xF5
+	   (?๖ special)	; 0xF6
+	   (?๗ special)	; 0xF7
+	   (?๘ special)	; 0xF8
+	   (?๙ special)	; 0xF9
+	   (?๚ special)	; 0xFA
+	   (?๛ special)	; 0xFB
+	   (?๜ invalid)	; 0xFC
+	   (?๝ invalid)	; 0xFD
+	   (?๞ invalid)	; 0xFE
 	   ))
       elm)
   (while l
@@ -170,8 +170,7 @@ thai-composition-pattern
 	     (modify-category-entry char ?u thai-category-table)
 	     (if (= char ?์)
 		 ;; Give category `U' to "THANTHAKHAT".
-		 (modify-category-entry char ?U thai-category-table))))
-      (put-char-code-property char 'name (nth 2 elm)))))
+		 (modify-category-entry char ?U thai-category-table)))))))
 
 (defun thai-compose-syllable (beg end &optional category-set string)
   (or category-set
-- 
2.5.5


^ permalink raw reply related	[flat|nested] 7+ messages in thread

* bug#23698: 24.5; broken character name
  2016-06-06 17:49 ` Paul Eggert
@ 2016-06-06 19:00   ` Eli Zaretskii
  0 siblings, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-06 19:00 UTC (permalink / raw)
  To: Paul Eggert; +Cc: 23698, ynyaaa

> Cc: 23698-done@debbugs.gnu.org, Eli Zaretskii <eliz@gnu.org>,
>  Kenichi Handa <handa@gnu.org>
> From: Paul Eggert <eggert@cs.ucla.edu>
> Date: Mon, 6 Jun 2016 10:49:12 -0700
> 
> > we have it since 1997, i.e. before
> > we started using the UCD for these purposes.  So I think we can either
> > remove it or use a property that doesn't clash with the Unicode
> > standard properties.
> It appears to be unnecessary these days, so I removed it in the attached 
> patch to master and am marking the bug as done. If I'm wrong and we need 
> it we can use a different property as you suggest.

I'd still like to hear Handa-san's opinions on this.





^ permalink raw reply	[flat|nested] 7+ messages in thread

* bug#23698: 24.5; broken character name
  2016-06-05 15:44 ` Eli Zaretskii
  2016-06-05 16:44   ` Eli Zaretskii
@ 2016-06-09 14:32   ` handa
  2016-06-09 14:37     ` Eli Zaretskii
  1 sibling, 1 reply; 7+ messages in thread
From: handa @ 2016-06-09 14:32 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: 23698, ynyaaa

In article <83y46jtzbb.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:

> Displaying the list of Thai characters loads thai-util.el, which
> deliberately overwrites the names derived from the Unicode Character
> Database with its own variants.  I'm CC'ing Handa-san, who added that
> code back in 2008, in the hope that he could tell why are we doing
> that, and whether this is still needed nowadays.

Long ago, I discussed with Thai/Lao people about how to support their
languages in Mule.  At that time, as I can't speak those languages, we
refer each character by name in the discussion.  And, for debugging the
support code, name property was very useful.  That's the reason for
those properties.   Now, I think we can get rid them.

---
K. Handa
handa@gnu.org





^ permalink raw reply	[flat|nested] 7+ messages in thread

* bug#23698: 24.5; broken character name
  2016-06-09 14:32   ` handa
@ 2016-06-09 14:37     ` Eli Zaretskii
  0 siblings, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2016-06-09 14:37 UTC (permalink / raw)
  To: handa; +Cc: ynyaaa, 23698-done

> From: handa <handa@gnu.org>
> Cc: ynyaaa@gmail.com, 23698@debbugs.gnu.org
> Date: Thu, 09 Jun 2016 23:32:21 +0900
> 
> In article <83y46jtzbb.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:
> 
> > Displaying the list of Thai characters loads thai-util.el, which
> > deliberately overwrites the names derived from the Unicode Character
> > Database with its own variants.  I'm CC'ing Handa-san, who added that
> > code back in 2008, in the hope that he could tell why are we doing
> > that, and whether this is still needed nowadays.
> 
> Long ago, I discussed with Thai/Lao people about how to support their
> languages in Mule.  At that time, as I can't speak those languages, we
> refer each character by name in the discussion.  And, for debugging the
> support code, name property was very useful.  That's the reason for
> those properties.   Now, I think we can get rid them.

OK, thanks.  So I guess Paul's change did TRT, and we can close this
bug report.





^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2016-06-09 14:37 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-06-05 13:06 bug#23698: 24.5; broken character name ynyaaa
2016-06-05 15:44 ` Eli Zaretskii
2016-06-05 16:44   ` Eli Zaretskii
2016-06-09 14:32   ` handa
2016-06-09 14:37     ` Eli Zaretskii
2016-06-06 17:49 ` Paul Eggert
2016-06-06 19:00   ` Eli Zaretskii

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.