* get-char-code-property: reports incorrect char names?
@ 2010-09-01 0:36 T.V. Raman
2010-09-01 6:38 ` Eli Zaretskii
2010-09-01 7:05 ` Stefan Monnier
0 siblings, 2 replies; 3+ messages in thread
From: T.V. Raman @ 2010-09-01 0:36 UTC (permalink / raw)
To: emacs-devel; +Cc: tv.raman.tv
This is using the latest bzr snapshot, with the latest
unicodedata.txt from the Unicode consortium.
I noticed that I was hearing incorrect character names for
unicode chars from Emacspeak --- and it's possibly due to a bug
in get-char-code-property.
The function that does work --- describe-char-unicode-data --- is
declared to be semi-obsolete.
To see the problem, compare the code fragment below with the result of evaluating it:
The character used is the "cent sign" ==
¢ inserted via C-x 8c in the buffer.
(loop for p in
      (mapcar 'car char-code-property-alist)
      do
      (insert
       (format "%s: %s\n" p (get-char-code-property ?¢ p))))
titlecase: nil
lowercase: nil
uppercase: nil
iso-10646-comment: nil
old-name: nil
mirrored: N
numeric-value: nil
digit-value: nil
decimal-digit-value: nil
decomposition: nil
bidi-class: ET
canonical-combining-class: 0
general-category: Sc
name: SUPERSET NYA FA SIGN
Compared to what describe-char-unicode-data returns for the same
character: note particularly the Name field.
(describe-char-unicode-data ?¢)
(("Name" "CENT SIGN")
("Category" "Symbol, Currency")
("Combining class" "Sc")
("Bidi category" "Sc")
("Decomposition" nil)
("Decimal digit value" nil)
("Digit value" nil)
("Numeric value" nil)
("Mirrored" nil)
("Old name" nil)
("ISO 10646 comment" nil)
("Uppercase" nil)
...)
* Re: get-char-code-property: reports incorrect char names?
From: Eli Zaretskii @ 2010-09-01 6:38 UTC (permalink / raw)
To: T.V. Raman; +Cc: emacs-devel
> Date: Tue, 31 Aug 2010 17:36:35 -0700
> From: "T.V. Raman" <tv.raman.tv@gmail.com>
> Cc: tv.raman.tv@gmail.com
>
> This is using the latest bzr snapshot, with the latest
> unicodedata.txt from the Unicode consortium.
Did you actually mean "from the Unicode consortium", i.e. you are
using a file that is different from what we have in the repository?
Introducing a new UnicodeData.txt file requires running several
commands in the admin subdirectory and then rebuilding several Lisp
files. Did you do all that?
> I noticed that I was hearing incorrect character names for
> unicode chars from Emacspeak --- and it's possibly due to a bug
> in get-char-code-property.
>
> The function that does work --- describe-char-unicode-data --- is
> declared to be semi-obsolete.
>
> To see the problem, compare the code fragment below with the result of evaluating it:
>
> The character used is the "cent sign" ==
> ¢ inserted via C-x 8c in the buffer.
>
> (loop for p in
>       (mapcar 'car char-code-property-alist)
>       do
>       (insert
>        (format "%s: %s\n" p (get-char-code-property ?¢ p))))
>
> titlecase: nil
> lowercase: nil
> uppercase: nil
> iso-10646-comment: nil
> old-name: nil
> mirrored: N
> numeric-value: nil
> digit-value: nil
> decimal-digit-value: nil
> decomposition: nil
> bidi-class: ET
> canonical-combining-class: 0
> general-category: Sc
> name: SUPERSET NYA FA SIGN
I cannot reproduce this with today's build of the trunk. I get "CENT
SIGN" for the `name' property of a character inserted by "C-x 8c".
Also, there's no character named "SUPERSET NYA FA SIGN" in
UnicodeData.txt that's in the repository.
Does this happen for you in "emacs -Q"? If not, there's some other
factor at work here.
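A minimal sketch of such a check, to be evaluated in a fresh "emacs -Q" session (for example with C-j in *scratch*). It is only an illustration and rests on two assumptions not stated in the report: that writing the character as the code point #xA2 (U+00A2) instead of a literal ¢ removes any dependence on how the buffer was decoded, and that mapcar over char-code-property-alist is an acceptable stand-in for the original loop form, which needs the cl library.

```emacs-lisp
;; Sketch only: query the properties of U+00A2 without relying on a
;; literal character in the buffer.
(let ((ch #xA2))                        ; code point of the cent sign
  (list
   ;; Is the literal ¢ in this buffer read as U+00A2?  A nil here would
   ;; point at the buffer contents or decoding, not at the property lookup.
   (= ?¢ ch)
   ;; The property at issue in the report.
   (get-char-code-property ch 'name)
   ;; Every known property, as an alist of (PROPERTY . VALUE) pairs,
   ;; instead of text inserted into the buffer.
   (mapcar (lambda (entry)
             (cons (car entry)
                   (get-char-code-property ch (car entry))))
           char-code-property-alist)))
```

If the first element is t and the name is still wrong under "emacs -Q", the property tables themselves are the more likely suspect; if it is nil, the character in the buffer is not the one it appears to be.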
* Re: get-char-code-property: reports incorrect char names?
From: Stefan Monnier @ 2010-09-01 7:05 UTC (permalink / raw)
To: T.V. Raman; +Cc: emacs-devel
> I noticed that I was hearing incorrect character names for
> unicode chars from Emacspeak --- and it's possibly due to a bug
> in get-char-code-property.
Please use M-x report-emacs-bug rather than posting to emacs-devel.
I can't reproduce your problem. It's probably due to some
coding-system problem.
Stefan