get-char-code-property: reports incorrect char names?

unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed

* get-char-code-property: reports incorrect char names?
@ 2010-09-01  0:36 T.V. Raman
  2010-09-01  6:38 ` Eli Zaretskii
  2010-09-01  7:05 ` Stefan Monnier
  0 siblings, 2 replies; 3+ messages in thread
From: T.V. Raman @ 2010-09-01  0:36 UTC (permalink / raw)
  To: emacs-devel; +Cc: tv.raman.tv

This is using the latest bzr snapshot, with the latest
unicodedata.txt from the Unicode  consortium.

I noticed that I was hearing incorrect character names for
unicode chars from Emacspeak --- and it's possibly due to a bug
in get-char-code-property.

The solution that works  describe-char-unicode-data --- is
declared to be semi-obsolete.

To see the problem, see code fragments and result of evaluation:

The  character used is the "cent sign" ==
¢ inserted via C-x 8c in the buffer.

(loop for p in
      (mapcar 'car char-code-property-alist)
      do
      (insert
       (format "%s: %s\n" p (get-char-code-property ?¢ p ))))

titlecase: nil
lowercase: nil
uppercase: nil
iso-10646-comment: nil
old-name: nil
mirrored: N
numeric-value: nil
digit-value: nil
decimal-digit-value: nil
decomposition: nil
bidi-class: ET
canonical-combining-class: 0
general-category: Sc
name: SUPERSET NYA FA SIGN

Compared to what describe-char-unicode-data returns for the same
character: note particularly the Name field.
(describe-char-unicode-data ?¢)
(("Name" "CENT SIGN")
 ("Category" "Symbol, Currency")
 ("Combining class" "Sc")
 ("Bidi category" "Sc")
 ("Decomposition" nil)
 ("Decimal digit value" nil)
 ("Digit value" nil)
 ("Numeric value" nil)
 ("Mirrored" nil)
 ("Old name" nil)
 ("ISO 10646 comment" nil)
 ("Uppercase" nil)
 ...)

--

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: get-char-code-property: reports incorrect char names?
  2010-09-01  0:36 get-char-code-property: reports incorrect char names? T.V. Raman
@ 2010-09-01  6:38 ` Eli Zaretskii
  2010-09-01  7:05 ` Stefan Monnier
  1 sibling, 0 replies; 3+ messages in thread
From: Eli Zaretskii @ 2010-09-01  6:38 UTC (permalink / raw)
  To: T.V. Raman; +Cc: emacs-devel

> Date: Tue, 31 Aug 2010 17:36:35 -0700
> From: "T.V. Raman" <tv.raman.tv@gmail.com>
> Cc: tv.raman.tv@gmail.com
> 
> This is using the latest bzr snapshot, with the latest
> unicodedata.txt from the Unicode  consortium.

Did you actually mean "from the Unicode  consortium", i.e. you are
using a file that is different from what we have in the repository?

Introduction of a new UnicodeData.txt file requires to run several
commands in the admin subdirectory, and then rebuild several Lisp
files.  Did you do all that?

> I noticed that I was hearing incorrect character names for
> unicode chars from Emacspeak --- and it's possibly due to a bug
> in get-char-code-property.
> 
> The solution that works  describe-char-unicode-data --- is
> declared to be semi-obsolete.
> 
> To see the problem, see code fragments and result of evaluation:
> 
> The  character used is the "cent sign" ==
> ¢ inserted via C-x 8c in the buffer.
> 
> (loop for p in
>       (mapcar 'car char-code-property-alist)
>       do
>       (insert
>        (format "%s: %s\n" p (get-char-code-property ?¢ p ))))
> 
> titlecase: nil
> lowercase: nil
> uppercase: nil
> iso-10646-comment: nil
> old-name: nil
> mirrored: N
> numeric-value: nil
> digit-value: nil
> decimal-digit-value: nil
> decomposition: nil
> bidi-class: ET
> canonical-combining-class: 0
> general-category: Sc
> name: SUPERSET NYA FA SIGN

I cannot reproduce this with today's build of the trunk.  I get "CENT
SIGN" for the `name' property of a character inserted by "C-x 8c".
Also, there's no character named "SUPERSET NYA FA SIGN" in
UnicodeData.txt that's in the repository.

Does this happen for you in "emacs -Q"?  If not, there's some other
factor at work here.



^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: get-char-code-property: reports incorrect char names?
  2010-09-01  0:36 get-char-code-property: reports incorrect char names? T.V. Raman
  2010-09-01  6:38 ` Eli Zaretskii
@ 2010-09-01  7:05 ` Stefan Monnier
  1 sibling, 0 replies; 3+ messages in thread
From: Stefan Monnier @ 2010-09-01  7:05 UTC (permalink / raw)
  To: T.V. Raman; +Cc: emacs-devel

> I noticed that I was hearing incorrect character names for
> unicode chars from Emacspeak --- and it's possibly due to a bug
> in get-char-code-property.

Please use M-x report-emacs-bug rather than posting to emacs-devel.
I can't reproduce your problem.  It's probably due to a some
coding-system problem.


        Stefan



^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2010-09-01  7:05 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-09-01  0:36 get-char-code-property: reports incorrect char names? T.V. Raman
2010-09-01  6:38 ` Eli Zaretskii
2010-09-01  7:05 ` Stefan Monnier

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).