From: Eli Zaretskii
Newsgroups: gmane.emacs.devel
Subject: Re: Unicode Char Names: describe-char-unicode-data vs get-char-code-property
Date: Fri, 27 Dec 2013 22:31:59 +0200
Message-ID: <8361qa3uio.fsf@gnu.org>
To: "T.V. Raman"
Cc: emacs-devel@gnu.org

> Date: Fri, 27 Dec 2013 12:09:36 -0800
> From: "T.V. Raman"
>
> I use describe-char-unicode-data to speak unicode chars in
> emacspeak when navigating by character.
>
> That function claims to be semi-obsolete -- and recommends
> get-char-code-property instead; however the answers are not
> consistent in the two solutions.
>
> Try non-breaking space \240 (octal)
>
> (setq char ? )
> 160 (#o240, #xa0, ? )
> (describe-char-unicode-data char) returns
> (("Name" "NO-BREAK SPACE") ("Category" "Separator, Space")
> ("Combining class" "Zs") ("Bidi category" "Zs") ("Decomposition"
> "noBreak ") ("Decimal digit value" nil) ("Digit value" nil)
> ("Numeric value" nil) ("Mirrored" nil) ("Old name" "NON-BREAKING
> SPACE") ("ISO 10646 comment" nil) ("Uppercase" nil) ...)
> whereas
> (get-char-code-property char 'name) returns
> "COMMA NO-SHE IOTA ARABIC DIAERESIS"
> also looking up char in ucs-names yields something different
> (car (rassoc char (ucs-names))) returns
> "NON-NEITHER UNDER CM COMMA"
> Note that all of the above is with the unicdoedata file present
> and correctly found by emacs.

I cannot reproduce any of these.
For me, both describe-char-unicode-data and get-char-code-property
return the same name, "NO-BREAK SPACE", and from ucs-names I get
"NON-BREAKING SPACE", which is the "Old Name" property of that
character.

> Also, and perhaps related:
>
> C-x 8 spc inserts non-breaking space correctly; however when you
> use C-x 8 ret, non-breaking space doesn't show up in the
> available completions -- either using it's "name" or "old-name"

This also works as expected on my system.

Please dig deeper into your configuration and customizations; the
answer is somewhere there.
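
One way to narrow this down is to compare the lookups side by side in a session started with "emacs -Q", so that no init file or site customization is loaded. The following is only a minimal sketch: the helper name my-nbsp-name-report is made up for illustration and is not part of Emacs, and it assumes that describe-char-unicode-data returns a list of (LABEL VALUE) pairs, as in the output quoted above.

```elisp
;; Evaluate in *scratch* after starting with `emacs -Q'.
;; `my-nbsp-name-report' is a hypothetical helper, not part of Emacs.
(require 'descr-text)   ; defines `describe-char-unicode-data'

(defun my-nbsp-name-report (char)
  "Return an alist of the names Emacs reports for CHAR."
  (list
   ;; Name as read from the UnicodeData file; this is nil when
   ;; `describe-char-unicodedata-file' is nil.
   (cons 'unicode-data
         (cadr (assoc "Name" (describe-char-unicode-data char))))
   ;; Name and old name from the built-in character property tables.
   (cons 'name (get-char-code-property char 'name))
   (cons 'old-name (get-char-code-property char 'old-name))))

;; #xa0 is NO-BREAK SPACE (decimal 160, octal 240).
(my-nbsp-name-report #xa0)
```

If the values differ between "emacs -Q" and the normal session, the cause is in the configuration rather than in Emacs itself.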