From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "T.V. Raman" Newsgroups: gmane.emacs.devel Subject: Unicode Char Names: describe-char-unicode-data vs get-char-code-property Date: Fri, 27 Dec 2013 12:09:36 -0800 Message-ID: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 X-Trace: ger.gmane.org 1388174981 7274 80.91.229.3 (27 Dec 2013 20:09:41 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 27 Dec 2013 20:09:41 +0000 (UTC) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Dec 27 21:09:45 2013 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Vwdj5-0005GY-0K for ged-emacs-devel@m.gmane.org; Fri, 27 Dec 2013 21:09:43 +0100 Original-Received: from localhost ([::1]:50172 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vwdj4-00053w-6I for ged-emacs-devel@m.gmane.org; Fri, 27 Dec 2013 15:09:42 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:42619) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vwdj0-00053q-VG for emacs-devel@gnu.org; Fri, 27 Dec 2013 15:09:39 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vwdiz-0006CD-3a for emacs-devel@gnu.org; Fri, 27 Dec 2013 15:09:38 -0500 Original-Received: from mail-vc0-x22b.google.com ([2607:f8b0:400c:c03::22b]:56294) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vwdiy-0006C8-Um for emacs-devel@gnu.org; Fri, 27 Dec 2013 15:09:37 -0500 Original-Received: by mail-vc0-f171.google.com with SMTP id ik5so4979029vcb.16 for ; Fri, 27 Dec 2013 12:09:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=OinQBkcfLVjV8cP/IvKCUZRnYHWUQ6ev3eTeCWgWq3o=; b=xmQ4zzXrmhBnIIvgdQSOfOROA8oQZUpEWuX8CFwfeCU+fFRw+mblDHRmfzE6o1GtQw n6DGuZPGXEUqArJ3JCp+73AxJI9fbKpPhBLqz8Eh+hyqHF+0DX/C53xmeJI4nbZOEKZ0 RI99F59H7XVR469yBRlD0peD2t4ZlGCyMGyWDU3tKMCkZR03uJOcRD03KeRLhHE2LN2B 8WN2oapfj3dPCucI/oxQ2KE39/uCWJRUh/vxuIaDuln5E/Tdt8kgWFT9lmmXCvLNuZeC 1G+k7aITNxqa3pai8WCqsDHwxjtewUbMpmb5fhudZ0P00QcxN53oK8VXd8bthEndYlwp W+ng== X-Received: by 10.52.33.178 with SMTP id s18mr8338662vdi.39.1388174976145; Fri, 27 Dec 2013 12:09:36 -0800 (PST) Original-Received: by 10.58.109.132 with HTTP; Fri, 27 Dec 2013 12:09:36 -0800 (PST) X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2607:f8b0:400c:c03::22b X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:166918 Archived-At: I use describe-char-unicode-data to speak unicode chars in emacspeak when navigating by character. That function claims to be semi-obsolete -- and recommends get-char-code-property instead; however the answers are not consistent in the two solutions. Try non-breaking space \240 (octal) (setq char ? ) 160 (#o240, #xa0, ? ) (describe-char-unicode-data char) returns (("Name" "NO-BREAK SPACE") ("Category" "Separator, Space") ("Combining class" "Zs") ("Bidi category" "Zs") ("Decomposition" "noBreak ") ("Decimal digit value" nil) ("Digit value" nil) ("Numeric value" nil) ("Mirrored" nil) ("Old name" "NON-BREAKING SPACE") ("ISO 10646 comment" nil) ("Uppercase" nil) ...) whereas (get-char-code-property char 'name) returns "COMMA NO-SHE IOTA ARABIC DIAERESIS" also looking up char in ucs-names yields something different (car (rassoc char (ucs-names))) returns "NON-NEITHER UNDER CM COMMA" Note that all of the above is with the unicdoedata file present and correctly found by emacs. Also, and perhaps related: C-x 8 spc inserts non-breaking space correctly; however when you use C-x 8 ret, non-breaking space doesn't show up in the available completions -- either using it's "name" or "old-name" --