From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "T.V. Raman" Newsgroups: gmane.emacs.devel Subject: Unicode Char Names: describe-char-unicode-data vs get-char-code-property Date: Fri, 27 Dec 2013 13:05:49 -0800 Message-ID: References: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 X-Trace: ger.gmane.org 1388178347 8409 80.91.229.3 (27 Dec 2013 21:05:47 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 27 Dec 2013 21:05:47 +0000 (UTC) To: emacs-devel@gnu.org, raman@google.com Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Dec 27 22:05:54 2013 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1VwebR-0002QV-TL for ged-emacs-devel@m.gmane.org; Fri, 27 Dec 2013 22:05:54 +0100 Original-Received: from localhost ([::1]:50282 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1VwebR-0007q4-CM for ged-emacs-devel@m.gmane.org; Fri, 27 Dec 2013 16:05:53 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:51718) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1VwebP-0007or-0e for emacs-devel@gnu.org; Fri, 27 Dec 2013 16:05:51 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1VwebN-0004en-VF for emacs-devel@gnu.org; Fri, 27 Dec 2013 16:05:50 -0500 Original-Received: from mail-vc0-x22c.google.com ([2607:f8b0:400c:c03::22c]:57263) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1VwebN-0004ee-Pa for emacs-devel@gnu.org; Fri, 27 Dec 2013 16:05:49 -0500 Original-Received: by mail-vc0-f172.google.com with SMTP id ij19so4922075vcb.17 for ; Fri, 27 Dec 2013 13:05:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=sBTgVEQ+0NqyCLGEG9tK5efuANMRBIIyQqi/lIjiVC8=; b=i2TY9BcoK460wT876rSwB1V8JlbG9ICNeIzTrA4ElHZcUGb/L2ZAwGkHXcihwADzXw 2+J3kkAP+bbu/uF+IcL+2yPfn98zdXLuXEWHSJ66cBlkIhT2j6hfOpFVFyOrgHDrlsOt 1jr8OLXDqtcM746ULWSBLJPxdWgnppXcCRtIRR+o31L9wrOTGhi2+es6dj6QUTiFxVZW uKsEoyEFDCj+4spK9gDJZNXHzEgpnYiv3ndR5lJYKeyCQxHOP1AVpGmLEmW6QyMp+MEk Io5zz3jpMkpGNn2Aih0KG7AiZf2EnzSRhGjTuejFVhTA6L5zogQ+0FndThqL87gr60IB e65w== X-Received: by 10.220.11.5 with SMTP id r5mr7293188vcr.50.1388178349098; Fri, 27 Dec 2013 13:05:49 -0800 (PST) Original-Received: by 10.58.109.132 with HTTP; Fri, 27 Dec 2013 13:05:49 -0800 (PST) In-Reply-To: X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2607:f8b0:400c:c03::22c X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:166924 Archived-At: Following-up to myself and Eli: I moved out my .emacs.d/elpa directory and the problem goes away --- it's going to be a lot harder to identify the offending elpa installed package:-) -- -- On 12/27/13, T.V. Raman wrote: > I use describe-char-unicode-data to speak unicode chars in > emacspeak when navigating by character. > > That function claims to be semi-obsolete -- and recommends > get-char-code-property instead; however the answers are not > consistent in the two solutions. > > Try non-breaking space \240 (octal) > > (setq char ? ) > 160 (#o240, #xa0, ? ) > (describe-char-unicode-data char) returns > (("Name" "NO-BREAK SPACE") ("Category" "Separator, Space") > ("Combining class" "Zs") ("Bidi category" "Zs") ("Decomposition" > "noBreak ") ("Decimal digit value" nil) ("Digit value" nil) > ("Numeric value" nil) ("Mirrored" nil) ("Old name" "NON-BREAKING > SPACE") ("ISO 10646 comment" nil) ("Uppercase" nil) ...) > whereas > (get-char-code-property char 'name) returns > "COMMA NO-SHE IOTA ARABIC DIAERESIS" > also looking up char in ucs-names yields something different > (car (rassoc char (ucs-names))) returns > "NON-NEITHER UNDER CM COMMA" > Note that all of the above is with the unicdoedata file present > and correctly found by emacs. > > > Also, and perhaps related: > > C-x 8 spc inserts non-breaking space correctly; however when you > use C-x 8 ret, non-breaking space doesn't show up in the > available completions -- either using it's "name" or "old-name" > -- >