From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.help,gmane.comp.python.general Subject: Re: Different names for Unicode codepoint Date: Thu, 21 Apr 2016 22:40:17 +0300 Message-ID: <83h9eu699a.fsf@gnu.org> References: <87wpnqsrzz.fsf@metapensiero.it> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1461267685 7738 80.91.229.3 (21 Apr 2016 19:41:25 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 21 Apr 2016 19:41:25 +0000 (UTC) Cc: python-list@python.org To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Thu Apr 21 21:41:17 2016 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1atKTT-0007qD-UG for geh-help-gnu-emacs@m.gmane.org; Thu, 21 Apr 2016 21:41:16 +0200 Original-Received: from localhost ([::1]:46941 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1atKTT-0004rI-EH for geh-help-gnu-emacs@m.gmane.org; Thu, 21 Apr 2016 15:41:15 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:55800) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1atKSr-0003vC-1z for help-gnu-emacs@gnu.org; Thu, 21 Apr 2016 15:40:37 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1atKSm-0004w1-Rw for help-gnu-emacs@gnu.org; Thu, 21 Apr 2016 15:40:36 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:43496) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1atKSb-0004sn-Pw; Thu, 21 Apr 2016 15:40:21 -0400 Original-Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:1555 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1atKSb-0002PJ-6D; Thu, 21 Apr 2016 15:40:21 -0400 In-reply-to: <87wpnqsrzz.fsf@metapensiero.it> (message from Lele Gaifax on Thu, 21 Apr 2016 21:04:32 +0200) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.org gmane.emacs.help:109801 gmane.comp.python.general:794105 Archived-At: > From: Lele Gaifax > Date: Thu, 21 Apr 2016 21:04:32 +0200 > Cc: python-list@python.org > > is there a particular reason for the slightly different names that Emacs > (version 25.0.92) and Python (version 3.6.0a0) give to a single Unicode entity? They don't. > Just to mention one codepoint, ⋖ is called "LESS THAN WITH DOT" accordingly to > Emacs' C-x 8 RET TAB menu, while in Python: > > >>> import unicodedata > >>> unicodedata.name('⋖') > 'LESS-THAN WITH DOT' > >>> print("\N{LESS THAN WITH DOT}") > File "", line 1 > SyntaxError: (unicode error) ...: unknown Unicode character name Emacs shows both the "Name" and the "Old Name" properties of characters as completion candidates, while Python evidently supports only "Name". If you type "C-x 8 RET LESS TAB", then you will see among the completion candidates both "LESS THAN WITH DOT" and "LESS-THAN WITH DOT". The former is the "old name" of this character, according to the Unicode Character Database (which is where Emacs obtains the names and other properties of characters).