From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Philipp Stephani Newsgroups: gmane.emacs.bugs Subject: bug#26477: what-cursor-position should mention "U+" Date: Thu, 20 Apr 2017 10:12:55 +0000 Message-ID: References: <87k26ok2mh.fsf@jidanni.org> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a114b391a132bd1054d9663b0 X-Trace: blaine.gmane.org 1492683256 9740 195.159.176.226 (20 Apr 2017 10:14:16 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Thu, 20 Apr 2017 10:14:16 +0000 (UTC) To: =?UTF-8?Q?=E7=A9=8D=E4=B8=B9=E5=B0=BC?= Dan Jacobson , 26477@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu Apr 20 12:14:12 2017 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1d196J-0002OB-SL for geb-bug-gnu-emacs@m.gmane.org; Thu, 20 Apr 2017 12:14:12 +0200 Original-Received: from localhost ([::1]:52772 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1d196N-0000Z2-R8 for geb-bug-gnu-emacs@m.gmane.org; Thu, 20 Apr 2017 06:14:15 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:37644) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1d196F-0000XT-Vg for bug-gnu-emacs@gnu.org; Thu, 20 Apr 2017 06:14:09 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1d196A-0001LH-UC for bug-gnu-emacs@gnu.org; Thu, 20 Apr 2017 06:14:07 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:59784) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1d196A-0001L8-Ny for bug-gnu-emacs@gnu.org; Thu, 20 Apr 2017 06:14:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1d196A-0004Bc-IR for bug-gnu-emacs@gnu.org; Thu, 20 Apr 2017 06:14:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Philipp Stephani Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 20 Apr 2017 10:14:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 26477 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 26477-submit@debbugs.gnu.org id=B26477.149268319416017 (code B ref 26477); Thu, 20 Apr 2017 10:14:02 +0000 Original-Received: (at 26477) by debbugs.gnu.org; 20 Apr 2017 10:13:14 +0000 Original-Received: from localhost ([127.0.0.1]:57983 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1d195N-0004AH-LR for submit@debbugs.gnu.org; Thu, 20 Apr 2017 06:13:13 -0400 Original-Received: from mail-wm0-f43.google.com ([74.125.82.43]:34959) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1d195M-0004A1-6g for 26477@debbugs.gnu.org; Thu, 20 Apr 2017 06:13:12 -0400 Original-Received: by mail-wm0-f43.google.com with SMTP id w64so99924599wma.0 for <26477@debbugs.gnu.org>; Thu, 20 Apr 2017 03:13:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to; bh=e+9SjUzWcgHAtKBEbqUAOoMb+RIJIFYZKjmMhAPQL8w=; b=jpVqWtrWEy0644h35G2pXcxhsRu54td7GinUV/CLAMJ+exMlqm/skMyd7f+UUSwjDH AXP1krlRPOKQuxJWIyVGVkUOlXVscM0PfPwKaRb5YpLXJBGAXA6kfSW0YDuTpsM6Ydef DsqjqgntSlQ4CZVf6IWzTdTWhSGmOK/0a9mxV/h7wqH8IaAByMoOnz6WbXJkccwMwXPu wt8fV77B8Lmg0TNe75wlM+XTvI5QTqTwB3DVP0YlJpQ1Qvts7gQ5jplygWQv7IizCkaU WNyFRHlMSQqfUd/V1/9d0zHK7aHfmYpfoURLsOerjRCTYY9TRWUAvlrxv/XX/Wa/FaC6 6B8Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=e+9SjUzWcgHAtKBEbqUAOoMb+RIJIFYZKjmMhAPQL8w=; b=Cry64fghZe/Fj6vCXcOs/MIJzYTXveNNkk88zOewRX8d87AwWpTnnvh3wTAchRQ2me OTV+gJbo2GHqa6FbP339/u8eiWlcfUDzSUSVCJw8mt3/EYTOX+5svI2DSV35LOUOiC8W d2xBEMUdUkqrF3LShBg4ldcoOVxC9LzcolrIL0dNQ8/3QVv32mwMTb5IPMHq+8LuDKDN fA0WSwRwWq6t4gl9HhXcA49FfSC0m8mjLyIgSFdZ/GFZfo5sCVPAG5k9AhFI6qNQ0X96 ApbrPc3K5RilQpStQEBwF//FBu6oMsTxh7+md4szi2304hEQUeLoCvVrRZCpugMg/fBW DD8A== X-Gm-Message-State: AN3rC/6tHBNPTKi54nXhpwz527PopdBb1iaDFfS5aTstQFbtDVIwgUeq FGxU77Efe1l2l7jE6hMbQ3YHm936WHKK X-Received: by 10.28.156.13 with SMTP id f13mr2376571wme.44.1492683186181; Thu, 20 Apr 2017 03:13:06 -0700 (PDT) In-Reply-To: <87k26ok2mh.fsf@jidanni.org> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:131797 Archived-At: --001a114b391a132bd1054d9663b0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable =E7=A9=8D=E4=B8=B9=E5=B0=BC Dan Jacobson schrieb am D= o., 13. Apr. 2017 um 13:48 Uhr: > C-x =3D (translated from ) runs the command what-cursor-position > (found in global-map), which is an interactive compiled Lisp function > in =E2=80=98simple.el=E2=80=99. > > says: > > position: 120538 of 121236 (99%), column: 0 > character: =E2=97=8B (displayed as =E2=97=8B) (codepoint 9675= , #o22713, #x25cb) > preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) o= f > Big5 (Chinese traditional)) > code point in charset: 0x2172 > script: symbol > syntax: _ which means: symbol > category: .:Base, c:Chinese, h:Korean, j:Japanese > to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE" > buffer code: #xE2 #x97 #x8B > file code: #xE2 #x97 #x8B (encoded by coding system utf-8-uni= x) > display: by this font (glyph code) > x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3= ) > > Character code properties: customize what to show > name: WHITE CIRCLE > general-category: So (Symbol, Other) > decomposition: (9675) ('=E2=97=8B') > > Why can't it say U+25CB WHITE CIRCLE > anywhere (except fragmented all over the place)? > If you ask for the reason, I guess it's just legacy. I assume `describe-char' is much older than the Unicode support in Emacs, and it was never thoroughly redesigned (the Unicode properties are all at the bottom). My suggestion would be to replace the "(codepoint ...)" part with the standard code point description "(U+NNNN character name)", and either get rid of most of the non-Unicode properties (preferred charset, code point in charset, buffer code, file code) or move them further down. That should be a relatively simple change in the code of `describe-char'. --001a114b391a132bd1054d9663b0 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


=E7=A9= =8D=E4=B8=B9=E5=B0=BC Dan Jacobson <jidanni@jidanni.org> schrieb am Do., 13. Apr. 2017 um 13:48=C2=A0= Uhr:
C-x =3D (translated from <r= eturn>) runs the command what-cursor-position
(found in global-map), which is an interactive compiled Lisp function
in =E2=80=98simple.el=E2=80=99.

says:

=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0position: 120538 of 121236 = (99%), column: 0
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 character: =E2=97=8B (displayed a= s =E2=97=8B) (codepoint 9675, #o22713, #x25cb)
=C2=A0 =C2=A0 preferred charset: chinese-big5-1 (Frequently used part (A141= -C67E) of Big5 (Chinese traditional))
code point in charset: 0x2172
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0script: symbol
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0syntax: _=C2=A0 =C2= =A0 =C2=A0 =C2=A0 which means: symbol
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0category: .:Base, c:Chinese= , h:Korean, j:Japanese
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0to input: type "C-x 8 = RET 25cb" or "C-x 8 RET WHITE CIRCLE"
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 buffer code: #xE2 #x97 #x8B
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 file code: #xE2 #x97 #x8B (encode= d by coding system utf-8-unix)
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 display: by this font (gly= ph code)
=C2=A0 =C2=A0 x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0= (#xA1B3)

Character code properties: customize what to show
=C2=A0 name: WHITE CIRCLE
=C2=A0 general-category: So (Symbol, Other)
=C2=A0 decomposition: (9675) ('=E2=97=8B')

Why can't it say U+25CB WHITE CIRCLE
anywhere (except fragmented all over the place)?

If you ask for the reason, I guess it'= s just legacy. I assume `describe-char' is much older than the Unicode = support in Emacs, and it was never thoroughly redesigned (the Unicode prope= rties are all at the bottom).
My suggestion would be to replace t= he "(codepoint ...)" part with the standard code point descriptio= n "(U+NNNN character name)", and either get rid of most of the no= n-Unicode properties (preferred charset, code point in charset, buffer code= , file code) or move them further down. That should be a relatively simple = change in the code of `describe-char'.
--001a114b391a132bd1054d9663b0--