unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#26477: what-cursor-position should mention "U+"
@ 2017-04-13 10:45 積丹尼 Dan Jacobson
  2017-04-13 11:59 ` npostavs
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: 積丹尼 Dan Jacobson @ 2017-04-13 10:45 UTC (permalink / raw)
  To: 26477

C-x = (translated from <return>) runs the command what-cursor-position
(found in global-map), which is an interactive compiled Lisp function
in ‘simple.el’.

says:

             position: 120538 of 121236 (99%), column: 0
            character: ○ (displayed as ○) (codepoint 9675, #o22713, #x25cb)
    preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) of Big5 (Chinese traditional))
code point in charset: 0x2172
               script: symbol
               syntax: _ 	which means: symbol
             category: .:Base, c:Chinese, h:Korean, j:Japanese
             to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE"
          buffer code: #xE2 #x97 #x8B
            file code: #xE2 #x97 #x8B (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3)

Character code properties: customize what to show
  name: WHITE CIRCLE
  general-category: So (Symbol, Other)
  decomposition: (9675) ('○')

Why can't it say U+25CB WHITE CIRCLE
anywhere (except fragmented all over the place)?

$ unicode ○
U+25CB WHITE CIRCLE
UTF-8: e2 97 8b UTF-16BE: 25cb Decimal: &#9675; Octal: \022713
○
Category: So (Symbol, Other)
Unicode block: 25A0..25FF; Geometric Shapes
Bidi: ON (Other Neutrals)

P.S., "translated from return"
was due to  C-x <escape> <escape> [repeat-complex-command].





^ permalink raw reply	[flat|nested] 4+ messages in thread

* bug#26477: what-cursor-position should mention "U+"
  2017-04-13 10:45 bug#26477: what-cursor-position should mention "U+" 積丹尼 Dan Jacobson
@ 2017-04-13 11:59 ` npostavs
  2017-04-13 12:10 ` 積丹尼 Dan Jacobson
  2017-04-20 10:12 ` Philipp Stephani
  2 siblings, 0 replies; 4+ messages in thread
From: npostavs @ 2017-04-13 11:59 UTC (permalink / raw)
  To: 積丹尼 Dan Jacobson; +Cc: 26477

severity 26477 wishlist
quit

積丹尼 Dan Jacobson <jidanni@jidanni.org> writes:

>
> Why can't it say U+25CB WHITE CIRCLE
> anywhere (except fragmented all over the place)?

What is "U+", and why should we mention it?






^ permalink raw reply	[flat|nested] 4+ messages in thread

* bug#26477: what-cursor-position should mention "U+"
  2017-04-13 10:45 bug#26477: what-cursor-position should mention "U+" 積丹尼 Dan Jacobson
  2017-04-13 11:59 ` npostavs
@ 2017-04-13 12:10 ` 積丹尼 Dan Jacobson
  2017-04-20 10:12 ` Philipp Stephani
  2 siblings, 0 replies; 4+ messages in thread
From: 積丹尼 Dan Jacobson @ 2017-04-13 12:10 UTC (permalink / raw)
  To: npostavs; +Cc: 26477

http://stackoverflow.com/questions/1273693/why-is-u-used-to-designate-a-unicode-code-point





^ permalink raw reply	[flat|nested] 4+ messages in thread

* bug#26477: what-cursor-position should mention "U+"
  2017-04-13 10:45 bug#26477: what-cursor-position should mention "U+" 積丹尼 Dan Jacobson
  2017-04-13 11:59 ` npostavs
  2017-04-13 12:10 ` 積丹尼 Dan Jacobson
@ 2017-04-20 10:12 ` Philipp Stephani
  2 siblings, 0 replies; 4+ messages in thread
From: Philipp Stephani @ 2017-04-20 10:12 UTC (permalink / raw)
  To: 積丹尼 Dan Jacobson, 26477

[-- Attachment #1: Type: text/plain, Size: 1876 bytes --]

積丹尼 Dan Jacobson <jidanni@jidanni.org> schrieb am Do., 13. Apr. 2017 um
13:48 Uhr:

> C-x = (translated from <return>) runs the command what-cursor-position
> (found in global-map), which is an interactive compiled Lisp function
> in ‘simple.el’.
>
> says:
>
>              position: 120538 of 121236 (99%), column: 0
>             character: ○ (displayed as ○) (codepoint 9675, #o22713, #x25cb)
>     preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) of
> Big5 (Chinese traditional))
> code point in charset: 0x2172
>                script: symbol
>                syntax: _        which means: symbol
>              category: .:Base, c:Chinese, h:Korean, j:Japanese
>              to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE"
>           buffer code: #xE2 #x97 #x8B
>             file code: #xE2 #x97 #x8B (encoded by coding system utf-8-unix)
>               display: by this font (glyph code)
>     x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3)
>
> Character code properties: customize what to show
>   name: WHITE CIRCLE
>   general-category: So (Symbol, Other)
>   decomposition: (9675) ('○')
>
> Why can't it say U+25CB WHITE CIRCLE
> anywhere (except fragmented all over the place)?
>

If you ask for the reason, I guess it's just legacy. I assume
`describe-char' is much older than the Unicode support in Emacs, and it was
never thoroughly redesigned (the Unicode properties are all at the bottom).
My suggestion would be to replace the "(codepoint ...)" part with the
standard code point description "(U+NNNN character name)", and either get
rid of most of the non-Unicode properties (preferred charset, code point in
charset, buffer code, file code) or move them further down. That should be
a relatively simple change in the code of `describe-char'.

[-- Attachment #2: Type: text/html, Size: 2338 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2017-04-20 10:12 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-04-13 10:45 bug#26477: what-cursor-position should mention "U+" 積丹尼 Dan Jacobson
2017-04-13 11:59 ` npostavs
2017-04-13 12:10 ` 積丹尼 Dan Jacobson
2017-04-20 10:12 ` Philipp Stephani

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).