all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Eli Zaretskii <eliz@gnu.org>
To: help-gnu-emacs@gnu.org
Subject: Re: How to get the script name symbols of a specific character?
Date: Mon, 11 Feb 2013 22:08:56 +0200	[thread overview]
Message-ID: <837gme62av.fsf@gnu.org> (raw)
In-Reply-To: <878v6ur5cn.fsf@gmail.com>

> From: Jambunathan K <kjambunathan@gmail.com>
> Date: Tue, 12 Feb 2013 01:27:28 +0530
> Cc: help-gnu-emacs@gnu.org
> 
> YE Qianchuan <stool.ye@gmail.com> writes:
> 
> > On 02/11/2013 07:34 PM, Jambunathan K wrote:
> >> Put your cursor on the box and type
> >>          C-u C-x =
> > In fact, it's the same as `describe-char'. This command invokes
> > `what-cursor-position', which invokes `describe-char' eventually.
> >>
> >> It will give more useful pointers.  The codepoint of a particular
> >> character.  The name of the character, in the example below is prefixed
> >> by the script it comes from etc.
> > Cool, I didn't notice its name may be prefixed by its script. It does
> > make a lot sense.
> >
> > However sadly, not all characters do so. For example, a CJK character
> > has prefix CJK.
> > But cjk is not a script name (though there's a script called cjk-misc)
> > and it should belong
> > to `han'.
> >
> > What's worse is, some characters don't show their names at all, even
> > if I assign a font to it.
> >
> > For example:
> >              position: 806 of 1031 (78%), column: 1
> >             character: 😀 (displayed as 😀) (codepoint 128512, #o373000,
> > #x1f600)
> >     preferred charset: unicode (Unicode (ISO10646))
> > code point in charset: 0x1F600
> >                syntax: w     which means: word
> >              category: L:Left-to-right (strong)
> >           buffer code: #xF0 #x9F #x98 #x80
> >             file code: #xF0 #x9F #x98 #x80 (encoded by coding system
> > utf-8-unix)
> >               display: no font available
> >
> > Character code properties: customize what to show
> >   general-category: Cn (Other, Not Assigned)
> >   decomposition: (128512) ('😀')
> 
> This is what I get.  Emacs reports that it is a GRINNING FACE.  
> 
> I run Emacs from trunk though.  I am not sure this makes any actuall
> difference.

The names come from the Unicode character database (UCD) that is
processed into a bunch of Emacs Lisp files and then preloaded into
Emacs.  The version of the Unicode database built into Emacs
determines which codepoints have names and which don't.

> I think it would be useful to have one browse different Unicode Blocks
> or have C-u C-x = report the block name of a character.

If that data is not in the UCD, Emacs cannot know it, unless someone
adds it to Emacs.




  reply	other threads:[~2013-02-11 20:08 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-02-10 15:59 How to get the script name symbols of a specific character? YE Qianchuan
2013-02-11  2:55 ` Jambunathan K
2013-02-11 10:48   ` YE Qianchuan
2013-02-11 11:00     ` Jambunathan K
2013-02-11 14:50       ` YE Qianchuan
2013-02-11 11:34 ` Jambunathan K
2013-02-11 15:07   ` YE Qianchuan
2013-02-11 15:17     ` YE Qianchuan
2013-02-11 19:57     ` Jambunathan K
2013-02-11 20:08       ` Eli Zaretskii [this message]
2013-02-11 21:46         ` Jambunathan K
2013-02-11 15:57   ` Stefan Monnier
2013-02-12 15:22     ` YE Qianchuan
2013-02-11 20:11   ` T.F. Torrey
2013-02-12 15:12 ` YE Qianchuan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=837gme62av.fsf@gnu.org \
    --to=eliz@gnu.org \
    --cc=help-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.