From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: YE Qianchuan Newsgroups: gmane.emacs.help Subject: Re: How to get the script name symbols of a specific character? Date: Mon, 11 Feb 2013 23:17:58 +0800 Message-ID: <51190BA6.1070500@gmail.com> References: <5117C3FC.5020608@gmail.com> <87bobr13v8.fsf@gmail.com> <51190927.7070807@gmail.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1360595992 14867 80.91.229.3 (11 Feb 2013 15:19:52 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 11 Feb 2013 15:19:52 +0000 (UTC) Cc: help-gnu-emacs@gnu.org To: Jambunathan K Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon Feb 11 16:20:14 2013 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1U4vAz-0002sy-5Z for geh-help-gnu-emacs@m.gmane.org; Mon, 11 Feb 2013 16:20:13 +0100 Original-Received: from localhost ([::1]:60731 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U4vAf-00030e-SG for geh-help-gnu-emacs@m.gmane.org; Mon, 11 Feb 2013 10:19:53 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:33718) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U4vAY-000309-KL for help-gnu-emacs@gnu.org; Mon, 11 Feb 2013 10:19:48 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1U4vAS-0007De-DN for help-gnu-emacs@gnu.org; Mon, 11 Feb 2013 10:19:46 -0500 Original-Received: from mail-pa0-f50.google.com ([209.85.220.50]:48810) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U4vAS-0007DT-71 for help-gnu-emacs@gnu.org; Mon, 11 Feb 2013 10:19:40 -0500 Original-Received: by mail-pa0-f50.google.com with SMTP id fa11so3063346pad.37 for ; Mon, 11 Feb 2013 07:19:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:message-id:date:from:user-agent:mime-version:to:cc :subject:references:in-reply-to:content-type :content-transfer-encoding; bh=oZt/NJhVAbiKBI8R88/xqnpC35uiV5dB+6wUuBBTHTk=; b=Jvst9EQc6yC3hNJM6MhaYZpvhOcmokUcue7GWR3SL8Q6pon8W61xiH8BrotQe4lPO6 6O8k1TbnFwWuWyi/DlxZaTQ6gRSacJjHYgdVt8sdssl/BSe4Fl3vMP2X2cki2LncRu9H rUy7ZWARZik02rs4Ch1ucnvmA4lgJ75IrFvDhMX4DClJ3CQI2z/tkyUNgfb80HVsP8tF MZUCBBcA2hZDlHF8SMUon7lDE5c9q3CSaYQJxjcjoDsCHWgePkcQmRw6HErdHJC99U9X axBPGdYv20lrbjTjY9xfPzIHo0AaxR+uYlJhJD/jwe1QkEAzfVL5nWh98NR34zQ+689D 7y7Q== X-Received: by 10.66.73.5 with SMTP id h5mr41762257pav.11.1360595979327; Mon, 11 Feb 2013 07:19:39 -0800 (PST) Original-Received: from [192.168.1.101] ([210.38.1.83]) by mx.google.com with ESMTPS id z6sm41255604pav.3.2013.02.11.07.19.36 (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Mon, 11 Feb 2013 07:19:38 -0800 (PST) User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130109 Thunderbird/17.0.2 In-Reply-To: <51190927.7070807@gmail.com> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x [fuzzy] X-Received-From: 209.85.220.50 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:89075 Archived-At: On 02/11/2013 11:07 PM, YE Qianchuan wrote: > On 02/11/2013 07:34 PM, Jambunathan K wrote: >> Put your cursor on the box and type >> C-u C-x = > In fact, it's the same as `describe-char'. This command invokes > `what-cursor-position', which invokes `describe-char' eventually. >> >> It will give more useful pointers. The codepoint of a particular >> character. The name of the character, in the example below is prefixed >> by the script it comes from etc. > Cool, I didn't notice its name may be prefixed by its script. It does > make a lot sense. > > However sadly, not all characters do so. For example, a CJK character > has prefix CJK. > But cjk is not a script name (though there's a script called cjk-misc) > and it should belong > to `han'. > > What's worse is, some characters don't show their names at all, even > if I assign a font to it. > > For example: > position: 806 of 1031 (78%), column: 1 > character: 😀 (displayed as 😀) (codepoint 128512, > #o373000, #x1f600) > preferred charset: unicode (Unicode (ISO10646)) > code point in charset: 0x1F600 > syntax: w which means: word > category: L:Left-to-right (strong) > buffer code: #xF0 #x9F #x98 #x80 > file code: #xF0 #x9F #x98 #x80 (encoded by coding system > utf-8-unix) > display: no font available > > Character code properties: customize what to show > general-category: Cn (Other, Not Assigned) > decomposition: (128512) ('😀') Additional example, this character's name don't show any connection to its script. How do you get its script symbol? position: 870 of 1031 (84%), column: 65 character: 😠 (displayed as 😠) (codepoint 128544, #o373040, #x1f620) preferred charset: unicode (Unicode (ISO10646)) code point in charset: 0x1F620 syntax: w which means: word category: .:Base buffer code: #xF0 #x9F #x98 #xA0 file code: #xF0 #x9F #x98 #xA0 (encoded by coding system utf-8-unix) display: by this font (glyph code) xft:-unknown-Symbola-normal-normal-semi-condensed-*-15-*-*-*-*-0-iso10646-1 (#x1ADE) Character code properties: customize what to show name: ANGRY FACE general-category: So (Symbol, Other) decomposition: (128544) ('😠')