From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Jambunathan K Newsgroups: gmane.emacs.help Subject: Re: How to get the script name symbols of a specific character? Date: Tue, 12 Feb 2013 01:27:28 +0530 Message-ID: <878v6ur5cn.fsf@gmail.com> References: <5117C3FC.5020608@gmail.com> <87bobr13v8.fsf@gmail.com> <51190927.7070807@gmail.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1360612696 23401 80.91.229.3 (11 Feb 2013 19:58:16 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 11 Feb 2013 19:58:16 +0000 (UTC) Cc: help-gnu-emacs@gnu.org To: YE Qianchuan Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon Feb 11 20:58:37 2013 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1U4zWH-0007XP-5N for geh-help-gnu-emacs@m.gmane.org; Mon, 11 Feb 2013 20:58:29 +0100 Original-Received: from localhost ([::1]:53219 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U4zVw-0006y7-TC for geh-help-gnu-emacs@m.gmane.org; Mon, 11 Feb 2013 14:58:08 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:54298) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U4zVp-0006xq-BP for help-gnu-emacs@gnu.org; Mon, 11 Feb 2013 14:58:04 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1U4zVn-0007Wi-QL for help-gnu-emacs@gnu.org; Mon, 11 Feb 2013 14:58:01 -0500 Original-Received: from mail-pa0-f53.google.com ([209.85.220.53]:64693) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U4zVn-0007Wc-Jj for help-gnu-emacs@gnu.org; Mon, 11 Feb 2013 14:57:59 -0500 Original-Received: by mail-pa0-f53.google.com with SMTP id bg4so3177450pad.26 for ; Mon, 11 Feb 2013 11:57:58 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-type :content-transfer-encoding; bh=2u6f0pmZz4hps61KeN35SR5MMx/Y7N9KnZeDh7p/4bc=; b=F9fChXsqaSvknBCgzsa73OrtNmQMPZDslsWoFf5q31ESMcGVGqtkdfA6ffBXQJX/Vl S+urWzAW2c18BSgzbI6Z4EQvroD+OElDFz1/XxsQugs+wsqXcXGs+lzA57iUkDmfGqu/ hQ9W5cR67kO1FudhgnovCwi92fMkQNs3pksruDJ0xjFNOknaV08APhUfuXBmhZMJAjiz OYllqIMJR+omr4do1AUVj54azo/AUOy1/3EbnCL7DKGZ1c9+pnQJf0ES70fDQAHiYc3X DXkuQ3TuuqeL1XI8e+0lI0jk2uteUQeIzFl/o0QxF96W927co9vz530a2JiHOhDP+gAK 2yTA== X-Received: by 10.66.72.233 with SMTP id g9mr44065654pav.29.1360612678466; Mon, 11 Feb 2013 11:57:58 -0800 (PST) Original-Received: from debian-6.05 ([115.242.166.47]) by mx.google.com with ESMTPS id d1sm68845913pav.6.2013.02.11.11.57.54 (version=TLSv1.1 cipher=RC4-SHA bits=128/128); Mon, 11 Feb 2013 11:57:56 -0800 (PST) In-Reply-To: <51190927.7070807@gmail.com> (YE Qianchuan's message of "Mon, 11 Feb 2013 23:07:19 +0800") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (gnu/linux) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x [fuzzy] X-Received-From: 209.85.220.53 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:89078 Archived-At: YE Qianchuan writes: > On 02/11/2013 07:34 PM, Jambunathan K wrote: >> Put your cursor on the box and type >> C-u C-x =3D > In fact, it's the same as `describe-char'. This command invokes > `what-cursor-position', which invokes `describe-char' eventually. >> >> It will give more useful pointers. The codepoint of a particular >> character. The name of the character, in the example below is prefixed >> by the script it comes from etc. > Cool, I didn't notice its name may be prefixed by its script. It does > make a lot sense. > > However sadly, not all characters do so. For example, a CJK character > has prefix CJK. > But cjk is not a script name (though there's a script called cjk-misc) > and it should belong > to `han'. > > What's worse is, some characters don't show their names at all, even > if I assign a font to it. > > For example: > position: 806 of 1031 (78%), column: 1 > character: =F0=9F=98=80 (displayed as =F0=9F=98=80) (codepoin= t 128512, #o373000, > #x1f600) > preferred charset: unicode (Unicode (ISO10646)) > code point in charset: 0x1F600 > syntax: w which means: word > category: L:Left-to-right (strong) > buffer code: #xF0 #x9F #x98 #x80 > file code: #xF0 #x9F #x98 #x80 (encoded by coding system > utf-8-unix) > display: no font available > > Character code properties: customize what to show > general-category: Cn (Other, Not Assigned) > decomposition: (128512) ('=F0=9F=98=80') This is what I get. Emacs reports that it is a GRINNING FACE.=20=20 I run Emacs from trunk though. I am not sure this makes any actuall difference. I think it would be useful to have one browse different Unicode Blocks or have C-u C-x =3D report the block name of a character. I am just going by what the below mentioned Wikipedia article suggests. ,---- http://en.wikipedia.org/wiki/Unicode_block | U+1F600..U+1F64F Emoticons 80 1 SMP Common `---- position: 1706 of 2799 (61%), column: 28 character: =F0=9F=98=80 (displayed as =F0=9F=98=80) (codepoint = 128512, #o373000, #x1f600) preferred charset: unicode (Unicode (ISO10646)) code point in charset: 0x1F600 syntax: w which means: word category: .:Base to input: type "C-x 8 RET HEX-CODEPOINT" or "C-x 8 RET NAME" buffer code: #xF0 #x9F #x98 #x80 file code: not encodable by coding system undecided-unix display: no font available Character code properties: customize what to show name: GRINNING FACE general-category: So (Symbol, Other) decomposition: (128512) ('=F0=9F=98=80') --=20