From: YE Qianchuan
Newsgroups: gmane.emacs.help
Subject: Re: How to get the script name symbols of a specific character?
Date: Mon, 11 Feb 2013 23:07:19 +0800
Message-ID: <51190927.7070807@gmail.com>
References: <5117C3FC.5020608@gmail.com> <87bobr13v8.fsf@gmail.com>
To: Jambunathan K
Cc: help-gnu-emacs@gnu.org

On 02/11/2013 07:34 PM, Jambunathan K wrote:
> Put your cursor on the box and type
>   C-u C-x =

In fact, it's the same as `describe-char': `C-x =' runs
`what-cursor-position', which calls `describe-char' when given a
prefix argument.

>
> It will give more useful pointers.  The codepoint of a particular
> character.  The name of the character, in the example below is prefixed
> by the script it comes from etc.

Cool, I didn't notice that its name may be prefixed by its script.  It
does make a lot of sense.  Sadly, though, not all characters do so.  For
example, a CJK character has the prefix CJK, but cjk is not a script
name (though there's a script called cjk-misc); it should belong to
`han'.  What's worse, some characters don't show their names at all,
even if I assign a font to them.  For example:

  position: 806 of 1031 (78%), column: 1
  character: 😀 (displayed as 😀) (codepoint 128512, #o373000, #x1f600)
  preferred charset: unicode (Unicode (ISO10646))
  code point in charset: 0x1F600
  syntax: w    which means: word
  category: L:Left-to-right (strong)
  buffer code: #xF0 #x9F #x98 #x80
  file code: #xF0 #x9F #x98 #x80 (encoded by coding system utf-8-unix)
  display: no font available

  Character code properties: customize what to show
    general-category: Cn (Other, Not Assigned)
    decomposition: (128512) ('😀')

> ,----
> | position: 192 of 196 (97%), column: 0
> | character: ஜ (displayed as ஜ) (codepoint 2972, #o5634, #xb9c)
> | preferred charset: unicode (Unicode (ISO10646))
> | code point in charset: 0x0B9C
> | syntax: w    which means: word
> | category: .:Base, L:Left-to-right (strong)
> | to input: type "ja" with tamil-itrans input method
> | buffer code: #xE0 #xAE #x9C
> | file code: #xE0 #xAE #x9C (encoded by coding system utf-8)
> | display: by this font (glyph code)
> |     xft:-unknown-Lohit Tamil-normal-normal-normal-*-24-*-*-*-*-0-iso10646-1 (#x44)
> |
> | Character code properties: customize what to show
> |   name: TAMIL LETTER JA
> |   general-category: Lo (Letter, Other)
> |   decomposition: (2972) ('ஜ')
> |
> | There are text properties here:
> |   fontified t
> `----
>
> Also you may want to look at this page:
> http://en.wikipedia.org/wiki/Unicode_block
>
>> How can I achieve this? Do I miss something?
>> Thanks for your help.
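
Since the name prefix is not a reliable script indicator, one option may be to ask Emacs for the script symbol directly instead of parsing the `describe-char' output.  The following is only a sketch, not something proposed in this thread: it assumes a GNU Emacs that provides the char-table `char-script-table' and the function `get-char-code-property', and the helper name `my-char-script-and-name' is made up for illustration.

```elisp
;; Sketch only.  `my-char-script-and-name' is a hypothetical helper;
;; `char-script-table' and `get-char-code-property' are assumed to be
;; available in the running Emacs.
(defun my-char-script-and-name (char)
  "Return a cons (SCRIPT . NAME) for CHAR.
SCRIPT is the symbol stored for CHAR in `char-script-table'.
NAME is the Unicode character name, or nil when the Unicode data
shipped with this Emacs has no name for CHAR."
  (cons (aref char-script-table char)
        (get-char-code-property char 'name)))

;; The Tamil letter from the quoted example.
(my-char-script-and-name ?ஜ)

;; A CJK ideograph: the script comes from the table, not the name prefix.
(my-char-script-and-name ?中)
```

For the Tamil letter this should give the symbol `tamil' together with the name shown above, "TAMIL LETTER JA".  For a character reported as general-category Cn, such as U+1F600 in the output above, the name is likely to come back as nil, which would suggest that the Unicode data in that Emacs does not yet cover the code point.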