From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#36070: 27; feature request '(Describe Char Unidata List) to include 'kDefinition' value Date: Mon, 03 Jun 2019 18:06:32 +0300 Message-ID: <83tvd6u2rr.fsf@gnu.org> References: <8C3E021E-FB25-4948-8E5F-1395590BAA66@scratch.space> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="252129"; mail-complaints-to="usenet@blaine.gmane.org" Cc: 36070@debbugs.gnu.org To: Van L Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Mon Jun 03 17:23:41 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1hXooG-0013RR-Px for geb-bug-gnu-emacs@m.gmane.org; Mon, 03 Jun 2019 17:23:40 +0200 Original-Received: from localhost ([127.0.0.1]:36598 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hXooF-0000Jd-Kn for geb-bug-gnu-emacs@m.gmane.org; Mon, 03 Jun 2019 11:23:39 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:41024) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hXoYR-0003W5-T2 for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 11:07:21 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hXoYK-0005se-TR for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 11:07:16 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:57383) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1hXoYA-0005eB-RY for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 11:07:06 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1hXoYA-0000zj-K6 for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 11:07:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Mon, 03 Jun 2019 15:07:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 36070 X-GNU-PR-Package: emacs Original-Received: via spool by 36070-submit@debbugs.gnu.org id=B36070.15595744203815 (code B ref 36070); Mon, 03 Jun 2019 15:07:02 +0000 Original-Received: (at 36070) by debbugs.gnu.org; 3 Jun 2019 15:07:00 +0000 Original-Received: from localhost ([127.0.0.1]:42694 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hXoY8-0000zT-0E for submit@debbugs.gnu.org; Mon, 03 Jun 2019 11:07:00 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:54805) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hXoY4-0000zC-8C for 36070@debbugs.gnu.org; Mon, 03 Jun 2019 11:06:57 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:46157) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hXoXv-0005HA-O3; Mon, 03 Jun 2019 11:06:47 -0400 Original-Received: from [176.228.60.248] (port=3783 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hXoXq-0003RK-Ee; Mon, 03 Jun 2019 11:06:43 -0400 In-reply-to: <8C3E021E-FB25-4948-8E5F-1395590BAA66@scratch.space> (message from Van L on Mon, 3 Jun 2019 22:00:30 +1000) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:160055 Archived-At: > From: Van L > Date: Mon, 3 Jun 2019 22:00:30 +1000 > > The details retrieved by 'M-x describe-char' on '入' show the following > > --8<---------------cut here---------------start------------->8--- > Character code properties: customize what to show > name: CJK IDEOGRAPH-5165 > general-category: Lo (Letter, Other) > decomposition: (20837) ('入') > --8<---------------cut here---------------end--------------->8--- This comes from UnicodeData.txt, our source for the Unicode properties of all the characters. We parse it into uni-*.el files as part of the build. > Following the customize link to 'Describe Char Unidata List' > I find more information can be had from [1] . > > The Readings table, in particular, is nice to have for the 'kDefinition'. > > --8<---------------cut here---------------start------------->8--- > | Data type | Value | > |-------------+--------------------------| > | kDefinition | enter, come in(to), join | > | | | > --8<---------------cut here---------------end--------------->8--- This comes from Unihan_Reading.txt, a different file that is part of the Unihan database. We don't currently have a property where to put this value, so we need first to extend the properties. And then we will need to parse the above file and populate the property. Patches welcome. Bonus points for reviewing other properties of the Unihan DB and adding whatever is useful. See UAX#38 (http://www.unicode.org/reports/tr38/), for the description of the properties. Thanks.