From: jsbien@mimuw.edu.pl (Janusz S. Bień)
Newsgroups: gmane.emacs.bugs
Subject: bug#32599: 25.2; Feature request: input PUA characters by name
Date: Sun, 26 May 2019 17:18:21 +0200
Message-ID: <86pno56y82.fsf@mimuw.edu.pl>
Reply-To: jsbien@mimuw.edu.pl
To: Eli Zaretskii
Cc: 32599@debbugs.gnu.org
In-Reply-To:
 <83lfyt2s25.fsf@gnu.org> (Eli Zaretskii's message of "Sun, 26 May 2019 17:45:06 +0300")

On Sun, May 26 2019 at 17:45 +03, Eli Zaretskii wrote:
>> From: jsbien@mimuw.edu.pl (Janusz S. Bień)
>> Date: Sun, 26 May 2019 10:10:02 +0200
>>
>> > First, the MUFI data in a more convenient form are available here:
>> >
>> > On Mon, Aug 27 2018 at 9:00 +0200, jsbien@mimuw.edu.pl writes:
>> >
>> > [...]
>> >
>> >> https://bitbucket.org/jsbien/unihistext/src/master/example/
>>
>> If you prefer a file pattern after UnicodeData.txt, you can find it
>> here:
>>
>> http://www.kreativekorp.com/charset/PUADATA/PUBLIC/MUFI/
>>
>> >
>> > Secondly, other users may be interested in other sets of PUA characters,
>> > cf.
>> >
>> > http://andron-typeforum.xobor.de/t10f13-Towards-a-linguistic-corporate-use-area-LINCUA.html
>> > https://en.wikipedia.org/wiki/ConScript_Unicode_Registry
>>
>> or Under-ConScript Unicode Registry:
>>
>> http://www.kreativekorp.com/ucsur/
>
> The UnicodeData.txt file is compiled into Emacs,

I know, and I'm curious whether that is really necessary. Why can't it
be loaded at startup? The advantage would be that the user could always
use the up-to-date version of UnicodeData.txt (have you noticed that
since 7 May we have Unicode 12.1, because SQUARE ERA NAME REIWA was
added?).
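
To make the load-at-startup idea a bit more concrete, here is a minimal
sketch, in Python rather than Emacs Lisp, of reading a file in the
semicolon-separated UnicodeData.txt layout into a name-to-code-point
table at run time. It is only an illustration of the idea, not how Emacs
does or would do it. The names parse_unicode_data and is_bmp_pua are
hypothetical, and the E000 entry is a made-up placeholder, not taken
from the MUFI data.

```python
from typing import Dict, Iterable, Tuple


def parse_unicode_data(lines: Iterable[str]) -> Dict[str, Tuple[int, str]]:
    """Map each character name to (code point, general category).

    Expects lines in the UnicodeData.txt layout: semicolon-separated
    fields, with the code point in hex first, then the name, then the
    general category. Blank lines and '#' comment lines are skipped.
    """
    table: Dict[str, Tuple[int, str]] = {}
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split(";")
        if len(fields) < 3:
            continue  # not a data line in the expected layout
        table[fields[1]] = (int(fields[0], 16), fields[2])
    return table


def is_bmp_pua(code: int) -> bool:
    """True for code points in the BMP Private Use Area (U+E000..U+F8FF)."""
    return 0xE000 <= code <= 0xF8FF


SAMPLE = [
    # The entry added in Unicode 12.1.
    "32FF;SQUARE ERA NAME REIWA;So;0;L;<square> 4EE4 548C;;;;N;;;;;",
    # Hypothetical PUA entry, for illustration only.
    "E000;EXAMPLE PRIVATE USE CHARACTER;Co;0;L;;;;;N;;;;;",
]

names = parse_unicode_data(SAMPLE)
```

With such a table, a standard file and any number of user-chosen PUA
files could be merged simply by parsing them one after another into the
same dictionary.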
> but the files you mention cannot be compiled into it, because they
> vary, and because different users might want different lists of
> characters to be supported. So we need to design how this will work.

My naive idea is to "cheat" Emacs by providing it with the extended
data without changing the original logic. Since efficiency is less
important than convenience here, perhaps you could advise the
'describe-char' function to look for the data elsewhere.

> In addition, I think PUA codepoints aren't really treated as
> characters in Emacs, so there's a need for some infrastructure
> changes.

I do not propose to support the supplementary PUA planes. For the BMP
this probably boils down to the availability of the property
information. As we now have a pseudo-UnicodeData.txt for the PUA
characters (at least those I'm interested in), this doesn't seem to me
a big problem.

> Patches welcome.

Unfortunately I'm unable to provide them myself.

Best regards

Janusz

-- 
             ,
Janusz S. Bien
emeryt (emeritus)
https://sites.google.com/view/jsbien