From: handa@gnu.org (K. Handa)
Newsgroups: gmane.emacs.devel
Subject: Re: ucs-normalize and diacritics
Date: Tue, 31 Jul 2018 21:37:40 +0900
Message-ID: <87muu7h97v.fsf@gnu.org>
References: <83zhydrwoh.fsf@gnu.org>
To: Eli Zaretskii
Cc: rpluim@gmail.com, emacs-devel@gnu.org
In-Reply-To: <83zhydrwoh.fsf@gnu.org> (message from Eli Zaretskii on Fri, 27 Jul 2018 09:59:26 +0300)

In article <83zhydrwoh.fsf@gnu.org>, Eli Zaretskii writes:

> > > - It doesn't show this for the whole range 0200 to 0377, only for 0240
> > > to 0377, eg for 0200:
> >
> > That's a bug, somewhere. I will look into it when I have time.

> Not a bug, an unintended consequence of how we set up charsets. Now
> fixed on the emacs-26 branch.

> I wonder whether we can mark tis620-2533 a :supplementary-p charset.
> Kenichi, do you envision any problems that could be caused by that?

If we add :supplementary-p t to tis620-2533, M-x list-character-sets
will list that charset under the "Supplementary Character Sets" section,
which will surprise tis620-2533 users (I think there are still many).
And anyway, that change won't alter the behavior of char-charset, which
describe-char uses.

I have not yet read all the mail in this thread, but if the problem is
that describe-char does not show eight-bit as the charset of eight-bit
characters, perhaps the best way is to modify describe-char like this:

diff --git a/lisp/descr-text.el b/lisp/descr-text.el
index d8f8188eb1..aea090b907 100644
--- a/lisp/descr-text.el
+++ b/lisp/descr-text.el
@@ -403,6 +403,7 @@ describe-char
            (eight-bit-p (and (not enable-multibyte-characters) (>= char 128)))
            (charset (if eight-bit-p 'eight-bit
                       (or (get-text-property pos 'charset)
+                          (and (encode-char char 'eight-bit) 'eight-bit)
                           (char-charset char))))
            (composition (find-composition pos nil nil t))
            (component-chars nil)

---
K. Handa
handa@gnu.org