From mboxrd@z Thu Jan 1 00:00:00 1970
Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail
From: Juri Linkov
Newsgroups: gmane.emacs.bugs
Subject: bug#42256: 27.0.50; composition
Date: Tue, 14 Jul 2020 03:13:12 +0300
Organization: LINKOV.NET
Message-ID: <874kqbar8n.fsf@mail.linkov.net>
References: <83o8oqqed1.fsf@gnu.org> <83tuygpoej.fsf@gnu.org>
 <87v9iwclbq.fsf@mail.linkov.net> <83mu47q48x.fsf@gnu.org>
 <87mu451uo8.fsf@mail.linkov.net> <835zasoi6i.fsf@gnu.org>
 <87365wux23.fsf@mail.linkov.net> <831rlgnkz3.fsf@gnu.org>
 <83sgdvmt3q.fsf@gnu.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (x86_64-pc-linux-gnu)
Cc: rms@gnu.org, 42256@debbugs.gnu.org
To: Eli Zaretskii
In-Reply-To: <83sgdvmt3q.fsf@gnu.org> (Eli Zaretskii's message of "Mon, 13 Jul 2020 16:39:53 +0300")
X-GNU-PR-Message: followup 42256
X-GNU-PR-Package: emacs
List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors"
Xref: news.gmane.io gmane.emacs.bugs:182975

> Why do you need that, may I ask?
> Why is it important to know which
> characters were composed, and in what usage scenario?

For the same reason that there is a need to see the name of the base
character. Displaying information about only part of the composition
(only its first character) does not show which characters participate
in the composition. It is especially useful to know the names of
accents, whose glyphs are usually small and hard to distinguish when
composed with the base character.

> Please also update the Emacs manual, where it describes this display,
> because now the text there is outdated.

Done.

> Actually, the results are inaccurate or even incorrect, at least in
> some cases. Here's one case where the results are wrong:
>
>   emacs -Q
>   C-h h
>   C-u 411 M-g c
>   C-u C-x =
>
> You will see towards the end of the *help* buffer:
>
>   Composed with the following character(s) "്" using this font:
>     harfbuzz:-outline-Kartika-normal-normal-normal-serif-13-*-*-*-p-*-iso10646-1
>   by these glyphs:
>     [0 1 3384 337 12 0 12 9 0 nil]
>   from these character(s):
>     സ (#xd38) MALAYALAM LETTER SA
>     ് (#xd4d) MALAYALAM SIGN VIRAMA
>     ക (#xd15) MALAYALAM LETTER KA
>     ാ (#xd3e) MALAYALAM VOWEL SIGN AA

I tried, but got a different output:

  Composed with the following character(s) "്കാ" using this font:
    ftcrhb:-PfEd-Lohit Malayalam-normal-normal-normal-*-13-*-*-*-*-0-iso10646-1
  by these glyphs:
    [0 3 3384 184 14 0 15 8 5 nil]
    [0 3 3405 71 6 0 6 8 0 nil]
  from these character(s):
    സ (#xd38) MALAYALAM LETTER SA
    ് (#xd4d) MALAYALAM SIGN VIRAMA
    ക (#xd15) MALAYALAM LETTER KA
    ാ (#xd3e) MALAYALAM VOWEL SIGN AA

The difference is in the line "Composed with the following
character(s) "്കാ"" and in the rows of glyphs. And according to the
composition string "്കാ", the list of 4 characters is correct.

> The added list of characters seems to imply that 4 characters were
> composed at buffer position 411. But actually only the first 2 of
> them were composed, as clearly see from the line starting with
> "Composed with" above.
>
> A similar problem happens at buffer position 413 of HELLO.

Here is the output from 'C-u 413 M-g c C-u C-x =':

  Composed with the following character(s) "ം" using this font:
    ftcrhb:-PfEd-Lohit Malayalam-normal-normal-normal-*-13-*-*-*-*-0-iso10646-1
  by these glyphs:
    [0 1 3376 59 8 0 8 8 0 nil]
    [0 1 3330 16 7 0 8 6 0 nil]
  from these character(s):
    ര (#xd30) MALAYALAM LETTER RA
    ം (#xd02) MALAYALAM SIGN ANUSVARA

Again, it seems the list of characters is correct according to the text
"Composed with the following character(s) "ം"".

> And at position 872 you can see an even more stark example: instead of
> 2 characters, we show 8(!).

I don't understand where these 8 characters are coming from. This
composition of 8 characters is returned by find-composition. Maybe the
bug is in find-composition?
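
As a cross-check outside Emacs, here is a minimal Python sketch that
prints the code point and Unicode name of each character in the
four-character sequence listed above for position 411. It only
illustrates what the "from these character(s)" list contains; it says
nothing about how Emacs or the shaping engine groups those characters
into glyphs. The helper name describe_chars is made up for this
illustration.

```python
import unicodedata


def describe_chars(text):
    """Return (char, code point, Unicode name) for each character in text.

    Hypothetical helper for illustration only; it is not part of Emacs.
    """
    return [
        (ch, f"#x{ord(ch):x}", unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


# The four characters listed for position 411 of HELLO in the output above:
# SA, VIRAMA, KA, VOWEL SIGN AA.
cluster = "\u0d38\u0d4d\u0d15\u0d3e"

for ch, code, name in describe_chars(cluster):
    print(f"{ch} ({code}) {name}")
```

Running the same helper on the text between the FROM and TO positions
that find-composition reports would show whether the 8 characters at
position 872 really belong to a single composition.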