unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
From: Juri Linkov <juri@linkov.net>
To: Eli Zaretskii <eliz@gnu.org>
Cc: rms@gnu.org, 42256@debbugs.gnu.org
Subject: bug#42256: 27.0.50; composition
Date: Tue, 14 Jul 2020 03:13:12 +0300	[thread overview]
Message-ID: <874kqbar8n.fsf@mail.linkov.net> (raw)
In-Reply-To: <83sgdvmt3q.fsf@gnu.org> (Eli Zaretskii's message of "Mon, 13 Jul 2020 16:39:53 +0300")

> Why do you need that, may I ask?  Why is it important to know which
> characters were composed, and in what usage scenario?

For the same reason why there is the need to see the name of
the base character.  Displaying information about only part of
composition (only its first character) is incomplete to see
what characters participate in the composition, especially to know
the names of usually small glyphs of accents that are hardly distinguishable
when composed with the base character.

> Please also update the Emacs manual, where it describes this display,
> because now the text there is outdated.

Done.

> Actually, the results are inaccurate or even incorrect, at least in
> some cases.  Here's one case where the results are wrong:
>
>   emacs -Q
>   C-h h
>   C-u 411 M-g c
>   C-u C-x =
>
> You will see towards the end of the *help* buffer:
>
>  Composed with the following character(s) "്" using this font:
>    harfbuzz:-outline-Kartika-normal-normal-normal-serif-13-*-*-*-p-*-iso10646-1
>  by these glyphs:
>    [0 1 3384 337 12 0 12 9 0 nil]
>  from these character(s):
>    സ (#xd38) MALAYALAM LETTER SA
>    ് (#xd4d) MALAYALAM SIGN VIRAMA
>    ക (#xd15) MALAYALAM LETTER KA
>    ാ (#xd3e) MALAYALAM VOWEL SIGN AA

I tried, but got a different output:

Composed with the following character(s) "്കാ" using this font:
  ftcrhb:-PfEd-Lohit Malayalam-normal-normal-normal-*-13-*-*-*-*-0-iso10646-1
by these glyphs:
  [0 3 3384 184 14 0 15 8 5 nil]
  [0 3 3405 71 6 0 6 8 0 nil]
from these character(s):
  സ (#xd38) MALAYALAM LETTER SA
  ് (#xd4d) MALAYALAM SIGN VIRAMA
  ക (#xd15) MALAYALAM LETTER KA
  ാ (#xd3e) MALAYALAM VOWEL SIGN AA

The difference is in "Composed with the following character(s) "്കാ"
and in the rows of glyphs.  And according to the composition string "്കാ"
the list of 4 characters is correct.

> The added list of characters seems to imply that 4 characters were
> composed at buffer position 411.  But actually only the first 2 of
> them were composed, as clearly see from the line starting with
> "Composed with" above.
>
> A similar problem happens at buffer position 413 of HELLO.

Here is the output from 'C-u 413 M-g c C-u C-x =':

Composed with the following character(s) "ം" using this font:
  ftcrhb:-PfEd-Lohit Malayalam-normal-normal-normal-*-13-*-*-*-*-0-iso10646-1
by these glyphs:
  [0 1 3376 59 8 0 8 8 0 nil]
  [0 1 3330 16 7 0 8 6 0 nil]
from these character(s):
  ര (#xd30) MALAYALAM LETTER RA
  ം (#xd02) MALAYALAM SIGN ANUSVARA

Again, it seems the list of characters is correct according
to the text "Composed with the following character(s) "ം".

> And at position 872 you can see an even more stark example: instead of
> 2 characters, we show 8(!).

I don't understand where these 8 characters are coming from.
This composition of 8 characters is returned by find-composition.
Maybe the bug is in find-composition?





  reply	other threads:[~2020-07-14  0:13 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-07-08  2:41 bug#42256: 27.0.50; composition Richard Stallman
2020-07-08 14:18 ` Eli Zaretskii
2020-07-09  3:01   ` Richard Stallman
2020-07-09 17:51     ` Eli Zaretskii
2020-07-09 23:36       ` Juri Linkov
2020-07-10  6:21         ` Eli Zaretskii
2020-07-11  2:17           ` Richard Stallman
2020-07-11 23:57           ` Juri Linkov
2020-07-12 15:40             ` Eli Zaretskii
2020-07-12 23:35               ` Juri Linkov
2020-07-13  3:37                 ` Eli Zaretskii
2020-07-13 13:39                   ` Eli Zaretskii
2020-07-14  0:13                     ` Juri Linkov [this message]
2020-07-14  2:36                       ` Eli Zaretskii
2020-07-14 23:20                         ` Juri Linkov
2020-07-15 14:39                           ` Eli Zaretskii
2020-07-15 23:43                             ` Juri Linkov
2020-07-16 16:40                               ` Eli Zaretskii
2020-07-13  2:56               ` Richard Stallman
2020-07-10  3:52       ` Richard Stallman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=874kqbar8n.fsf@mail.linkov.net \
    --to=juri@linkov.net \
    --cc=42256@debbugs.gnu.org \
    --cc=eliz@gnu.org \
    --cc=rms@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).