From: Philipp <p.stephani2@gmail.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: help-gnu-emacs@gnu.org
Subject: Re: Display of decomposed characters
Date: Sun, 28 Feb 2021 19:10:57 +0100 [thread overview]
Message-ID: <0077B374-A65D-412D-B1A5-4ADDD50D41A7@gmail.com> (raw)
In-Reply-To: <831rea3ymg.fsf@gnu.org>
> Am 24.01.2021 um 20:48 schrieb Eli Zaretskii <eliz@gnu.org>:
>
>> From: Philipp Stephani <p.stephani2@gmail.com>
>> Date: Sun, 24 Jan 2021 19:58:18 +0100
>> Cc: help-gnu-emacs <help-gnu-emacs@gnu.org>
>>
>>> If the default font supports the diaeresis, that will happen
>>> automatically. If not, then simply don't choose the default font that
>>> doesn't support accents.
>>
>> The font will always support the composite variant (because it's part
>> of Latin-1).
>
> That is only relevant if Emacs decides to compose the characters.
> Then, and only then, will it ask the text-shaping engine to produce
> glyphs for the base character and the accent together, and then the
> font could provide a single precomposed glyph for them.
So in this case the decision to not compose the characters is incorrect or happens too early?
>
>> I guess fonts assume that applications will first try to normalize
>> strings to avoid issues like this?
>
> Normalizing strings before you know whether the font has the
> precomposed glyphs makes no sense.
Why? If the font doesn’t support a precomposed character, wouldn’t the rendering engine automatically fall back to a decomposed representation? (Serious question; I don’t know whether Harfbuzz does that.) IOW, would normalizing strings to NFC before sending them to the rendering engine ever break anything?
>
> What the text-shaping folks tell us is that we should pass _all_ the
> text through the text shaper, then the shaper will DTRT in every
> case. But this would mean a thorough redesign and reimplementation of
> how we do that in Emacs, and that is not easy if we want to keep the
> current flexibility and customizability (which is why the character
> composition code calls out to Lisp, and that makes sending all the
> text that way tool expensive to be practical).
Would it be possible to implement a more minimal change to fix the problem at hand?
>
>> Does it ever make sense to pick different fonts for a base character
>> and its combining characters?
>
> If the default font doesn't support the combining accent, what else
> can you do? Most fonts don't have precomposed glyphs for every
> arbitrary sequence of base character followed by several combining
> accents. So sometimes you will have to compose the accents "by hand",
> and that is not really possible if they come from different fonts.
Which is why they shouldn’t come from different fonts. What if Emacs ignored font lookup for combining characters and always picked the font of the previous base character?
>
>> Wouldn't that fundamentally prevent using combining characters? IIUC
>> text rendering engines should be able to pick the right glyph if
>> that didn't happen (assuming they can perform Unicode
>> normalization).
>
> Unicode normalization is only tangentially relevant here.
>
Sure, but in this case it would fix them problem AFICS.
next prev parent reply other threads:[~2021-02-28 18:10 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-12-23 10:05 Display of decomposed characters Philipp Stephani
2020-12-23 13:00 ` Janusz S. Bień
2020-12-23 15:44 ` Eli Zaretskii
2020-12-25 17:14 ` Philipp Stephani
2020-12-25 19:01 ` Eli Zaretskii
2021-01-24 18:58 ` Philipp Stephani
2021-01-24 19:48 ` Eli Zaretskii
2021-01-24 19:57 ` Eli Zaretskii
2021-02-28 18:10 ` Philipp [this message]
2021-02-28 18:42 ` Eli Zaretskii
2021-03-18 14:16 ` Philipp
2021-03-18 14:37 ` Philipp
2021-03-18 15:01 ` Eli Zaretskii
2021-03-19 16:37 ` Philipp
2021-03-19 16:44 ` Eli Zaretskii
2021-03-21 11:43 ` Philipp
2021-03-21 12:10 ` Eli Zaretskii
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/emacs/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=0077B374-A65D-412D-B1A5-4ADDD50D41A7@gmail.com \
--to=p.stephani2@gmail.com \
--cc=eliz@gnu.org \
--cc=help-gnu-emacs@gnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).