From: Stephen Berman <stephen.berman@gmx.net>
To: Eli Zaretskii <eliz@gnu.org>
Cc: larsi@gnus.org, cedric.chepied@gmail.com, 14461@debbugs.gnu.org
Subject: bug#14461: 24.3.50; bad display for 'space' + (U+0336) unicode combination
Date: Sat, 17 Aug 2019 16:40:44 +0200 [thread overview]
Message-ID: <87o90nkg7n.fsf@gmx.net> (raw)
In-Reply-To: <8336hzop4a.fsf@gnu.org> (Eli Zaretskii's message of "Sat, 17 Aug 2019 17:14:45 +0300")
On Sat, 17 Aug 2019 17:14:45 +0300 Eli Zaretskii <eliz@gnu.org> wrote:
>> From: Stephen Berman <stephen.berman@gmx.net>
>> Cc: Kenichi Handa <handa@gnu.org>, cedric.chepied@gmail.com,
>> 14461@debbugs.gnu.org, larsi@gnus.org
>> Date: Sat, 17 Aug 2019 15:50:20 +0200
>>
>> Hm, I chose COMBINING ACUTE ACCENT and COMBINING CIRCUMFLEX ACCENT more
>> or less at random, but I do indeed see the sequence 'aU+0301U+0302' as
>> two grapheme clusters (also with -Q): 'a' with an acute accent over it
>> followed by a circumflex. In contrast, the sequences 'aU+0301U+0317'
>> and 'aU+0302U+0317' are displayed as single grapheme clusters (317 is
>> COMBINING ACUTE ACCENT BELOW). I also noticed that the seqence
>> '-U+0301U+0302' is displayed as a dash followed by a single grapheme
>> cluster of an acute accent and a circumflex; this holds for all
>> nonalphabetic ASCII characters I tried and for some but not all
>> non-ASCII alphabetic characters. So there seems to be some
>> inconsistency in the display of combining characters.
>
> Is this in Emacs 27 built with HarfBuzz support?
Yes (both --with-cairo and without).
> If so, I think this
> just means that the default font you use doesn't support these
> combining accents, because on my system I see a single grapheme
> cluster in both of the above cases, when I select a suitable font.
My default font is DejaVu Sans Mono, but it seems there's something else
at play here: in contrast to 'aU+0301U+0302', I do see the sequence
'bU+0301U+0302' as a single grapheme cluster. Maybe the difference is
because there is a glyph for 'a' with an acute accent and it doesn't
support further combining. (But I have no idea if that makes sense.)
Here's what describe-char shows on both:
________________________________________________________________________
position: 1 of 7 (0%), column: 0
character: a (displayed as a) (codepoint 97, #o141, #x61)
charset: ascii (ASCII (ISO646 IRV))
code point in charset: 0x61
script: latin
syntax: w which means: word
category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman
to input: type "C-x 8 RET 61" or "C-x 8 RET LATIN SMALL LETTER A"
buffer code: #x61
file code: #x61 (encoded by coding system utf-8-unix)
display: composed to form "á̂" (see below)
Composed with the following character(s) "́̂" using this font:
xfthb:-PfEd-DejaVu Sans Mono-normal-normal-normal-*-15-*-*-*-m-0-iso10646-1
by these glyphs:
[0 2 97 163 9 0 8 12 0 nil]
[0 2 769 650 9 2 7 12 -9 [0 0 0]]
Character code properties: customize what to show
name: LATIN SMALL LETTER A
general-category: Ll (Letter, Lowercase)
decomposition: (97) ('a')
________________________________________________________________________
position: 5 of 7 (57%), column: 0
character: b (displayed as b) (codepoint 98, #o142, #x62)
charset: ascii (ASCII (ISO646 IRV))
code point in charset: 0x62
script: latin
syntax: w which means: word
category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman
to input: type "C-x 8 RET 62" or "C-x 8 RET LATIN SMALL LETTER B"
buffer code: #x62
file code: #x62 (encoded by coding system utf-8-unix)
display: composed to form "b́̂" (see below)
Composed with the following character(s) "́̂" using this font:
xfthb:-PfEd-DejaVu Sans Mono-normal-normal-normal-*-15-*-*-*-m-0-iso10646-1
by these glyphs:
[0 2 98 69 9 1 9 11 0 nil]
[0 2 769 649 9 3 7 12 -9 [-9 -3 0]]
[0 2 770 650 9 2 7 12 -9 [-9 -3 0]]
Character code properties: customize what to show
name: LATIN SMALL LETTER B
general-category: Ll (Letter, Lowercase)
decomposition: (98) ('b')
next prev parent reply other threads:[~2019-08-17 14:40 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-05-24 14:30 bug#14461: 24.3.50; bad display for 'space' + (U+0336) unicode combination Cédric Chépied
2019-08-15 4:50 ` Lars Ingebrigtsen
2019-08-15 9:01 ` Stephen Berman
2019-08-15 10:02 ` Cédric Chépied
2019-08-15 12:29 ` Stephen Berman
2019-08-16 1:03 ` Lars Ingebrigtsen
2019-08-16 6:55 ` Eli Zaretskii
2019-08-17 12:00 ` Eli Zaretskii
2019-08-17 13:50 ` Stephen Berman
2019-08-17 14:14 ` Eli Zaretskii
2019-08-17 14:40 ` Stephen Berman [this message]
2019-08-17 15:09 ` Eli Zaretskii
2019-08-17 15:39 ` Stephen Berman
2019-08-17 15:44 ` Eli Zaretskii
2019-08-17 17:05 ` Stephen Berman
2019-08-17 17:29 ` Eli Zaretskii
2019-08-17 18:11 ` Stephen Berman
2019-08-17 18:22 ` Eli Zaretskii
2019-08-17 18:58 ` Stephen Berman
2019-09-07 9:21 ` Eli Zaretskii
2019-08-15 14:48 ` Eli Zaretskii
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87o90nkg7n.fsf@gmx.net \
--to=stephen.berman@gmx.net \
--cc=14461@debbugs.gnu.org \
--cc=cedric.chepied@gmail.com \
--cc=eliz@gnu.org \
--cc=larsi@gnus.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.