From: Robert Pluim <rpluim@gmail.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: lists@traduction-libre.org, emacs-devel@gnu.org, juri@linkov.net
Subject: Re: Better emoji support
Date: Sun, 19 Sep 2021 20:40:11 +0200 [thread overview]
Message-ID: <87ee9kl88k.fsf@gmail.com> (raw)
In-Reply-To: <83tuig2zc7.fsf@gnu.org> (Eli Zaretskii's message of "Sun, 19 Sep 2021 21:29:44 +0300")
>>>>> On Sun, 19 Sep 2021 21:29:44 +0300, Eli Zaretskii <eliz@gnu.org> said:
>> From: Robert Pluim <rpluim@gmail.com>
>> Cc: Juri Linkov <juri@linkov.net>, lists@traduction-libre.org,
>> emacs-devel@gnu.org
>> Date: Sun, 19 Sep 2021 20:10:22 +0200
>>
Eli> Hmm... Robert, I see quite a few characters that now belong to the
Eli> emoji script, which shouldn't be there, AFAIU. The above is one of
Eli> them (AFAIK, the Arrows block doesn't belong to Emoji). But there are
Eli> more stark cases, for example:
>>
>> The whole block might not, but some of the codepoints do:
>>
>> 2194..2199 ; Emoji # E0.6 [6] (↔️..↙️) left-right arrow..down-left arrow
Eli> Only if followed by a variation selector VS-16, right?
Iʼm inclined to agree, but Iʼd have to re-read tr51, and I have a
headache. They definitely have Emoji_Presentation=No.
Eli> (aref char-script-table ?#) => emoji
Eli> (aref char-script-table ?0) => emoji
>>
>> I donʼt see that here (and itʼs definitely not the
>> intention). Blocks.awk skips any ASCII codepoints (and those both
>> evaluate to "latin" here). Could you double-check your
>> lisp/international/charscript.el?
Eli> I see them there:
Eli> (#x0023 #x0023 emoji) ; Autogenerated emoji
Eli> (#x002A #x002A emoji) ; Autogenerated emoji
Eli> (#x0030 #x0039 emoji) ; Autogenerated emoji
Eli> (#x00A9 #x00A9 emoji) ; Autogenerated emoji
Eli> (#x00AE #x00AE emoji) ; Autogenerated emoji
Eli> Which corresponds to these lines in emoji-data.txt:
Eli> 0023 ; Emoji # E0.0 [1] (#️) hash sign
Eli> 002A ; Emoji # E0.0 [1] (*️) asterisk
Eli> 0030..0039 ; Emoji # E0.0 [10] (0️..9️) digit zero..digit nine
Eli> 00A9 ; Emoji # E0.6 [1] (©️) copyright
Eli> 00AE ; Emoji # E0.6 [1] (®️) registered
Blocks.awk has this:
FILENAME ~ "emoji-data.txt" && /^00[0-9A-F]{2}.*; Emoji / {
next
}
so those should have been filtered out (this is where I learn more
about Awk incompatibilities than I care to, perhaps)
Eli> It seems like these characters ended up in the emoji script because
Eli> they should render as emoji when followed by variation selectors? But
Eli> in that case, the place to do this is in composition-function-table,
Eli> if we can, and if we cannot, let's for now decide we don't support
Eli> these sequences, because the cure sounds worse than the disease with
Eli> our current infrastructure.
>>
Eli> Am I missing something?
>>
>> Are now saying that we only want to add to the emoji script those
>> characters with Emoji_Presentation=Yes?
Eli> Yes, I think so. Are there any downsides to that?
Not that I can see. As a side effect it will fix whatever is causing
those ASCII codepoints to be treated as Emoji for you.
Robert
--
next prev parent reply other threads:[~2021-09-19 18:40 UTC|newest]
Thread overview: 73+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-07-10 9:19 Better emoji support Eli Zaretskii
2021-07-10 9:24 ` Robert Pluim
2021-07-10 9:37 ` Eli Zaretskii
2021-07-10 9:45 ` Robert Pluim
2021-07-10 9:48 ` Eli Zaretskii
2021-07-12 8:27 ` Robert Pluim
2021-07-12 14:02 ` Robert Pluim
2021-07-12 15:38 ` Robert Pluim
2021-07-12 15:59 ` Eli Zaretskii
2021-07-13 7:00 ` Jean Louis
2021-07-13 7:50 ` Robert Pluim
2021-07-15 15:15 ` Robert Pluim
2021-07-15 15:44 ` Eli Zaretskii
2021-09-13 12:36 ` Eli Zaretskii
2021-09-13 15:44 ` Robert Pluim
2021-09-13 16:07 ` Eli Zaretskii
2021-09-13 16:36 ` Jean-Christophe Helary
2021-09-17 13:01 ` Robert Pluim
2021-09-17 13:42 ` Jean-Christophe Helary
2021-09-17 13:50 ` Robert Pluim
2021-09-17 15:51 ` Kévin Le Gouguec
2021-09-17 16:43 ` Robert Pluim
2021-09-17 19:34 ` Kévin Le Gouguec
2021-09-19 18:27 ` Robert Pluim
2021-09-19 19:43 ` Kévin Le Gouguec
2021-09-20 4:23 ` Eli Zaretskii
2021-09-20 6:20 ` Kévin Le Gouguec
2021-09-20 6:53 ` Eli Zaretskii
2021-09-20 8:40 ` Robert Pluim
2021-09-20 9:53 ` Eli Zaretskii
2021-09-20 13:03 ` Robert Pluim
2021-09-20 13:15 ` Eli Zaretskii
2021-09-20 13:25 ` Eli Zaretskii
2021-09-20 13:50 ` Robert Pluim
2021-09-20 15:27 ` Eli Zaretskii
2021-09-20 17:32 ` Robert Pluim
2021-09-20 18:54 ` Eli Zaretskii
2021-09-20 19:30 ` Robert Pluim
2021-09-20 19:42 ` Eli Zaretskii
2021-09-20 20:05 ` Robert Pluim
2021-09-20 13:40 ` Robert Pluim
2021-09-20 13:45 ` Eli Zaretskii
2021-09-17 15:58 ` Jean-Christophe Helary
2021-09-17 16:35 ` Daniel Martín
2021-09-17 16:52 ` Robert Pluim
2021-09-17 15:39 ` Stephen Berman
2021-09-17 16:01 ` Eli Zaretskii
2021-09-17 16:34 ` Stephen Berman
2021-09-17 16:58 ` Robert Pluim
2021-09-17 17:05 ` Stephen Berman
2021-09-17 17:10 ` Robert Pluim
2021-09-17 17:17 ` Stephen Berman
2021-09-17 17:37 ` Better emoji support, " Robert Pluim
2021-09-17 17:49 ` Stephen Berman
2021-09-17 17:51 ` Robert Pluim
2021-09-17 18:53 ` martin rudalics
2021-09-17 18:58 ` Robert Pluim
2021-09-17 19:44 ` Stephen Berman
2021-09-17 18:53 ` martin rudalics
2021-09-17 16:55 ` martin rudalics
2021-09-17 18:48 ` Eli Zaretskii
2021-09-17 18:59 ` Robert Pluim
2021-09-18 5:39 ` Eli Zaretskii
2021-09-18 6:25 ` Eli Zaretskii
2021-09-19 16:09 ` Juri Linkov
2021-09-19 17:16 ` Kévin Le Gouguec
2021-09-19 18:20 ` Robert Pluim
2021-09-19 19:13 ` Robert Pluim
2021-09-19 17:24 ` Eli Zaretskii
2021-09-19 18:10 ` Robert Pluim
2021-09-19 18:29 ` Eli Zaretskii
2021-09-19 18:40 ` Robert Pluim [this message]
2021-09-19 18:34 ` Eli Zaretskii
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/emacs/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87ee9kl88k.fsf@gmail.com \
--to=rpluim@gmail.com \
--cc=eliz@gnu.org \
--cc=emacs-devel@gnu.org \
--cc=juri@linkov.net \
--cc=lists@traduction-libre.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/emacs.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).