unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
From: Robert Pluim <rpluim@gmail.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: lists@traduction-libre.org, emacs-devel@gnu.org, juri@linkov.net
Subject: Re: Better emoji support
Date: Sun, 19 Sep 2021 20:40:11 +0200	[thread overview]
Message-ID: <87ee9kl88k.fsf@gmail.com> (raw)
In-Reply-To: <83tuig2zc7.fsf@gnu.org> (Eli Zaretskii's message of "Sun, 19 Sep 2021 21:29:44 +0300")

>>>>> On Sun, 19 Sep 2021 21:29:44 +0300, Eli Zaretskii <eliz@gnu.org> said:

    >> From: Robert Pluim <rpluim@gmail.com>
    >> Cc: Juri Linkov <juri@linkov.net>,  lists@traduction-libre.org,
    >> emacs-devel@gnu.org
    >> Date: Sun, 19 Sep 2021 20:10:22 +0200
    >> 
    Eli> Hmm...  Robert, I see quite a few characters that now belong to the
    Eli> emoji script, which shouldn't be there, AFAIU.  The above is one of
    Eli> them (AFAIK, the Arrows block doesn't belong to Emoji).  But there are
    Eli> more stark cases, for example:
    >> 
    >> The whole block might not, but some of the codepoints do:
    >> 
    >> 2194..2199    ; Emoji                # E0.6   [6] (↔️..↙️)    left-right arrow..down-left arrow

    Eli> Only if followed by a variation selector VS-16, right?

Iʼm inclined to agree, but Iʼd have to re-read tr51, and I have a
headache. They definitely have Emoji_Presentation=No.

    Eli> (aref char-script-table ?#) => emoji
    Eli> (aref char-script-table ?0) => emoji
    >> 
    >> I donʼt see that here (and itʼs definitely not the
    >> intention). Blocks.awk skips any ASCII codepoints (and those both
    >> evaluate to "latin" here). Could you double-check your
    >> lisp/international/charscript.el?

    Eli> I see them there:

    Eli>     (#x0023 #x0023 emoji) ; Autogenerated emoji
    Eli>     (#x002A #x002A emoji) ; Autogenerated emoji
    Eli>     (#x0030 #x0039 emoji) ; Autogenerated emoji
    Eli>     (#x00A9 #x00A9 emoji) ; Autogenerated emoji
    Eli>     (#x00AE #x00AE emoji) ; Autogenerated emoji

    Eli> Which corresponds to these lines in emoji-data.txt:

    Eli>   0023          ; Emoji                # E0.0   [1] (#️)       hash sign
    Eli>   002A          ; Emoji                # E0.0   [1] (*️)       asterisk
    Eli>   0030..0039    ; Emoji                # E0.0  [10] (0️..9️)    digit zero..digit nine
    Eli>   00A9          ; Emoji                # E0.6   [1] (©️)       copyright
    Eli>   00AE          ; Emoji                # E0.6   [1] (®️)       registered

Blocks.awk has this:

FILENAME ~ "emoji-data.txt" && /^00[0-9A-F]{2}.*; Emoji / {
    next
}

so those should have been filtered out (this is where I learn more
about Awk incompatibilities than I care to, perhaps)

    Eli> It seems like these characters ended up in the emoji script because
    Eli> they should render as emoji when followed by variation selectors?  But
    Eli> in that case, the place to do this is in composition-function-table,
    Eli> if we can, and if we cannot, let's for now decide we don't support
    Eli> these sequences, because the cure sounds worse than the disease with
    Eli> our current infrastructure.
    >> 
    Eli> Am I missing something?
    >> 
    >> Are now saying that we only want to add to the emoji script those
    >> characters with Emoji_Presentation=Yes?

    Eli> Yes, I think so.  Are there any downsides to that?

Not that I can see. As a side effect it will fix whatever is causing
those ASCII codepoints to be treated as Emoji for you.

Robert
-- 



  reply	other threads:[~2021-09-19 18:40 UTC|newest]

Thread overview: 73+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-07-10  9:19 Better emoji support Eli Zaretskii
2021-07-10  9:24 ` Robert Pluim
2021-07-10  9:37   ` Eli Zaretskii
2021-07-10  9:45     ` Robert Pluim
2021-07-10  9:48       ` Eli Zaretskii
2021-07-12  8:27         ` Robert Pluim
2021-07-12 14:02           ` Robert Pluim
2021-07-12 15:38             ` Robert Pluim
2021-07-12 15:59               ` Eli Zaretskii
2021-07-13  7:00                 ` Jean Louis
2021-07-13  7:50                   ` Robert Pluim
2021-07-15 15:15                 ` Robert Pluim
2021-07-15 15:44                   ` Eli Zaretskii
2021-09-13 12:36         ` Eli Zaretskii
2021-09-13 15:44           ` Robert Pluim
2021-09-13 16:07             ` Eli Zaretskii
2021-09-13 16:36             ` Jean-Christophe Helary
2021-09-17 13:01               ` Robert Pluim
2021-09-17 13:42                 ` Jean-Christophe Helary
2021-09-17 13:50                   ` Robert Pluim
2021-09-17 15:51                     ` Kévin Le Gouguec
2021-09-17 16:43                       ` Robert Pluim
2021-09-17 19:34                         ` Kévin Le Gouguec
2021-09-19 18:27                           ` Robert Pluim
2021-09-19 19:43                             ` Kévin Le Gouguec
2021-09-20  4:23                               ` Eli Zaretskii
2021-09-20  6:20                                 ` Kévin Le Gouguec
2021-09-20  6:53                                   ` Eli Zaretskii
2021-09-20  8:40                                     ` Robert Pluim
2021-09-20  9:53                                       ` Eli Zaretskii
2021-09-20 13:03                                         ` Robert Pluim
2021-09-20 13:15                                           ` Eli Zaretskii
2021-09-20 13:25                                             ` Eli Zaretskii
2021-09-20 13:50                                               ` Robert Pluim
2021-09-20 15:27                                                 ` Eli Zaretskii
2021-09-20 17:32                                                   ` Robert Pluim
2021-09-20 18:54                                                     ` Eli Zaretskii
2021-09-20 19:30                                                       ` Robert Pluim
2021-09-20 19:42                                                         ` Eli Zaretskii
2021-09-20 20:05                                                           ` Robert Pluim
2021-09-20 13:40                                             ` Robert Pluim
2021-09-20 13:45                                               ` Eli Zaretskii
2021-09-17 15:58                     ` Jean-Christophe Helary
2021-09-17 16:35                     ` Daniel Martín
2021-09-17 16:52                       ` Robert Pluim
2021-09-17 15:39                 ` Stephen Berman
2021-09-17 16:01                   ` Eli Zaretskii
2021-09-17 16:34                     ` Stephen Berman
2021-09-17 16:58                       ` Robert Pluim
2021-09-17 17:05                         ` Stephen Berman
2021-09-17 17:10                           ` Robert Pluim
2021-09-17 17:17                             ` Stephen Berman
2021-09-17 17:37                               ` Better emoji support, " Robert Pluim
2021-09-17 17:49                                 ` Stephen Berman
2021-09-17 17:51                                   ` Robert Pluim
2021-09-17 18:53                                     ` martin rudalics
2021-09-17 18:58                                       ` Robert Pluim
2021-09-17 19:44                                         ` Stephen Berman
2021-09-17 18:53                                 ` martin rudalics
2021-09-17 16:55                     ` martin rudalics
2021-09-17 18:48                 ` Eli Zaretskii
2021-09-17 18:59                   ` Robert Pluim
2021-09-18  5:39                     ` Eli Zaretskii
2021-09-18  6:25                 ` Eli Zaretskii
2021-09-19 16:09                 ` Juri Linkov
2021-09-19 17:16                   ` Kévin Le Gouguec
2021-09-19 18:20                     ` Robert Pluim
2021-09-19 19:13                       ` Robert Pluim
2021-09-19 17:24                   ` Eli Zaretskii
2021-09-19 18:10                     ` Robert Pluim
2021-09-19 18:29                       ` Eli Zaretskii
2021-09-19 18:40                         ` Robert Pluim [this message]
2021-09-19 18:34                       ` Eli Zaretskii

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87ee9kl88k.fsf@gmail.com \
    --to=rpluim@gmail.com \
    --cc=eliz@gnu.org \
    --cc=emacs-devel@gnu.org \
    --cc=juri@linkov.net \
    --cc=lists@traduction-libre.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).