From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Robert Pluim Newsgroups: gmane.emacs.devel Subject: Re: Entering emojis Date: Tue, 26 Oct 2021 19:09:03 +0200 Message-ID: <878ryfr9w0.fsf@gmail.com> References: <87cznths5j.fsf@gnus.org> <83zgqxymd3.fsf@gnu.org> <878rygj4gt.fsf@gnus.org> <83wnm0zz0q.fsf@gnu.org> <874k94j3rn.fsf@gnus.org> <83v91kzydh.fsf@gnu.org> <87tuh4holf.fsf@gnus.org> <822aec9d01909cecfc6c@heytings.org> <87a6iwhltf.fsf@gnus.org> <83tuh4zfg5.fsf@gnu.org> <87y26gfobr.fsf@gnus.org> <87tuh4f1ie.fsf@gnus.org> <87lf2fg44h.fsf@gnus.org> <87h7d3g2uu.fsf@gnus.org> <83bl3bybm3.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="35763"; mail-complaints-to="usenet@ciao.gmane.io" Cc: Lars Ingebrigtsen , emacs-devel@gnu.org, gregory@heytings.org, stefankangas@gmail.com To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Tue Oct 26 19:10:29 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mfPy1-00097L-Pw for ged-emacs-devel@m.gmane-mx.org; Tue, 26 Oct 2021 19:10:29 +0200 Original-Received: from localhost ([::1]:44226 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mfPy0-00039J-To for ged-emacs-devel@m.gmane-mx.org; Tue, 26 Oct 2021 13:10:28 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:42916) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mfPwj-0001mQ-DF for emacs-devel@gnu.org; Tue, 26 Oct 2021 13:09:10 -0400 Original-Received: from mail-wm1-x32f.google.com ([2a00:1450:4864:20::32f]:54892) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mfPwh-0007Nd-HK; Tue, 26 Oct 2021 13:09:09 -0400 Original-Received: by mail-wm1-x32f.google.com with SMTP id g141so115710wmg.4; Tue, 26 Oct 2021 10:09:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:references:gmane-reply-to-list:date:in-reply-to :message-id:mime-version:content-transfer-encoding; bh=rU1Zt1iqTgJZI61fzri6GAXV00hyxp1gB7DXwpJQUZU=; b=lszaWvE4Uc8RHsOJKDhJ/3eNn6K38Y8dy6icXfWJBbU4y/rrxz4hkGLZNj0K+PUY+i 4Najn0n3eh7vfmzlFMCiIST1zILxW4xDqxePCbvPK85v6rDgUPyEDRJa2hWFdLXFcRwz +JVjHbpRKCMq5ULTA4utynd3V3vLMzX/n/hD9/9RQ0c1SH3x9Mt+kONS0M5MrZuWMoEY qMTsD2FvgJ6gd2cI70a+MaOA4olcDIA63RLmfSsiqrJw6HD2VtuMiZ274j1nDFO0CWcM pMKm4SLkiJbYdUQjE/PivxVtiy2ivRI+ViWiYpuQzvnnsxPOOitHq4d/QzeE3lIFrA8L rong== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:references :gmane-reply-to-list:date:in-reply-to:message-id:mime-version :content-transfer-encoding; bh=rU1Zt1iqTgJZI61fzri6GAXV00hyxp1gB7DXwpJQUZU=; b=UeFvU790NoEajjFPd11uinMxZdgyIp9hvIat2KLPO4QzkpbTbkL0O/qr4Kt4g1c8ns eoy6CP8dlyTENGmr2bNrTFIOXX8IY2aK6iTnyrbT4foUbunTTqiMb3z2vCj7FNRu9UQy B3ufxt3OR2B7OJLxFJNSGtq+5wZ48wuweJup0VjdaUhf+mdK0lKZaHIRrfieJfLCJwT1 qxDnS4NQZdIBCI+KTiMBL06/t++vo6vzzB3t/XU5RScMRJ0ktIPaDYXF+N0oEnMEtBdM AIjPKl3JcFj1zN5CYYY2m9kUVIUOgs+0kg2LyCe4RBuXA3AHMLEFsiW4m7P6BnSKImgo Jhbw== X-Gm-Message-State: AOAM532lX45SCn9IjBl8MdLxwlvm9JUkm2c3qbv+k20+UQ5MfkerZLGX 5yWOF6exHUhXTh1nntzKZUfKSVNa4Cs= X-Google-Smtp-Source: ABdhPJzmhPN1gwPvPdOj8yCTDaeRXMRjY/PbKO27UKi07k6V+bS4Yf/vefY0zhKs1jzNbpsrNHdscg== X-Received: by 2002:a05:600c:4fc5:: with SMTP id o5mr7674625wmq.147.1635268145192; Tue, 26 Oct 2021 10:09:05 -0700 (PDT) Original-Received: from rltb ([82.66.8.55]) by smtp.gmail.com with ESMTPSA id s11sm11521115wrt.60.2021.10.26.10.09.04 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 26 Oct 2021 10:09:04 -0700 (PDT) Gmane-Reply-To-List: yes In-Reply-To: <83bl3bybm3.fsf@gnu.org> (Eli Zaretskii's message of "Tue, 26 Oct 2021 19:49:56 +0300") Received-SPF: pass client-ip=2a00:1450:4864:20::32f; envelope-from=rpluim@gmail.com; helo=mail-wm1-x32f.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:277900 Archived-At: >>>>> On Tue, 26 Oct 2021 19:49:56 +0300, Eli Zaretskii said: >> From: Lars Ingebrigtsen >> Cc: Eli Zaretskii , Gregory Heytings , >> Emacs developers >> Date: Tue, 26 Oct 2021 18:36:25 +0200 >>=20 >> So I'm trying to figure out how this all maps up. >>=20 >> In the labels file, we have (for instance) =F0=9F=91=AE=E2=80=8D=E2= =99=82=EF=B8=8F (a male police officer). >> I can find that glyph in emoji-zwj-sequences: >>=20 >> 1F46E 200D 2642 FE0F ; man police officer = # E4.0 [1] (=F0=9F=91=AE=E2=80= =8D=E2=99=82=EF=B8=8F) >> 1F46E 1F3FB 200D 2640 FE0F ; woman police officer= : light skin tone # E4.0 [1] (=F0=9F=91=AE=F0=9F= =8F=BB=E2=80=8D=E2=99=80=EF=B8=8F) >> 1F46E 1F3FB 200D 2642 FE0F ; man police officer: = light skin tone # E4.0 [1] (=F0=9F=91=AE=F0=9F= =8F=BB=E2=80=8D=E2=99=82=EF=B8=8F) >> 1F46E 1F3FC 200D 2640 FE0F ; woman police officer= : medium-light skin tone # E4.0 [1] (=F0=9F=91=AE=F0=9F= =8F=BC=E2=80=8D=E2=99=80=EF=B8=8F) >> 1F46E 1F3FC 200D 2642 FE0F ; man police officer: = medium-light skin tone # E4.0 [1] (=F0=9F=91=AE=F0=9F= =8F=BC=E2=80=8D=E2=99=82=EF=B8=8F) >> 1F46E 1F3FD 200D 2640 FE0F ; woman police officer= : medium skin tone # E4.0 [1] (=F0=9F=91=AE=F0=9F= =8F=BD=E2=80=8D=E2=99=80=EF=B8=8F) >>=20 >> etc. But there's no mapping from that glyph to these other ones exc= ept >> by ... being in the vicinity... and the "woman" forms aren't varian= ts. >> Hm... >>=20 >> Aha! common/annotationsDerived/en.xml has >>=20 >> man police officer: light skin tone >> man police officer: medium-light skin tone >> man police officer: medium skin tone >>=20 >> So I can find "man police officer" in the sequences file, and then g= et >> the derivations from that XML file? Geez. Well, that sounds doable, >> and I hope that those names for the glyphs are the same in both file= s. >> :-/ Eli> I don't think I understand the problem. The first 2 codepoints ar= e in Eli> admin/unidata/emoji-sequences.txt, and the gender thingy is what Eli> determines if its "man" or "woman". VS-16 is a no-op, and I'm not Eli> even sure you should produce it in these sequences. It is only ne= eded Eli> when the original character is not an emoji. It=CA=BCs not a no-op: it modifies U+2640 or U+2642 I=CA=BCm not sure I understand the issue either: the base codepoint is U+1F46E, and emoji-zwj-sequences tells you what the sequences are. What else is needed? Robert --=20