From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Lars Ingebrigtsen Newsgroups: gmane.emacs.devel Subject: Re: Entering emojis Date: Tue, 26 Oct 2021 19:58:28 +0200 Message-ID: <87r1c7ekhn.fsf@gnus.org> References: <87cznths5j.fsf@gnus.org> <83zgqxymd3.fsf@gnu.org> <878rygj4gt.fsf@gnus.org> <83wnm0zz0q.fsf@gnu.org> <874k94j3rn.fsf@gnus.org> <83v91kzydh.fsf@gnu.org> <87tuh4holf.fsf@gnus.org> <822aec9d01909cecfc6c@heytings.org> <87a6iwhltf.fsf@gnus.org> <83tuh4zfg5.fsf@gnu.org> <87y26gfobr.fsf@gnus.org> <87tuh4f1ie.fsf@gnus.org> <87lf2fg44h.fsf@gnus.org> <87h7d3g2uu.fsf@gnus.org> <83bl3bybm3.fsf@gnu.org> <878ryfr9w0.fsf@gmail.com> <878ryfg07k.fsf@gnus.org> <874k93r869.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="38011"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux) Cc: Eli Zaretskii , stefankangas@gmail.com, gregory@heytings.org, emacs-devel@gnu.org To: Robert Pluim Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Tue Oct 26 20:21:29 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mfR4j-0009fi-9S for ged-emacs-devel@m.gmane-mx.org; Tue, 26 Oct 2021 20:21:29 +0200 Original-Received: from localhost ([::1]:52916 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mfR4h-0000tA-Ci for ged-emacs-devel@m.gmane-mx.org; Tue, 26 Oct 2021 14:21:27 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:52056) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mfQig-0006jv-2M for emacs-devel@gnu.org; Tue, 26 Oct 2021 13:58:42 -0400 Original-Received: from quimby.gnus.org ([2a01:4f9:2b:f0f::2]:59902) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mfQia-0004wy-QW; Tue, 26 Oct 2021 13:58:40 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Transfer-Encoding:Content-Type:MIME-Version:Message-ID :In-Reply-To:Date:References:Subject:Cc:To:From:Sender:Reply-To:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=GrvytzuLclKwf1ef1ZgiPyUg++7eBU1sdgau9sXttHg=; b=o9h+8gO3PDmaPgsTD1hE8qK6xj ve1/cMb1twz9ptuTKlVkMdvz5AbGbKibfJvB1G66ViD/41kz2UBkzexNLlMXUDRAZ3dOL14naSFIU 5IdoQMGYxVLBGKkC3dgFgp/i3aW38ZHzZbVQgOHd/NDtN8zbe5LwAQm/kiZ7UbPyhl50=; Original-Received: from [84.212.220.105] (helo=elva) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mfQiV-0006Ah-GY; Tue, 26 Oct 2021 19:58:33 +0200 Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAABGdBTUEAALGPC/xhBQAAACBj SFJNAAB6JgAAgIQAAPoAAACA6AAAdTAAAOpgAAA6mAAAF3CculE8AAAAElBMVEXv7vKlobF4dIlT UGIWEhj///+hG8C1AAAAAWJLR0QF+G/pxwAAAAd0SU1FB+UKGhEbFSZGaf0AAAGCSURBVDjLdZOL ccQgDETtSwMGpwFJacASDcTQf01Z/sdlzNwcHh67AiFtWxueLaQ61GFsA4i19WSUwe7qmIIVvAlS pGl1eElz3BO4BSQawHN4B9cTQJQVqFnET20q2mFjSiFE02OCtFhNRXgA+wruAV4rMHqy0ofgylcL wR/Auw5Wq6RHs5JVcbsBVkVf/gRT8Aay5/VPgRwqplkiANa243z3GziDxRSDFvA7gYMCz0BS3GYI b2bKHNzZXrwTbEbafHTM8IujRl54Z2Mnt0cRWcilWyVfygAsKBpRb6UyCzmZ2ZQClVrVCbCbBSpH B4qbV8D4c1lCNMCuHWQnm1YvZc+sdhVQu6MDWIuVlTNVyVFPm4Go5BmXda0HcVoC4Dw5aspjgHZM RC/BMsBe8srUulSstzMSCyBKQ1IVeyz3YO1e0sBXNKmgSJCd2ufbNzKbNVTSt8O4BNm2H3wWcOaN ISI7VkAYAC5BueVg262DnMl8IXjlPt/Ro1LPle/oy8sgyB9lWZ1P4RWxFwAAACV0RVh0ZGF0ZTpj cmVhdGUAMjAyMS0xMC0yNlQxNzoyNzoyMSswMDowMEfgiQgAAAAldEVYdGRhdGU6bW9kaWZ5ADIw MjEtMTAtMjZUMTc6Mjc6MjErMDA6MDA2vTG0AAAAAElFTkSuQmCC X-Now-Playing: Hermine's _Who'll Come Walking?_: "America (Studio Version)" In-Reply-To: <874k93r869.fsf@gmail.com> (Robert Pluim's message of "Tue, 26 Oct 2021 19:46:06 +0200") Received-SPF: pass client-ip=2a01:4f9:2b:f0f::2; envelope-from=larsi@gnus.org; helo=quimby.gnus.org X-Spam_score_int: -43 X-Spam_score: -4.4 X-Spam_bar: ---- X-Spam_report: (-4.4 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:277914 Archived-At: Robert Pluim writes: > If you want them to display correctly in Emacs you=CA=BCre going to have = to > use the sequences in emoji-zwj-sequences.txt and emoji-sequences.txt I'm using the labels file, because it had better taxonomy... > The field in emoji-zwj-sequences.txt is called 'descriptions', and I > think they=CA=BCre not intended to be normative in any way, so probably > best to use the base codepoint. Right... So in emoji-sequences, we have 1F46E..1F4AC ; Basic_Emoji ; police officer = # E0.6 [63] (=F0=9F=91=AE..=F0=9F=92=AC) and then 1F46E 1F3FB ; RGI_Emoji_Modifier_Sequence ; police officer: light skin t= one # E1.0 [1] (=F0=9F=91=AE=F0=9F=8F=BB) 1F46E 1F3FC ; RGI_Emoji_Modifier_Sequence ; police officer: medium-light= skin tone # E1.0 [1] (=F0=9F=91=AE=F0=9F=8F=BC) 1F46E 1F3FD ; RGI_Emoji_Modifier_Sequence ; police officer: medium skin = tone # E1.0 [1] (=F0=9F=91=AE=F0=9F=8F=BD) and then in the zwj file we have 1F46E 200D 2640 FE0F ; RGI_Emoji_ZWJ_Sequence ; wom= an police officer # E4.0 [1] (= =F0=9F=91=AE=E2=80=8D=E2=99=80=EF=B8=8F) 1F46E 200D 2642 FE0F ; RGI_Emoji_ZWJ_Sequence ; man= police officer # E4.0 [1] (= =F0=9F=91=AE=E2=80=8D=E2=99=82=EF=B8=8F) 1F46E 1F3FB 200D 2640 FE0F ; RGI_Emoji_ZWJ_Sequence ; wom= an police officer: light skin tone # E4.0 [1] (= =F0=9F=91=AE=F0=9F=8F=BB=E2=80=8D=E2=99=80=EF=B8=8F) 1F46E 1F3FB 200D 2642 FE0F ; RGI_Emoji_ZWJ_Sequence ; man= police officer: light skin tone # E4.0 [1] (= =F0=9F=91=AE=F0=9F=8F=BB=E2=80=8D=E2=99=82=EF=B8=8F) So that all matches up, and I can go from 1F46E and get all the variants. Seems very promising; I'll give it a go. But... I can't go from "woman police officer" to "woman police officer: light skin tone" by looking at the first code point. Er... what's the rule here, then? 1F46E plus 2640 as the next-to-last code point? --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no