From: Eli Zaretskii
Newsgroups: gmane.emacs.devel
Subject: Re: describe-char on emoji sequences
Date: Wed, 27 Oct 2021 19:30:22 +0300
Message-ID: <83sfwmv3a9.fsf@gnu.org>
In-Reply-To: <874k92a6j6.fsf@gnus.org> (message from Lars Ingebrigtsen on Wed, 27 Oct 2021 16:25:49 +0200)
To: Lars Ingebrigtsen
Cc: emacs-devel@gnu.org

> From: Lars Ingebrigtsen
> Cc: emacs-devel@gnu.org
> Date: Wed, 27 Oct 2021 16:25:49 +0200
>
> Eli Zaretskii writes:
>
> Take ⚠️:
>
> to input: type "C-x 8 RET 26a0" or "C-x 8 RET WARNING SIGN"
> buffer code: #xE2 #x9A #xA0
> file code: #xE2 #x9A #xA0 (encoded by coding system utf-8-emacs)
> display: composed to form "⚠️" (see below)
>
> Composed with the following character(s) "️" using this font:
> ftcrhb:-GOOG-Noto Color Emoji-medium-normal-normal-*-19-*-*-*-m-0-iso10646-1
> by these glyphs:
> [0 1 9888 112 24 0 24 18 5 nil]
> with these character(s):
> ️ (#xfe0f) VARIATION SELECTOR-16
>
> If you insert
>
> C-x 8 RET WARNING SIGN and then C-x 8 RET VARIATION SELECTOR-16 you
> indeed get:
>
> ⚠️
>
> So
>
> to input: type "C-x 8 RET 26a0" or "C-x 8 RET WARNING SIGN"
>
> could just be expanded to include all the code points in the
> decomposition.  Of course, with some of these sequences (with five code
> points), doing this is completely impractical, so perhaps it's not worth
> doing.

We could add such a feature, but note that the existing display all
but gives it to you already: it shows the codepoints of the other
characters in parentheses.

> The point of this is that it's not always clear what an emoji is trying
> to express.  For instance, if somebody writes you a message about 👨🏻‍✈️,
> it'd be nice if Emacs could tell you that it's a "man pilot".

Here, you are talking about a different feature: a kind of "parser" of
Emoji sequences that would convert a sequence of characters into its
Emoji description.  I don't think it belongs in describe-char, but it
could be a useful feature on its own.
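
To make the two features discussed above concrete, here is a minimal
sketch in Python (not Emacs Lisp, and not how describe-char is
implemented).  The function code_points lists every code point in a
sequence, which is what expanding the "to input" line would need.  The
function describe_sequence maps a whole sequence to one description,
which is the separate "parser" idea.  Both function names and the
SEQUENCE_NAMES table are hypothetical, and the table's single entry
assumes the five-code-point decomposition written out in the comment.

```python
import unicodedata

# Hypothetical, hand-written table for illustration only.  A real tool
# would load sequence names from Unicode's emoji data files rather than
# hard-code them.  Assumed decomposition of the "man pilot" example:
# MAN, skin tone modifier, ZERO WIDTH JOINER, AIRPLANE, VARIATION SELECTOR-16.
SEQUENCE_NAMES = {
    "\U0001F468\U0001F3FB\u200D\u2708\uFE0F": "man pilot",
}


def code_points(text):
    """Return (code point, Unicode name) pairs for each character in text."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


def describe_sequence(text):
    """Return the description of a whole sequence, or None if unknown."""
    return SEQUENCE_NAMES.get(text)


if __name__ == "__main__":
    # WARNING SIGN followed by VARIATION SELECTOR-16, as in the message.
    for cp, name in code_points("\u26A0\uFE0F"):
        print(cp, name)
    # prints:
    # U+26A0 WARNING SIGN
    # U+FE0F VARIATION SELECTOR-16
    print(describe_sequence("\U0001F468\U0001F3FB\u200D\u2708\uFE0F"))
    # prints: man pilot
```

The two functions are kept separate on purpose, mirroring the point
made above: listing code points works for any text, while naming a
sequence needs a lookup table of known sequences.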