From mboxrd@z Thu Jan  1 00:00:00 1970
From: "T.V Raman"
Newsgroups: gmane.emacs.devel
Subject: Re: Entering emojis
Date: Thu, 28 Oct 2021 07:19:55 -0700
References: <87cznths5j.fsf@gnus.org> <87ilxi7531.fsf@gnus.org> <875yth7bjr.fsf@gnus.org>
In-Reply-To: <875yth7bjr.fsf@gnus.org> (Lars Ingebrigtsen's message of "Thu, 28 Oct 2021 11:18:00 +0200")
To: Lars Ingebrigtsen
Cc: emacs-devel@gnu.org

Lars Ingebrigtsen writes:

Nice!

Perhaps what you have already gives us what I am about to suggest, but
given that there are 256 variation selectors, it might be useful to
have a bigram/trigram model of the chars that can meaningfully compose
into an emoji?

With such a bigram/trigram model, the choices could be filtered
progressively without creating a giant list of choices; this would let
the user focus on the sequences that make sense as an emoji.

> Lars Ingebrigtsen writes:
>
>> https://unicode.org/Public/emoji/14.0/emoji-test.txt
>>
>> So I'll be rewriting that bit tomorrow.  That file also seems to allow
>> me to easily make the derivation groups without parsing all the other
>> files, so it should be faster, too.  So... better all around, I think.
>
> Yup.  This is now implemented, and it's way less hacky, and I seem to be
> able to get all the derivations I'd expect (and I don't see any false
> positives).

-- 
Thanks,

--Raman(I Search, I Find, I Misplace, I Research)
♉ Id: kg:/m/0285kf1 🦮
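
A minimal sketch of the bigram idea suggested above, in Python rather than Emacs Lisp, and not the implementation discussed in this thread. The names `build_bigrams`, `next_choices`, and `SAMPLE_SEQUENCES` are hypothetical, and the sample is a hand-picked handful of sequences; a real model would presumably be built from the code point sequences listed in emoji-test.txt.

```python
from collections import defaultdict

START = None  # marker meaning "no code point entered yet"

# Hand-picked sample of emoji code point sequences (an assumption for
# illustration only; not parsed from emoji-test.txt).
SAMPLE_SEQUENCES = [
    (0x1F44D,),                  # thumbs up
    (0x1F44D, 0x1F3FB),          # thumbs up + light skin tone modifier
    (0x1F44D, 0x1F3FF),          # thumbs up + dark skin tone modifier
    (0x2764, 0xFE0F),            # heavy black heart + variation selector-16
    (0x1F468, 0x200D, 0x1F4BB),  # man + ZWJ + laptop (man technologist)
    (0x1F468, 0x200D, 0x1F469, 0x200D, 0x1F467),  # family: man, woman, girl
]


def build_bigrams(sequences):
    """Map each code point (or START) to the set of code points that
    follow it in at least one of the given sequences."""
    model = defaultdict(set)
    for seq in sequences:
        prev = START
        for cp in seq:
            model[prev].add(cp)
            prev = cp
    return model


def next_choices(model, prefix):
    """Return the sorted code points the model allows after PREFIX.

    Only the last code point of PREFIX is consulted, which is what
    makes this a bigram model."""
    last = prefix[-1] if prefix else START
    return sorted(model.get(last, ()))


if __name__ == "__main__":
    model = build_bigrams(SAMPLE_SEQUENCES)
    for cp in next_choices(model, (0x1F44D,)):
        print(f"U+{cp:04X}")
```

With this sample, the choices offered after thumbs up are just the two skin tone modifiers, so the user never sees one giant list. A bigram model looks only at the previous code point, so after man + ZWJ it also offers the girl code point, a combination that is not in the sample; a trigram model (or a prefix trie) would narrow the choices further.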