From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Robert Pluim Newsgroups: gmane.emacs.devel Subject: Re: ucs-normalize and diacritics Date: Thu, 26 Jul 2018 10:14:52 +0200 Message-ID: <87bmau5s77.fsf@gmail.com> References: <8736w88pnn.fsf@gmail.com> <83lga0v4ff.fsf@gnu.org> <87tvoo73s9.fsf@gmail.com> <83fu08ujln.fsf@gnu.org> <837eljv0v0.fsf@gnu.org> <87k1pj5b3w.fsf@gmail.com> <87fu0759kt.fsf@gmail.com> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha256; protocol="application/pgp-signature" X-Trace: blaine.gmane.org 1532592839 2419 195.159.176.226 (26 Jul 2018 08:13:59 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Thu, 26 Jul 2018 08:13:59 +0000 (UTC) Cc: emacs-devel@gnu.org To: Cesar Crusius Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Jul 26 10:13:55 2018 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1fibPG-0000WZ-RS for ged-emacs-devel@m.gmane.org; Thu, 26 Jul 2018 10:13:54 +0200 Original-Received: from localhost ([::1]:59939 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fibRN-000710-La for ged-emacs-devel@m.gmane.org; Thu, 26 Jul 2018 04:16:05 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:39273) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fibQM-0006yS-Km for emacs-devel@gnu.org; Thu, 26 Jul 2018 04:15:06 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fibQJ-0003Y5-JU for emacs-devel@gnu.org; Thu, 26 Jul 2018 04:15:02 -0400 Original-Received: from mail-wm0-x235.google.com ([2a00:1450:400c:c09::235]:36545) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1fibQJ-0003Xf-B5 for emacs-devel@gnu.org; Thu, 26 Jul 2018 04:14:59 -0400 Original-Received: by mail-wm0-x235.google.com with SMTP id s14-v6so1058711wmc.1 for ; Thu, 26 Jul 2018 01:14:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:references:mail-followup-to:mail-copies-to :gmane-reply-to-list:date:in-reply-to:message-id:mime-version; bh=9oTx40f6Q447ISsJUZFG8b1UmU+tV0iGgM3lf5OFoJg=; b=awiLcXGVO3fIpRjQ6zWnLURXzCdzNh45liZ3cG1vE6GqAuzsVmUcDSkiKKRLtJcTbx xV61ZanPwzmiQFXzDVfUYMMmfBGas7MJVx9R0/FnIT4zhIu925aC/UQVANCEOdTpYzcz xPR0v1e7bTUZtnfKuHzFwJaujBet2QQWNse95OevkhPJqBY4cU25YIPSNcrR8L6+fFSw x60UU5fmAav5eWiz42+C/9upzoVB71goGjHK7Lhkz1zxpV9UvNj79gdnvaujE3HKir+N QA6PdXrQJdzbxttzL4qL4wRuIauZkg3w31Y2WqDzzdYLGE5tuWzbwETiTBzod1btCJ8h KccQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:references:mail-followup-to :mail-copies-to:gmane-reply-to-list:date:in-reply-to:message-id :mime-version; bh=9oTx40f6Q447ISsJUZFG8b1UmU+tV0iGgM3lf5OFoJg=; b=r4Wyi+lFJoosFLpUVl2iN4JLkLFWEOn/6Ghom8vl5gPO2uFCqT+H6jIMy1zFld4Xwv HF5ErKTvS328sGPDtSVEgauEgue3tzgS/g6r5eBFV9WdNOuti4ogXMmi5AQihCNd0td7 c6KODwX1UeRPux3lsEylAI1VG/m4vF25e0eVTu5y0WfkCu3wBRxLwfaHBDaCIJ8F2WBC ZLti5G6OV0OU0+J71mEGiVP/uzh5lxeSZJ9X2JVXhxK3zIwpDCvSH/v9TQGEqoSwpOsl qs1WyMvZpG9OpnLRYug0gTlDjSvoDB7cS2x9eHExZxhOdB2L7cFizB8+K8jKL6b2TPqE 3SCg== X-Gm-Message-State: AOUpUlErYLmtAEPtQlMg9PXjy6sS1i1+1ZZrBbkc9xcOoInos12PQbzj AUjkVqRpMR1ww6QyDuABmHiUEUk5 X-Google-Smtp-Source: AAOMgpeesT+soeSgwkA6QxBN6fuOdXmIu4M3kR4yt9T4a1+Lr8xcyL7WO7cIocZmNQkIrue5QvvWNw== X-Received: by 2002:a1c:5e08:: with SMTP id s8-v6mr927268wmb.88.1532592898013; Thu, 26 Jul 2018 01:14:58 -0700 (PDT) Original-Received: from rpluim-ubuntu ([149.5.228.1]) by smtp.gmail.com with ESMTPSA id x62-v6sm1051799wmg.1.2018.07.26.01.14.56 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 26 Jul 2018 01:14:56 -0700 (PDT) Mail-Followup-To: emacs-devel@gnu.org Mail-Copies-To: never Gmane-Reply-To-List: yes In-Reply-To: (Cesar Crusius's message of "Wed, 25 Jul 2018 15:59:30 -0700") X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 2a00:1450:400c:c09::235 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:227827 Archived-At: --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Cesar Crusius writes: > Robert Pluim writes: > >> Cesar Crusius writes: >> >>>> How common is 3-character composition likely to be? (for that matter, >>>> how common is 2-character composition? I explicitly use input methods >>>> for this kind of stuff). I can envisage an algorithm that takes a >>>> combining character, then scans backwards to see if the font used for >>>> it will cover all previous characters, recursively. It does seem like >>>> a lot of effort for a small return. >>> >>> Recalling a recent discussion, they are unavoidable in polytonic >>> Greek, because Unicode does not provide the pre-combined >>> character. There's no other way to get an "rough breathing long alpha >>> with acute accent," =E1=BE=B1=CC=94=CC=81. (Which by the way Emacs hand= les nicely with the >>> font I use, Iosevka.) >> >> Even that is only a two character composition (unless I=CA=BCve >> misunderstood the what-cursor-position output), and it=CA=BCs rather >> specialized, and you know what you=CA=BCre doing :-) > > There are three characters in total, =E1=BE=B1 plus =CC=94plus =CC=81. You=CA=BCre right, I=CA=BCd missed the final combining accent. Robert --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQIcBAEBCAAGBQJbWYL8AAoJED165MeRD2FAO5IQAIcIUi4EMI86Je3wsYEfcIbO MEucY4czRbB7RhjWmnPGpx6xEavPsyAu0sILw/5bCZXAQ1DfQ3EIaffVqojHDrPp h9c5iVck0bFEmMNc3uh9enOWyLqWKrvKU1CZmZ+MqVUGLIaAV2fggnrDwVNZNLQA 2B4UWM5TTHcIe5CK391oReIpVXurf2eBWWIkiXdJtdAUjiYW9Om9FQXGS4gsNV7x d6UdTe0TreVfeqFOUT1MzQbjSMbPwG6H8bsjNkbOZwtWdor0hlTQCpbfZ8pDr5SO oj2L5RbedEpEJ1mpI6S2R6j0KbmYK+xoto+W5hqJDVQgPhofCe91/RdGiNIiEPDH IS6lMV9/ImM9c4vBAEHHX0sD+fqWAN51SZ4lKMDAUqSBLONG7Nj8YqlK4ULWhaGy k2YAC6CWKOEPWpkk1x/w5NYrxNSGnUl3x0KfyjLm3BFS5j5Eaza9t4MNZJ8WnkWl 3ub6/9Klq5P9oRZ3ZLm1NY3cXLkmwZIpUUXxwC1kx/Z0mpzZaDlPSxdbDZ/T2PnK PKq7pJEwuHhJhCHDWuB4n45CvDbiObG18oj+I8KhaB0Y+hEBbGzTe592lPy8/HH1 1HuthaZeCl6mMnopdgcnVAZtmEWcLuo6/NOIVTLMaT6ZCvzGoElmbFmHkUM3Nym9 WIbMemdLyt0npGtgXnAu =27Am -----END PGP SIGNATURE----- --=-=-=--