From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Robert Pluim Newsgroups: gmane.emacs.devel Subject: Re: ucs-normalize and diacritics Date: Wed, 25 Jul 2018 22:44:50 +0200 Message-ID: <87fu0759kt.fsf@gmail.com> References: <8736w88pnn.fsf@gmail.com> <83lga0v4ff.fsf@gnu.org> <87tvoo73s9.fsf@gmail.com> <83fu08ujln.fsf@gnu.org> <837eljv0v0.fsf@gnu.org> <87k1pj5b3w.fsf@gmail.com> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: blaine.gmane.org 1532551418 30748 195.159.176.226 (25 Jul 2018 20:43:38 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Wed, 25 Jul 2018 20:43:38 +0000 (UTC) Cc: emacs-devel@gnu.org To: Cesar Crusius Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Jul 25 22:43:34 2018 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1fiQdA-0007qB-C4 for ged-emacs-devel@m.gmane.org; Wed, 25 Jul 2018 22:43:32 +0200 Original-Received: from localhost ([::1]:55588 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fiQfG-0003DK-SO for ged-emacs-devel@m.gmane.org; Wed, 25 Jul 2018 16:45:42 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:36256) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fiQeX-0003D1-IB for emacs-devel@gnu.org; Wed, 25 Jul 2018 16:45:00 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fiQeU-0006q6-GO for emacs-devel@gnu.org; Wed, 25 Jul 2018 16:44:57 -0400 Original-Received: from mail-wm0-x233.google.com ([2a00:1450:400c:c09::233]:34666) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1fiQeU-0006ob-9Q for emacs-devel@gnu.org; Wed, 25 Jul 2018 16:44:54 -0400 Original-Received: by mail-wm0-x233.google.com with SMTP id l2-v6so479123wme.1 for ; Wed, 25 Jul 2018 13:44:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:references:mail-followup-to:mail-copies-to :gmane-reply-to-list:date:in-reply-to:message-id:mime-version :content-transfer-encoding; bh=x+YJ9O2HWeuLMn61FQplA0O0Sat5nt1PqvlMlihvmg4=; b=svt3pyocE61hWSWQcsMouvdrEbd37YYQvTauB1NJaKX+Z5VQvCYalw/BKxqH6Fb5ZK Kf/ToG8ac58czDmgOGza6fuX9HcQRASciJcYUVPw2j95412xhL+iwQOqUZNr4FBwhlxR bG0sgiA/qQAAVEEn+Gl88tB8+LwERwf0GwprFpHIdHBgoObpnrT5NtKKerE5ufBmbCJv 2C7oZbS4fO/9A6YQTxqSyzvuo0VtsSevdsPsmGmNpR+jxNKUp/mv7O85s1k+zcHJPE2w WLYHnhTiJJP5/TvemHOwHN5s7QpMX0x+GC+Hls57P9mEe7hIHWD4lVwUYhEPT0K1+btl CtxA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:references:mail-followup-to :mail-copies-to:gmane-reply-to-list:date:in-reply-to:message-id :mime-version:content-transfer-encoding; bh=x+YJ9O2HWeuLMn61FQplA0O0Sat5nt1PqvlMlihvmg4=; b=i+pciqvPsLu3PsaXHhULYb+HVNhRwXM1TzcftfoYEqtQfQrHJ/m7I3jmX76grNDeGa /T/xySW3lZJ8Vwkbn4pd4wk6doJIAH3hfTFmJJUP9CwX8STVezeNOK+BIibtTurBBxaI uSCIkNoczisD+fHUozHNYduJSiX+8Rk6l9gHLOeosolwvQcHAuotxzvaCZDynEgf7NnT dClP1NZlaixDn6Z8bi/gQg7OBkLkLGWE95RB5+FJgUVM79COKo0DxI7DNHU5gYYzMdzq tq5X9bK84biMPi7aZ8gtzIf8PviSdAZcXe8guk5G3G6WG9KhgjNQIiy2RSyBz5P0TgQs TWbw== X-Gm-Message-State: AOUpUlHVkh0AZXQH+sB6IwKPGR0iKhu91FDLJLHllZIMHdZDV7ao1v9m wp4Iakg3ZgvWE42LavHklOyCrU59 X-Google-Smtp-Source: AAOMgpe6O68lT5gI+bvclSDYcboB7hlYhEH6Wa5GKuUUDpFhnS/Zx6vY+iZF8WWgcQE7w7GE9OyowA== X-Received: by 2002:a1c:c60a:: with SMTP id w10-v6mr5503756wmf.26.1532551492784; Wed, 25 Jul 2018 13:44:52 -0700 (PDT) Original-Received: from rpluim-ubuntu (vav06-1-78-207-202-134.fbx.proxad.net. [78.207.202.134]) by smtp.gmail.com with ESMTPSA id v130-v6sm5985425wme.13.2018.07.25.13.44.51 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 25 Jul 2018 13:44:51 -0700 (PDT) Mail-Followup-To: emacs-devel@gnu.org Mail-Copies-To: never Gmane-Reply-To-List: yes In-Reply-To: (Cesar Crusius's message of "Wed, 25 Jul 2018 13:21:54 -0700") X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 2a00:1450:400c:c09::233 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:227815 Archived-At: Cesar Crusius writes: >> How common is 3-character composition likely to be? (for that matter, >> how common is 2-character composition? I explicitly use input methods >> for this kind of stuff). I can envisage an algorithm that takes a >> combining character, then scans backwards to see if the font used for >> it will cover all previous characters, recursively. It does seem like >> a lot of effort for a small return. > > Recalling a recent discussion, they are unavoidable in polytonic > Greek, because Unicode does not provide the pre-combined > character. There's no other way to get an "rough breathing long alpha > with acute accent," =E1=BE=B1=CC=94=CC=81. (Which by the way Emacs handle= s nicely with the > font I use, Iosevka.) Even that is only a two character composition (unless I=CA=BCve misunderstood the what-cursor-position output), and it=CA=BCs rather specialized, and you know what you=CA=BCre doing :-) > Granted, not many people will use this, but for those who do, they > will be all over the place. I'm not sure I understand the comment. Current behaviour is what it is, I=CA=BCm not proposing anything that would make it worse. Robert