From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Yuri Khan Newsgroups: gmane.emacs.devel Subject: Re: ucs-normalize and diacritics Date: Thu, 26 Jul 2018 04:01:25 +0700 Message-ID: References: <8736w88pnn.fsf@gmail.com> <83lga0v4ff.fsf@gnu.org> <87tvoo73s9.fsf@gmail.com> <83fu08ujln.fsf@gnu.org> <837eljv0v0.fsf@gnu.org> <87k1pj5b3w.fsf@gmail.com> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Trace: blaine.gmane.org 1532552392 2824 195.159.176.226 (25 Jul 2018 20:59:52 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Wed, 25 Jul 2018 20:59:52 +0000 (UTC) Cc: Eli Zaretskii To: Emacs developers Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Jul 25 22:59:47 2018 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1fiQsr-0000Zu-Kx for ged-emacs-devel@m.gmane.org; Wed, 25 Jul 2018 22:59:45 +0200 Original-Received: from localhost ([::1]:55961 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fiQuy-0007fr-Hp for ged-emacs-devel@m.gmane.org; Wed, 25 Jul 2018 17:01:56 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:41254) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fiQup-0007fj-Ml for emacs-devel@gnu.org; Wed, 25 Jul 2018 17:01:50 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fiQum-00013o-Or for emacs-devel@gnu.org; Wed, 25 Jul 2018 17:01:47 -0400 Original-Received: from mail-oi0-x22a.google.com ([2607:f8b0:4003:c06::22a]:46898) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1fiQum-000139-JM; Wed, 25 Jul 2018 17:01:44 -0400 Original-Received: by mail-oi0-x22a.google.com with SMTP id y207-v6so16365001oie.13; Wed, 25 Jul 2018 14:01:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=jHui1EiqOqxBHLo/RgnpWPr6jpESC9UFo4+AJ4JesKQ=; b=Sdus/xmd3m1J2ECqmwkwYjPSvum99z1dcKM2Kty88fIfBwElfO/x2kpXNPf4n66VVg YbdXHxZ22Iu8SA8Pfi8UcAV+wMLsf7EuI/LolZZ7tmS5PpMWgrB/yh0lSQjhe/NyL3Bx VUGGbKxLvIrnjgMZYWDzyJRN6DvMJXja/tIMrY/iRFgCGWldIAtCIMR/EAXGeSmIVsYN +rbKCx4CAn+6oc0W399UJpTrLPDCb+FFpRI4OfPeAyEzBGVu5Z9+nioZa+Mmqiwsx2Ph dv6ydW3CKr4b+eqvEEk+YzibmUkkcTNQIbF2fP05hDbgdWN9JG29ZHyn3SDAtQztyBsB bzGw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=jHui1EiqOqxBHLo/RgnpWPr6jpESC9UFo4+AJ4JesKQ=; b=q1hqoVPOT+6YvvXZWDYZl1KAZN+KnxJvZzdDg50rCA1w3NyNZni027ZL1V081B4NGt hhV/n0C9ciX0K6gimPrUsAsn3pipWUCcgjL1juY1eYmfE3T+ogCyv2EWRK2IGlO+Vj5P /lwd1x5zHoh0puAP+RKtGs9tjabwpl/7sYKjOjivps4/CeBnbb6kzoeOmP9ZNmUfkOuq yTbb5np/U8aW9e98oYc6Uv18plC7Lgi4Xm/XREs8tph6aWbF4/GB8YI5wzWr1IgcFTac hw3X7pRaWBedJSwrez2S52isGtNMTKbOzMUTCJshrlGpqNQDu0B/sYm2E5qTTzk2ekjn d9eg== X-Gm-Message-State: AOUpUlHK4Q3A17yyYY51DOO5l+6MxKpaa0Uo3ndtJvexhugKLQAg//DC xn+gpXL7OSfxxAqy9e5YCWvBy2pGv+x0NEDe/Nk8BbwG X-Google-Smtp-Source: AAOMgpdHvVTBq3LqrCGfC3W/UBttshlsWPljHD8R68f60t7xrBARxfZVZ2fwYhHhIdtNHC+lKQ/KF8RQIDQLVUU9VEE= X-Received: by 2002:aca:3c02:: with SMTP id j2-v6mr5357275oia.50.1532552499277; Wed, 25 Jul 2018 14:01:39 -0700 (PDT) In-Reply-To: <87k1pj5b3w.fsf@gmail.com> X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 2607:f8b0:4003:c06::22a X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:227816 Archived-At: On Thu, Jul 26, 2018 at 3:12 AM Robert Pluim wrote: > how common is 2-character composition? For Cyrillic letters and acute accent, there are no precomposed forms at all, so when you need to explicitly call out the stressed syllable, you=E2=80=99re going to use the composing acute accent. That need arises: (= 1) in educational texts on every word with more than one syllable, as training wheels of sort; (2) when you introduce an uncommon word or a proper name that the reader is not expected to know how to stress; (3) to disambiguate words that look the same but stressed differently. There are precomposed forms for =D1=91 (Cyrillic {capital|small} letter io =3D Cyrillic {capital|small} letter e + Combining diaeresis) and =D0=B9 (Cyrillic {capital|small} letter short i =3D Cyrillic {capital|small} letter i + Combining breve). Using composition for these would be considered highly unusual.