From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Yuri Khan Newsgroups: gmane.emacs.devel Subject: Re: Character folding in the pretest Date: Wed, 3 Feb 2016 22:52:54 +0600 Message-ID: References: <56B1B3A1.2050605@cs.ucla.edu> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1454518414 31286 80.91.229.3 (3 Feb 2016 16:53:34 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 3 Feb 2016 16:53:34 +0000 (UTC) Cc: Paul Eggert , Emacs developers To: Filipp Gunbin Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Feb 03 17:53:33 2016 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1aR0gP-0006Do-BT for ged-emacs-devel@m.gmane.org; Wed, 03 Feb 2016 17:53:33 +0100 Original-Received: from localhost ([::1]:36236 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aR0gO-0001dm-DX for ged-emacs-devel@m.gmane.org; Wed, 03 Feb 2016 11:53:32 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:58469) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aR0g8-0001dJ-4L for emacs-devel@gnu.org; Wed, 03 Feb 2016 11:53:17 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aR0g7-0004TM-9W for emacs-devel@gnu.org; Wed, 03 Feb 2016 11:53:16 -0500 Original-Received: from mail-lf0-x22b.google.com ([2a00:1450:4010:c07::22b]:36816) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aR0g7-0004Sj-0e for emacs-devel@gnu.org; Wed, 03 Feb 2016 11:53:15 -0500 Original-Received: by mail-lf0-x22b.google.com with SMTP id 78so18063354lfy.3 for ; Wed, 03 Feb 2016 08:53:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc:content-type:content-transfer-encoding; bh=EcPNAJAj6P4vLpZ838KZPqHlBua3DGyKFLFtij1rQ48=; b=mjZP7+CXBJlTgjvdzCBmo0lvWtBzldAcpefoPARsOWZpdAidvBrMzNqLkimAm+Ad0Z OiA44WyF7IjzW8uwf/dzeDuEXs00o75fF4PLYwHJ1JPS013Bcm0AEPJWNDMdn5gV40Nf gK4qOAGeEFJLkoxmijB9EO/OXHN0yx0CKXeyHPmt5sd7akjUGCZ6uKpg8ahM5RYAkHD9 iOfYQBV3SuJ7tper2Cao3Kber4MTL50h9xCufHc0OyGOHQAnSmOlN7/SmQy2dIr8NqSC x9KyKXvHIVF6H2V23MQbSKWN2N0fE+6pGDEIDMS2YnvRmagXnhfQlXj215LTIJlpj2vO szZw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to:cc:content-type :content-transfer-encoding; bh=EcPNAJAj6P4vLpZ838KZPqHlBua3DGyKFLFtij1rQ48=; b=cVXnScdAy63CUKxkl5PLYE3hMsiOLtTjXLP7+I776IZMiH35JzoEUevcqlutGU3t+K fHaOeL/5AbuSCmoA9voQ6BaPWEbN9jdskArDnGy8GjDow0XQpVYpfTjTZEm9VXoN/Zvk umwyp/Mj33/IWWal+FXrOiPIkRnBOb2oJXoN4BZSdWwVh3P7poKIjq6PvVbiOM2f5UGn HZJqFIAJM1LniX5hP1Ojfu67cdtrpMaYJoCLJn7Ci+I+bwhxuIHRNBwa3j30IYOzxQGV UT7YoKmhD31R4nJ3LnkbCK898HV4nwVeBOYT2apINj2E1qJC/NLMS+UzHSHyr0X1O19m QOBA== X-Gm-Message-State: AG10YOQ0kr4P1u+5+Kf0Td8RHfg0mg3W9DpYf5TIZfTZxvwj9lL/DfUOXweV+PXXlMxxhQ2gtAl92ZH/PxiqlQ== X-Received: by 10.25.168.15 with SMTP id r15mr1110495lfe.166.1454518394167; Wed, 03 Feb 2016 08:53:14 -0800 (PST) Original-Received: by 10.112.7.100 with HTTP; Wed, 3 Feb 2016 08:52:54 -0800 (PST) In-Reply-To: X-Google-Sender-Auth: glx6NB4XPBT3SgRCDwbhDeB-Of0 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2a00:1450:4010:c07::22b X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:199249 Archived-At: On Wed, Feb 3, 2016 at 9:57 PM, Filipp Gunbin wrote: >> =D0=95 and =D0=81, on the other hand, are a holywar-inducing contention >> point. > > They have their own places in the Russian alphabet. I think > char-folding should fold only "modified" letter variants into > "canonical" form (without any modifications). > > =D0=95 and =D0=81 are just separate letters, although we don't use =D0=81= much... Oh, we use it all the time. It=E2=80=99s just that many people habitually write =D0=95 in place of =D0=81. And this is exactly the reason why char folding becomes relevant for this particular pair. When searching in a text by someone other, I will want to fold so that I find occurrences where I would write =D0=81 but other would replace it with =D0=95. Likewise, those other people, when reading my text, will want to fold in order to find occurrences where they would write =D0=95 but I would write =D0=81. > Once I "fixed" all our text resources files at work and a colleague of > mine commented in review that =D0=81 is used only in childrens books. I = had > to revert the change. In this situation, you will want to not fold, so that you can search for all instances of =D0=81 and decide which to replace with =D0=95. (Even = when the policy is to avoid =D0=81, it is still mandatory in cases of ambiguity.)