From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: =?UTF-8?Q?Per_Starb=C3=A4ck?= Newsgroups: gmane.emacs.devel Subject: Character folding in the pretest Date: Wed, 3 Feb 2016 01:31:11 +0100 Message-ID: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1454459493 8636 80.91.229.3 (3 Feb 2016 00:31:33 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 3 Feb 2016 00:31:33 +0000 (UTC) To: "emacs-devel@gnu.org" Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Feb 03 01:31:28 2016 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1aQlLz-0002jF-Qv for ged-emacs-devel@m.gmane.org; Wed, 03 Feb 2016 01:31:28 +0100 Original-Received: from localhost ([::1]:60099 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aQlLz-0005SX-59 for ged-emacs-devel@m.gmane.org; Tue, 02 Feb 2016 19:31:27 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:49750) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aQlLl-0005SF-RU for emacs-devel@gnu.org; Tue, 02 Feb 2016 19:31:15 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aQlLk-0002Qw-7x for emacs-devel@gnu.org; Tue, 02 Feb 2016 19:31:13 -0500 Original-Received: from mail-vk0-x22c.google.com ([2607:f8b0:400c:c05::22c]:34413) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aQlLk-0002Qr-2f for emacs-devel@gnu.org; Tue, 02 Feb 2016 19:31:12 -0500 Original-Received: by mail-vk0-x22c.google.com with SMTP id e185so3601863vkb.1 for ; Tue, 02 Feb 2016 16:31:11 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=bUyhPBV9oQwOZJ7MyvWsarM09iZBCXE/vj85J01rjcI=; b=fauvcIdiYyU4Zlw2HvPL35NrAeB5eOpi/EBgBwEupdrRuKs1jm0PIN3HyM045maM5a BPhBTBeqdwLHHXNRaOpiZiQScPYm4JgH9qdHQUBHdUoWTqHMx3ZBICSgwmriyhVl7FJ5 QepSllUsBy7Qh9BXsbjttJ07k53rnbjyAsUo5T+M2UrzyYNbf10HHIeb02LQSu8wDhAw Q465bX+coVKxZooQKQdFpQjKkpyOB61NiUci4Up7x9pAg5h7wSUecF0V4ZZWQNC+jLKT pR3rSV9DxdYKgv/g9Ks0JoDRjinIvaytb59nHVVlf1L7lyuyDODFGoz5COAa+mZhV31d rkUg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:sender:date:message-id:subject:from :to:content-type:content-transfer-encoding; bh=bUyhPBV9oQwOZJ7MyvWsarM09iZBCXE/vj85J01rjcI=; b=ChGC0gR+0E+13doITyWUGwwxMQC4K6ZFr6km1vn9U/HVACkfgZ+htGVVJPhS+C4ay8 pPsKpxFvV0IHW28tZX74jZ4CoiYPj9L5UETy/5/ePMx1yQgh83kuu1SaOs6SmtuxgkZS QaHziBsHvPT5yV7ziDSyRFMT9+riZ69XD7L23AgXx5rpBWxw0yReL5jb2bMBXZRGUrjo lsvrddiH9Xcr0ScgG3L+SCRG09wpw39Jq096XkHceqwUMV7GRL4lc0PbPr9xz9qbip3d xYKFhIb/rRnvdba6R5fLSCzwfpTLuqbZVFJLUt3M4WidBJ9W0RpK12akdABXAu0M7Hy1 oaAg== X-Gm-Message-State: AG10YOSF6bm/QLji1ITKGbQk5NLKsa8hTasM4p9bwGck2FiC3cW4uSFGkDrTPbGBdZ5rIT+1aLylU+CHMdIvuw== X-Received: by 10.31.178.134 with SMTP id b128mr19887839vkf.112.1454459471373; Tue, 02 Feb 2016 16:31:11 -0800 (PST) Original-Received: by 10.31.15.197 with HTTP; Tue, 2 Feb 2016 16:31:11 -0800 (PST) X-Google-Sender-Auth: gElNjOvmjYCDjCOg6Bx_gaXmgFc X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2607:f8b0:400c:c05::22c X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:199200 Archived-At: I brought up earlier that the new character fold feature that still hasn't been in any released version of Emacs shouldn't be turned on by default when it debuts. Now I've tested the first prerelease of Emacs 25, and seen that it is still turned on by default, so I'll revisit this and argue why this is important. Probably what I say here is all I have to say. =3D=3D=3D There was a lot of agreement =3D=3D=3D RMS wrote that there ought to be a poll about the default. Eli wrote that > Such a poll could only work if the behavior intended to become the > default is already available in released versions of Emacs, so users > could turn it on and try it. This is not the case with character > folding, which is only available in development snapshots, and > actually is still in flux: it changes in non-trivial ways almost every > day. > > If we are afraid users will hate this default, we can turn it off in > v25.1 and consider making it the default later. RMS commented: > That seems like the right approach. Artur Malabarba wrote: > I don't mind leaving this OFF by default in Emacs 25. So long as the > eventual goal is to have it ON by default (preferably in 26). Drew liked the feature and thought it should be turned off initially: > My expectation, if we turn it off by default, is that users will > try it, like it, and possibly ask for it to become the default > behavior. There is no reason to jump the gun on this. Eli thought that it should remain turned on in the pretest to get more testing: > The entire time interval between Nov 15 this year and until we release > Emacs 25.1 (which will take a few months, probably more than 6, > judging by past experience) is supposed to provide that feedback. All > it takes to turn this off by default is changing the default value of > a single variable (and change a couple of places in the User Manual to > reflect that). Once we decide to do that, it can be done very quickly > and easily. We can do that a day before the release, if we want to. > > OTOH, turning it off today means that it will get much less testing, > and therefore bugs related to it (like the one reported just today in > http://debbugs.gnu.org/cgi/bugreport.cgi?bug=3D22090) will most probably > remain hidden for who knows how long. It's time to make that decision now. =3D=3D=3D Why =3D=3D=3D Because this is a big change that have repercussions that haven't had all the major wrinkles ironed out yet. Some software throws big changes like that in the face of the users, and more or less force them to get the kinks out or find out how to turn it off, but that is usually not the Emacs way. Here usually the big kinks are already taken care of when something is introduced to users who haven't specifically asked for it. That's a good thing. Eli argued against me that Emacs sometimes does that, for example with bidi which he argued was a much bigger change. In some ways it was, but still, for people who don't use RTL languages all of that has been more of less invisible, and for those who do it was obviously better than without it. I know how the current character fold version is *just wrong* for Swedes and other Scandinavians when handling their native languages. There was a flurry of messages then which I couldn't keep up with, and where I thought most of it took up issues I already had answered anyway, but I'm getting back to this now. One answer was that problems for Scandinavians wasn't relevant, because I had to show that it was "_wrong_ in _most_ situations" to be relevant. I don't agree with that, but even if you do, I think my Scandinavian example is only an example, and that there probably are several similar in different locales. =3D=3D=3D What was that Swedish example now again? =3D=3D=3D A and =C3=84. In classical Latin U and V was the same letter. Not until Late Middle Ages were there these two forms and they weren't differentiated one as a consonant and one as a vowel until the 16th century. In spite their historical equivalness they are clearly different letters in for example English. Having a character fold feature where a search for U found V would be *just wrong*. Since everyone on this list knows English everyone knows that. What we get now for Swedish is very similar to that. Everyone who knows Swedish knows that. Here =C3=85=C3=84=C3=96 are separate letters in a= ll ways from A and O, in spite of their historic origin tying them together. That is just history. "=C3=84" has its own key on a keyboard, its own name and its own position in the alphabet. For a Swede to have a search for "varpa" in a Swedish text find "v=C3=A4rpa" or "varp=C3=A5" would be *just wrong*. It would give a strong impression of this being an American program not meant to be used for Swedish. Note that this is not me saying that we Swedes don't like character folding. It's a perfectly good feature to have a search for "entre" find "entr=C3=A9" or a search for "crepe" find "cr=C3=AApe" because "=C3=A9= " and "=C3=AA" are accented variants of "e". But "=C3=A4" in Swedish is in no way an accented letter. At this point several people usually reply "then just turn it off". But the point is that by having it work like this out of the box it sends a message to some new users that Emacs is not usable at all. If they instead have some problems with a feature they have explicitly turned on that's something else. Those who have turned it on know how to turn it off. Others don't necessarily know that. =3D=3D=3D Are there other examples? =3D=3D=3D I won't say something certain about a language I'm not a native speaker of, but I think there are similar situations. I suspect for example that Russian =D0=B8 and =D0=B9 is a similar pair, where it is *just wrong* that a search for "=D0=B8" (CYRILLIC SMALL LETTER I) also finds "=D0= =B9" (CYRILLIC SMALL LETTER SHORT I). All in all I see the need for a feature to adjust individual entries of the character folding before it ought to be turned on by default. =3D=3D=3D Are users expecting this? =3D=3D=3D Has Emacs been late implementing character folding? Is everyone expecting that now and it's important to turn it on to now seem to be out of the loop? It doesn't seem so. Eli wrote first that character folding was introduced in Emacs to give users "what the other text-editing and word-processing environments provide, what they therefore are expected to expect". I answered that for example Gedit and Firefox didn't have this feature, and then Eli wrote that I should "try more serious editing environments" like MS Word. Since then I have had opportunity to try MS Word 2013 and I couldn't find such a feature. Maybe there was such a feature I couldn't find. Maybe it had been turned off by the system administrators at my university. I don't know, but on a random web source, http://wordribbon.tips.net/T010627_Ignoring_Accented_Characters_in_Searches= .html I find it stated that MS Word (2007, 2010, and 2013) doesn't have such a feature. I don't think this is something users expect, but something that will be an example of how Emacs does things better for those who can turn it off for good results already and for the rest of us when it has become slightly more featureful. =3D=3D=3D Option menu =3D=3D=3D Also, please please add a checkbox for character folding just above or below the one for case folding in the Options menu!!