From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Richard Stallman Newsgroups: gmane.emacs.devel Subject: Re: On language-dependent defaults for character-folding Date: Mon, 22 Feb 2016 12:59:03 -0500 Message-ID: References: <83pow26svf.fsf@gnu.org> <87a8n5srbp.fsf@wanadoo.es> <83d1s17npz.fsf@gnu.org> <87oablfpn3.fsf@mail.linkov.net> <834mdd6llx.fsf@gnu.org> <7fbb8bc7-9a97-4bad-a103-a6690a35241d@default> <834mdc5w6o.fsf@gnu.org> <838u2hu6aq.fsf@gnu.org> <871t899tde.fsf@gnus.org> <83y4ahru04.fsf@gnu.org> <83fuwproyf.fsf@gnu.org> <837fi0sz29.fsf@gnu.org> <83egc8qzjh.fsf@gnu.org> <87egc7evu3.fsf@gnus.org> <83io1jpt4u.fsf@gnu.org> <87povqhj25.fsf@gnus.org> <83povqm3dw.fsf@gnu.org> <83io1im0zq.fsf@gnu.org> Reply-To: rms@gnu.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1456163991 26037 80.91.229.3 (22 Feb 2016 17:59:51 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 22 Feb 2016 17:59:51 +0000 (UTC) Cc: larsi@gnus.org, lokedhs@gmail.com, emacs-devel@gnu.org To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Feb 22 18:59:43 2016 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1aXulk-0002MM-2Z for ged-emacs-devel@m.gmane.org; Mon, 22 Feb 2016 18:59:36 +0100 Original-Received: from localhost ([::1]:50949 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aXulj-0005Vw-Mh for ged-emacs-devel@m.gmane.org; Mon, 22 Feb 2016 12:59:35 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:39859) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aXulR-0005TZ-W9 for emacs-devel@gnu.org; Mon, 22 Feb 2016 12:59:18 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aXulR-0006qT-2C for emacs-devel@gnu.org; Mon, 22 Feb 2016 12:59:17 -0500 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:41423) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aXulE-0006a4-IV; Mon, 22 Feb 2016 12:59:04 -0500 Original-Received: from rms by fencepost.gnu.org with local (Exim 4.82) (envelope-from ) id 1aXulD-0008MK-Ki; Mon, 22 Feb 2016 12:59:03 -0500 In-reply-to: <83io1im0zq.fsf@gnu.org> (message from Eli Zaretskii on Sun, 21 Feb 2016 19:23:05 +0200) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:200472 Archived-At: [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > I don't think this is correct. I think ö is a letter on its own in > any language that uses it. Which is why I don't see how it is > different from ø. Users seem to disagree on whether to fold diacritics that make different letters (ñ, ç, polish l with slash) or only those that modify a single letter (as á, à, â in French). I think that we should have a user option which controls this and only this. That means we should have two levels of folding group definitions: the smaller groups which hold variants of the same letter, and the bigger groups which hold similar letters. These groups need to depend on the language setting. In English (and in French), ö is a modified o. In Swedish (and German, I think), ö and o are different letters. I think that each folding group should specify one character that is the base. This is because users also seem to disagree on what it should mean to specify a non-base letter in the search string. Some plausible meanings are * Find that one and only that one. * Treat it the same as specifying the base letter. There should be a user option to choose between those two (and maybe some other behaviors for a non-base letter in the search string). -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html.