From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Drew Adams Newsgroups: gmane.emacs.devel Subject: RE: Single quotes in Info Date: Thu, 29 Jan 2015 08:24:02 -0800 (PST) Message-ID: <6164d89d-23ac-46bf-9f84-154cc0e6c6e4@default> References: <87twzhgk84.fsf@wmi.amu.edu.pl> <83lhksshdm.fsf@gnu.org> <9ee0c895-a178-40e1-b1c8-ed2b97071c6b@default> <87h9vgglkz.fsf@wmi.amu.edu.pl> <83h9vcp0bq.fsf@gnu.org> <83y4onorcc.fsf@gnu.org> <83vbjrnd1f.fsf@gnu.org> <83386untcd.fsf@gnu.org> <83vbjpmv4w.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1422548672 28172 80.91.229.3 (29 Jan 2015 16:24:32 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 29 Jan 2015 16:24:32 +0000 (UTC) Cc: bruce.connor.am@gmail.com, emacs-devel@gnu.org To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Jan 29 17:24:31 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YGrtN-00046H-Dg for ged-emacs-devel@m.gmane.org; Thu, 29 Jan 2015 17:24:29 +0100 Original-Received: from localhost ([::1]:60649 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YGrtM-0004GH-Tr for ged-emacs-devel@m.gmane.org; Thu, 29 Jan 2015 11:24:28 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:60424) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YGrtG-0004G9-M2 for emacs-devel@gnu.org; Thu, 29 Jan 2015 11:24:25 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YGrt8-0004FQ-NL for emacs-devel@gnu.org; Thu, 29 Jan 2015 11:24:22 -0500 Original-Received: from userp1040.oracle.com ([156.151.31.81]:43598) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YGrt2-0004Dv-1y; Thu, 29 Jan 2015 11:24:08 -0500 Original-Received: from ucsinet21.oracle.com (ucsinet21.oracle.com [156.151.31.93]) by userp1040.oracle.com (Sentrion-MTA-4.3.2/Sentrion-MTA-4.3.2) with ESMTP id t0TGO5Zm028692 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK); Thu, 29 Jan 2015 16:24:06 GMT Original-Received: from aserz7021.oracle.com (aserz7021.oracle.com [141.146.126.230]) by ucsinet21.oracle.com (8.14.4+Sun/8.14.4) with ESMTP id t0TGO3sw005637 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL); Thu, 29 Jan 2015 16:24:05 GMT Original-Received: from abhmp0018.oracle.com (abhmp0018.oracle.com [141.146.116.24]) by aserz7021.oracle.com (8.14.4+Sun/8.14.4) with ESMTP id t0TGO25F026033; Thu, 29 Jan 2015 16:24:02 GMT In-Reply-To: <83vbjpmv4w.fsf@gnu.org> X-Priority: 3 X-Mailer: Oracle Beehive Extensions for Outlook 2.0.1.8.2 (807160) [OL 12.0.6691.5000 (x86)] X-Source-IP: ucsinet21.oracle.com [156.151.31.93] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.4.x-2.6.x [generic] X-Received-From: 156.151.31.81 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:181993 Archived-At: > > > I'm not at all sure we should compare a and =C3=A1 equal. It's an > > > additional feature anyway. > > > > I get the impression that you are talking only about a built-in > > (more or less hard-coded, predefined) set of equivalence classes > > of chars, whatever that set might be defined as. >=20 > We certainly should have predefined equivalence support based on the > Unicode Standard's recommendations. That is the state of the art > these days, and any respectable text editor should include such > support. >=20 > > Is that right, or would users be able to define the equivalence > > classes you are thinking of? >=20 > We should first provide users with a set of sensible optional > behaviors that they are likely to expect in various situations. Each > option will invoke a certain predefined behavior, such as whether or > not equivalence classes are at all considered, whether or not a and =C3= =A1 > compare equal, etc. There are important use cases for each one of > those, exactly like there important use cases for both case-sensitive > and case-insensitive search. >=20 > Once we have that in place, we can add user-defined additions. I > expect them to be relatively minor and mostly mode-specific, such as > the special treatment of quotes and other special characters in Info > buffers. Why minor? because Unicode already thought out and defined > almost any imaginable feature in this regard, so chances that some > user might need something in addition are small. >=20 > Mode-specific additions could be just alists that map characters or > strings to their equivalents. Since I don't expect those to become > large, there's no need for anything fancier, IMO. Glad to see all of those specific replies. It all sounds good to me, including the proposed development priorities. > > If they would not then a separate but desirable (IMO) feature > > would be for users to be able to easily define their own such > > equivalence classes. >=20 > I wouldn't call them equivalence classes. Users are not expected to > be experts in Unicode features, its various data tables, and their > implementation in Emacs. We should instead provide easy-to-customize > option variables to select out of an array of predefined features > based on Unicode tables we already have. User additions should be > some simple data structure that don't require any special expertise. I don't care what you call them. In the interest of brevity I also did not explicitly mention the possibility of associating multiple-char sequences with other such or with single chars (e.g., associating "=3D>" with =E2=87=92 or "ss" with =C3=9F, though those two would presumably be pr= edefined). To me, each set of such associations constitutes an equivalence class, but I don't care what nomenclature is used to describe it, as long as it is clear. My point was for users to eventually be able to specify their own such associations, in addition to those (e.g. Unicode) that would be predefined. And it would be good to be able to use these not only for search but also for easy replacement (in either direction of such an equivalence), etc. E.g., have easy access to such pairs via `M-%' - be able to input one of such a class (char or char sequence) and then pick from its defined equivalences for the replacement.