From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Drew Adams Newsgroups: gmane.emacs.devel Subject: RE: Change of Lisp syntax for "fancy" quotes in Emacs 27? Date: Sat, 3 Feb 2018 17:55:58 -0800 (PST) Message-ID: <17a728ea-a83a-45db-8c80-9d35ff7b1b02@default> References: <83o9l6bhfs.fsf@gnu.org> <1fedc60d-35a7-4ff0-adbb-b6b8306d192f@default> <83wozu9f6r.fsf@gnu.org> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: blaine.gmane.org 1517709287 2759 195.159.176.226 (4 Feb 2018 01:54:47 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Sun, 4 Feb 2018 01:54:47 +0000 (UTC) Cc: emacs-devel@gnu.org, npostavs@users.sourceforge.net To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Feb 04 02:54:42 2018 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ei9Vh-0007ob-4r for ged-emacs-devel@m.gmane.org; Sun, 04 Feb 2018 02:54:25 +0100 Original-Received: from localhost ([::1]:46132 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ei9Xh-0000ag-S1 for ged-emacs-devel@m.gmane.org; Sat, 03 Feb 2018 20:56:29 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:57709) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ei9XU-0000aD-4t for emacs-devel@gnu.org; Sat, 03 Feb 2018 20:56:18 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ei9XS-0006Dy-AB for emacs-devel@gnu.org; Sat, 03 Feb 2018 20:56:16 -0500 Original-Received: from userp2120.oracle.com ([156.151.31.85]:56864) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1ei9XM-0006BY-LE; Sat, 03 Feb 2018 20:56:08 -0500 Original-Received: from pps.filterd (userp2120.oracle.com [127.0.0.1]) by userp2120.oracle.com (8.16.0.22/8.16.0.22) with SMTP id w141q97P095670; Sun, 4 Feb 2018 01:56:03 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=mime-version : message-id : date : from : sender : to : cc : subject : references : in-reply-to : content-type : content-transfer-encoding; s=corp-2017-10-26; bh=gt/oSTpz+P9Jzghw23QY7/wYDBKL4Ok001U/l1q1iDk=; b=TlVD8q7mqTlAuS7C49KM889/7UAZLp5BS8ceHXQkxzMxOyMK7+JxEGmtnxsFDHFiv1Gt IC3u0NQIOgXo4xn2KUCb3KmARCxvveysWYkOkY0dcuCAfXfNFlgPsftCUQ4dnUupp2fg 4Np3LClaR3KIpwWgebZLKDbKxMz5iZsWhP6uUSReSYMvGXxTB/fmS/nwNetY9UzMEbVH Y2fokAmjjoGeS3wwfYANak5kkEYwHk6gaC59pIWjGUebEZzm/Jkkwl6CFgcfOUHWAgfH 83vw5ylKtLDTd23v1hdK6965SLPhXBk6PwmKeyPeKJsvWAjB1ieTzmW/Iix3KvWm93c+ cQ== Original-Received: from userv0021.oracle.com (userv0021.oracle.com [156.151.31.71]) by userp2120.oracle.com with ESMTP id 2fw5qx9kb6-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Sun, 04 Feb 2018 01:56:03 +0000 Original-Received: from userv0121.oracle.com (userv0121.oracle.com [156.151.31.72]) by userv0021.oracle.com (8.14.4/8.14.4) with ESMTP id w141u27c007934 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Sun, 4 Feb 2018 01:56:03 GMT Original-Received: from abhmp0001.oracle.com (abhmp0001.oracle.com [141.146.116.7]) by userv0121.oracle.com (8.14.4/8.13.8) with ESMTP id w141u0Ne027648; Sun, 4 Feb 2018 01:56:02 GMT In-Reply-To: <83wozu9f6r.fsf@gnu.org> X-Priority: 3 X-Mailer: Oracle Beehive Extensions for Outlook 2.0.1.9.1 (1003210) [OL 16.0.4639.0 (x86)] X-Proofpoint-Virus-Version: vendor=nai engine=5900 definitions=8794 signatures=668662 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1711220000 definitions=main-1802040023 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x [generic] [fuzzy] X-Received-From: 156.151.31.85 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:222479 Archived-At: > Those bug reports complained about obscure error messages that are > unhelpful when a Lisp programmer tries to figure out the root cause. > I'm saying that we should find an alternative way of making clear, > helpful error messages in those special cases where characters which > display similarly might make the error message confusing if it just > cites the symbol's name. OK. Except I would say warnings, not error messages, at least in most cases. But even if we have an error message, that's not a call to change the syntax of Lisp. User errors happen. We should just want to help users avoid making such errors. > For example, suppose you have a Lisp program that produces > the following error message when compiled/executed: >=20 > Symbol's value as variable is void: '=D0=B0bbrevs-changed >=20 > You then type "C-h v abbrevs-changed RET" and get the expected result, > meaning that the variable is known to Emacs. How quickly will you be > able to spot the cause of the error message? Some people will wonder for a while. Others, perhaps already bitten by this gotcha, will notice the quote mark there right away. One thing that would help, I think, and which should be done in general, would be to put the offending thingie between `...': Symbol's value as variable is void: `'=D0=B0bbrevs-changed' That makes it more obvious that the symbol name includes that fancy quote char. Still, all of this is pilot error, where "pilot" can include the user who wrote the code but more likely means a user who copy+pasted it. > The change that got reverted from the emacs-26 branch was about a > similar case, but for a character that's much more important for Lisp > than 'a': it's about the character used to quote symbol names. But > the essence is the same: due to how characters are displayed, some > characters can be confused for others. >=20 > We want to find a way of identifying such situation and telling the > Lisp programmer about that in clear and easily understandable ways. > One way, perhaps too radical one, is to reject such "confusable" > characters outright. We could decide that we don't want such a > radical solution, but that doesn't mean we should give up on the > attempt to find some other solution for the problem. Neither does it > mean we should proclaim people who installed the change as enemies of > the society. Agreed. As I've said, I'm in favor of providing friendly warnings/reminders that point out that such a character is present. I think that should be enough. There are lots of potential confusables, and lots of different use contexts. But if we start with just one or two such chars and one or two common and clear contexts where a warning might help, that would be good. We can always add more such warnings as cases come up (get reported or otherwise become obvious). It would be an overreaction, IMO, to jump to changing the existing Lisp syntax to raise errors when someone uses such a character in, say, a symbol name. We should not require such chars to be escaped in a symbol name. Such chars have no special meaning for Lisp (unlike `.', `,' `'', ``', `(', `)', `[', `]', `"', `<', `>', `#' `;', and perhaps some more). > > Bug #23425, on the other hand, is a gigantic > > stream-of-consciousness about anything and > > everything [...] > > [...] > > How is it helpful to throw all of #23425 into > > this Lisp syntax-change question, as if the > > present issue puts into question everything > > ever discussed about curly quotes? >=20 > I could turn the table and ask you how is it helpful > to dump on us all your random thoughts about this, > instead of simply saying you didn't understand the > relevance and asking for more explanations. Which I > just provided. Whoa! I don't see a connection between the current issue and the many things discussed in #23425. And I don't think I dumped any random thoughts on anyone. > I hope now the issue is clear enough. No idea what your point is there. If there is some part of bug #23425 that you think is relevant here, and you think it will be UNfixed by reverting the Lisp-syntax change made for bug #30217, please tell us what that part is. I don't see anything in #23425 that needs the change in Lisp syntax made for #30217. And I don't see that Lisp change being necessary to fix #30217 either. It wasn't requested by the bug filer, AFAIK. Same for the other bugs you mentioned. The filers just asked for warnings, AFAICT. > > And the original error message from bug #23425 > > is _more_ meaningful and helpful, not less, > > than the new one after the "fix". > > I think you are so eager to make your point that you are willing to > claim that black is white and vice versa. Any objective person would > agree that the new error message is more directly pointing to the root > cause, which is the syntax of specifying a quoted symbol name using a > "strange quote". If we are good in writing and indexing our ELisp > manual, then I'd expect to find there an index entry for "strange > quote", which will land me where this issue is explained. Case > closed. We can perhaps agree to disagree about that. But of course if you say the case is closed then it's closed. > Once again, I can agree that this measure might be too harsh, but I > would still like to see clear diagnostics of such typos, and like > Paul, I thing we should take our inspiration from the Unicode > Standard's notion of "confusables". I've agreed about that from the beginning. It can be helpful to warn users about possible confusion when they use confusables. And I agree that clear diagnostics are needed - that was one of my points. That's different from changing the syntax of Lisp. > Ideas and proposals for patches along those lines > are welcome. Ditto. > Ignoring the problem, or trying to convince us > that it doesn't exist, is not. I recognize the problems of confusable characters. Not all such possible confusions are equally likely, in practice. Recognizing contexts where something might well be a typo, and warrants a helpful reminder/warning, is what's needed - case by case. What's not needed, IMO (and probably the only place where I differ from you on this, even if you don't want to recognize it) is a change in Lisp syntax, making it a read error not to escape such a character. > > Either doing nothing or trying to warn about such > > gotchas is right. Changing Lisp syntax here is > > not right. >=20 > Doing nothing would be ignoring the problem. Yes. It's maybe not the best help for users, but it would be one way to handle those few reports of confusion. We get a lot more questions due to other confusions wrt Lisp than we do such questions due to confusing one char for another. I didn't, and don't, say that doing nothing is the best approach. I said it's one way to deal with such reports. Unlike changing Lisp syntax, it at least doesn't introduce new problems. > That changing Lisp syntax is not right is your > opinion: legitimate, but clearly not shared by at > least some. That's why we're having this discussion. I have yet to hear a reason why it is right to change Lisp syntax for this - why a simple warning is not sufficient and we need to also make Lisp raise an error. > > Lisp doesn't have a bug here. >=20 > That's a strawman, and you know it. We are talking > about diagnostics for bugs in Lisp programs. I have no objection to diagnostics. Add warnings for byte-compilation, loading, whatever. Make sure the warnings are clear. Say, for instance that a curly quote was used in sexp `...'. Don't just say that invalid syntax was read (somewhere). Clearly pointing out the confusable char in the possibly confused sexp should go a long way to making things clear. My objection is to making such chars be escaped to prevent Lisp from raising an error. I don't put `a=E2=80=99b' in the same class as, say, `a,b'. `,' is special in Lisp, and (setq a,b 42) should (and does) raise an error. `=E2=80=99' is not special in Lisp, and (setq a=E2=80=99b 42) should not raise an error (IMO). Likewise, (setq ,b 42) (yes) and (setq =E2=80=99b 42) (no). If you want to argue for this syntax change, why not address some of my arguments against it? Where will you draw the line, for instance? There are _lots_ of possible confusables. I'd say start with only the few that have actually been reported (is there only one reported?), trying to come up with reasonable warnings in particular contexts of use. That would be a good start. We might even have a user option that lists the confusables to check/warn for, with whatever default value people here think is best (it might be only `=E2=80=99', to start with - or both left and right curly quotes). Are you thinking instead (since both you and Paul mentioned the Unicode list of confusables) of starting with _all_ characters in that list? http://www.unicode.org/Public/security/8.0.0/confusables.txt I won't argue about which chars should be warned about, though I might be interested to see what contexts we warn for and what the messages will be. My objection is not about detecting this or that use of this or that character and warning/reminding users about it. My objection is to making Lisp require escaping of such characters. That's all. I think I've made that as clear as I possibly can. But you seem to want to paint my objection as being against helping users know about accidental use of confusables, e.g., `=E2=80=99' instead of `''. Why?