From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Gregory Heytings Newsgroups: gmane.emacs.devel Subject: Re: Unicode confusables and reordering characters considered harmful, a simple solution Date: Thu, 04 Nov 2021 09:14:42 +0000 Message-ID: <7699dbfaffc44df293f3@heytings.org> References: <875ytag0hb.fsf@yahoo.com> <87zgqmd5np.fsf@mat.ucm.es> <83wnlqk3rn.fsf@gnu.org> <72dd5c2a-42c7-b12e-05ed-e93adbd89727@gmail.com> <83ilxajyhw.fsf@gnu.org> <83fssejxf8.fsf@gnu.org> <835ytajsv2.fsf@gnu.org> <831r3yjqo9.fsf@gnu.org> <83v91aibe7.fsf@gnu.org> <87o872s0wf.fsf_-_@db48x.net> <83lf25gm1j.fsf@gnu.org> <83ee7xgio2.fsf@gnu.org> <87fssdrp54.fsf@db48x.net> <831r3xgfz3.fsf@gnu.org> <87v918qx37.fsf@db48x.net> <83o870fjqg.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="37699"; mail-complaints-to="usenet@ciao.gmane.io" Cc: cpitclaudel@gmail.com, stefan@marxist.se, emacs-devel@gnu.org, Daniel Brooks , monnier@iro.umontreal.ca, yuri.v.khan@gmail.com To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Thu Nov 04 10:15:26 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1miYqE-0009bO-7T for ged-emacs-devel@m.gmane-mx.org; Thu, 04 Nov 2021 10:15:26 +0100 Original-Received: from localhost ([::1]:50424 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1miYqC-0006dq-Ml for ged-emacs-devel@m.gmane-mx.org; Thu, 04 Nov 2021 05:15:24 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:59842) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1miYpf-0005wf-B1 for emacs-devel@gnu.org; Thu, 04 Nov 2021 05:14:51 -0400 Original-Received: from heytings.org ([95.142.160.155]:52460) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1miYpd-0002gS-Gb; Thu, 04 Nov 2021 05:14:50 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=heytings.org; s=20210101; t=1636017283; bh=zH336h+IIoNNwrY3g1uuK8BTY3KGx1PfEVLW7i6BhsI=; h=Date:From:To:cc:Subject:In-Reply-To:Message-ID:References:From; b=NoSM+XENZL0TQpiS+mqOOjMd2kFMExyFcM0t/Fv/0IAE+MS/ral0FKQIiI/6pjv5u mwueA/J585ht54pn2EgUevA7kEz771McFllQiQ5F0PrRaKhrEARJeDrzYgXHJzTFXj 8AHha5jiHBqsV9y/A0T8AvXQfpYerUekPliuS+ReK4z0/aH7IbGkC2W5aHi13Y3Ymr jquGcvGxo3gvKAL9IBTLjf82rdJr3CyAz8kqU252YRVS5XaECeu+LCJS0FVVkKDnxK muA8TtNQJ4SV7Ea2s+hVMwVlv/x+8KO8s0q0ymxKtJ6iYAZxL4FKb1nJGfI2NIjJcq g1lxPp+bQ9R0w== In-Reply-To: <83o870fjqg.fsf@gnu.org> Received-SPF: pass client-ip=95.142.160.155; envelope-from=gregory@heytings.org; helo=heytings.org X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:278663 Archived-At: > > The mere presence of these characters is NOT the root cause. These > characters are legitimate and helpful when used as intended. See > TUTORIAL.he for a pertinent example. > But TUTORIAL.he is not a pertinent example, because it's not a file with source code. It's a pertinent example to show that these characters do have legitimate uses, which is obvious. If you could find an actual source code file in an actual project in which these characters are used with their intended purpose, it would be a pertinent example. Otherwise it is safe and reasonable to assume (as the Rust developers did) that the mere presence of these characters in source code files is a potential problem and must be flagged as such.