From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Benjamin Riefenstahl Newsgroups: gmane.emacs.devel Subject: Re: Unicode confusables and reordering characters considered harmful, a simple solution Date: Sat, 06 Nov 2021 14:58:31 +0100 Message-ID: <87mtmhwflk.fsf@turtle-trading.net> References: <83v91aibe7.fsf@gnu.org> <87o872s0wf.fsf_-_@db48x.net> <83lf25gm1j.fsf@gnu.org> <83ee7xgio2.fsf@gnu.org> <87fssdrp54.fsf@db48x.net> <831r3xgfz3.fsf@gnu.org> <87v918qx37.fsf@db48x.net> <83o870fjqg.fsf@gnu.org> <87k0hnqr1v.fsf@db48x.net> <83ee7vdped.fsf@gnu.org> <83a6ijdnzv.fsf@gnu.org> <834k8qer8j.fsf@gnu.org> <831r3uelbn.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="26879"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Sat Nov 06 14:59:13 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mjMDx-0006nQ-Ix for ged-emacs-devel@m.gmane-mx.org; Sat, 06 Nov 2021 14:59:13 +0100 Original-Received: from localhost ([::1]:39308 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mjMDw-0003Iu-Jt for ged-emacs-devel@m.gmane-mx.org; Sat, 06 Nov 2021 09:59:12 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:57696) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mjMDN-0000FL-PV for emacs-devel@gnu.org; Sat, 06 Nov 2021 09:58:37 -0400 Original-Received: from odoacer.turtle-trading.net ([93.241.193.16]:39632) by eggs.gnu.org with esmtps (TLS1.2:DHE_RSA_AES_128_CBC_SHA1:128) (Exim 4.90_1) (envelope-from ) id 1mjMDL-0006sX-Lr for emacs-devel@gnu.org; Sat, 06 Nov 2021 09:58:37 -0400 Original-Received: from zenobia.turtle-trading.net ([192.168.2.111]) by odoacer.turtle-trading.net with esmtp (Exim 4.80) (envelope-from ) id 1mjMDI-0002Cd-10; Sat, 06 Nov 2021 14:58:32 +0100 Original-Received: from benny by zenobia.turtle-trading.net with local (Exim 4.94.2) (envelope-from ) id 1mjMDH-000cZj-PJ; Sat, 06 Nov 2021 14:58:31 +0100 In-Reply-To: <831r3uelbn.fsf@gnu.org> (Eli Zaretskii's message of "Fri, 05 Nov 2021 16:19:56 +0200") Received-SPF: none client-ip=93.241.193.16; envelope-from=benny@turtle-trading.net; helo=odoacer.turtle-trading.net X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_NONE=0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:278872 Archived-At: Eli Zaretskii writes: > The Unicode Bidirectional Algorithm (UBA) mandates > (https://unicode.org/reports/tr9/#X8): > > X8. All explicit directional embeddings, overrides and isolates are > completely terminated at the end of each paragraph. > > [...] > > So when the UBA says "at the end of each paragraph", it means in > practice at EOL, since all the other paragraph separators are rarely > if ever used in human-readable text. (And Emacs, of course, > implements that rule.) Should the end of a comment or string in source code then also qualify as the end of a paragraph in this sense?