From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp2 ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms0.migadu.com with LMTPS id gEVZLK/lm2FlCAEAgWs5BA (envelope-from ) for ; Mon, 22 Nov 2021 19:47:11 +0100 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp2 with LMTPS id GDIBKK/lm2GcewAAB5/wlQ (envelope-from ) for ; Mon, 22 Nov 2021 18:47:11 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 50FED29658 for ; Mon, 22 Nov 2021 19:47:11 +0100 (CET) Received: from localhost ([::1]:42582 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mpELO-00020I-Br for larch@yhetil.org; Mon, 22 Nov 2021 13:47:10 -0500 Received: from eggs.gnu.org ([209.51.188.92]:55794) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mpEJ3-0001xP-NQ for emacs-orgmode@gnu.org; Mon, 22 Nov 2021 13:44:45 -0500 Received: from relay4-d.mail.gandi.net ([217.70.183.196]:48735) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mpEIx-000124-M1 for emacs-orgmode@gnu.org; Mon, 22 Nov 2021 13:44:45 -0500 Received: (Authenticated sender: admin@nicolasgoaziou.fr) by relay4-d.mail.gandi.net (Postfix) with ESMTPSA id B8106E0007; Mon, 22 Nov 2021 18:44:36 +0000 (UTC) From: Nicolas Goaziou To: Ihor Radchenko Subject: Re: [PATCH] Re: c47b535bb origin/main org-element: Remove dependency on =?utf-8?Q?=E2=80=98org-emphasis-regexp-components=E2=80=99?= References: <87o86mw86r.fsf@localhost> <87fsrxkahq.fsf@nicolasgoaziou.fr> <87fsrxa1j5.fsf@localhost> <878rxoa6lk.fsf@localhost> <87tug93b2a.fsf@localhost> <87y25l8wvs.fsf@nicolasgoaziou.fr> <87r1bd39ny.fsf@localhost> <8735nsv9qo.fsf@nicolasgoaziou.fr> <87mtm09xzf.fsf@localhost> <87zgq02ueq.fsf@nicolasgoaziou.fr> <87h7c89rqr.fsf@localhost> <874k86y997.fsf@nicolasgoaziou.fr> <87v90lzwkm.fsf@localhost> Date: Mon, 22 Nov 2021 19:44:35 +0100 In-Reply-To: <87v90lzwkm.fsf@localhost> (Ihor Radchenko's message of "Sun, 21 Nov 2021 17:28:57 +0800") Message-ID: <87mtlwt4h8.fsf@nicolasgoaziou.fr> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=217.70.183.196; envelope-from=mail@nicolasgoaziou.fr; helo=relay4-d.mail.gandi.net X-Spam_score_int: -25 X-Spam_score: -2.6 X-Spam_bar: -- X-Spam_report: (-2.6 / 5.0 requ) BAYES_00=-1.9, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Max Nikulin , emacs-orgmode@gnu.org Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Migadu-Flow: FLOW_IN ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1637606831; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:list-id:list-help: list-unsubscribe:list-subscribe:list-post; bh=qFg1D+w/xOk9mvL/VWUXUUGD5p8ZJMKvCaqQ6a8VGkQ=; b=UwFgbOQiUEPrFGvyuI8QTvaF0L8oQ0ilEvZTFvWhTNGEZ1A7+Mkf0zWDxqoDbPNn/dSTxd Jb0Xlo6ak/toINs/Oz9oeH1wWl+8VJkPIn1mJg5oG8vXgWCmwojJxpQ9nu1xKzyy4TZx8I J+ocQxTaAiknChR423LxOM8HhLPdrk7kjfRqHKLmVKGphHuginABosqcsyPh48YBq1CC51 8yWjZ+LjsdLl0FOU8eunq9Mi/Iggm1kDb4/hcvOVnDg+wb/LPvtsIOhEI9GovALvlIUfeD M/s8UfPa/uh4Rb4MxvXkFOjbJ+qe00dpok5dlDMrttrZ+5h5EZFm1DEmE3Emsg== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1637606831; a=rsa-sha256; cv=none; b=hz59YYDeOA+iSnncICyBF0/2XbPl2CCzZefDcyjvCILIiR4nrRMlepbb3uGADkY9uPG3BD b4ib91Tq4npSWLrxdFzSCYSz3J2SSPIPItV5kRlqU+gkBb/VnLcViNJcm5RvGUDmTYh3Mo 9lWfKUI55H9U2lVr/PHpS8V+I5t4RQfxcfAFo2k/0yySC+EX1vrRpSMBJeQCO1LuQ48Mno 1mZlTQCs2uiGckeivceYbzTrS/+OfvpGn0KFbUVAfEiVxmlVe4LucEKcxPvUzrBtrIa61L zU29xepMYWAd6ckKLD57GqAT7C/F2P82HDMeJBUBqwnMe9UO8d8JATgwKSdZdg== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of emacs-orgmode-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=emacs-orgmode-bounces@gnu.org X-Migadu-Spam-Score: -2.98 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of emacs-orgmode-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=emacs-orgmode-bounces@gnu.org X-Migadu-Queue-Id: 50FED29658 X-Spam-Score: -2.98 X-Migadu-Scanner: scn1.migadu.com X-TUID: +KEraNcAS/9x Hello, Ihor Radchenko writes: > Commit messages are also important, especially years later. I updated > the commit message in the attached new version of the patch. Note I'm not saying commit messages are not important. I just won't spend energy on the wording there. >> Thinking about it a bit more, you might be right: we may slightly change >> the closing part of the emphasis regexp, e.g.: >> >> (seq >> (not space) >> (group ,mark) >> (or (any space ?- ?') >> (and (any ?. ?, ?\; ?: ?! ?? ?\" ?\) ?\} ?\\ ?\[) (or space line-= end)) >> line-end)) >> >> The logic behind this is that in regular text, we assume usual >> punctuation rules apply. > > This will fail for "*Bold*?!" or "/Italics/!!!" Of course. Any regexp will fail somehow. > Also, is there any reason why we are not simply using punctuation > character class instead of listing punctuation chars explicitly (and > only for English)? What about "_=E4=BD=A0=E5=8F=AB=E4=BB=80=E4=B9=88=E5= =90=8D=E5=AD=97_=EF=BC=9F" > > Maybe just > > (seq > (not space) > (group ,mark) > (0+ (in punctuation)) > (or space line-end)) Historically, Org only focused on ASCII. But it makes sense to extend the allowed punctuation characters, indeed. This is orthogonal to OP's issue, however. >> My concern is that the more complicated is the rule, the more difficult >> it is to predict. Also, we introduce new corner case, e.g., >> >> Woot! I just released Org *10*.0! >> >> So, I'm not totally convinced it is worth the trouble. > > I am not sure if "Org *10*.0" is a good general example. It is probably > one of those cases when users want fine control over emphasis and must > use zero width space. This is simply the first example that crossed my mind. My point is that changing the regexp substantially may not be rewarding, ultimately. > +Sometimes, when marked text also contains the marker character itself, > +the result may be unsettling. For example, > + > +#+begin_example > +/One may expect this whole sentence to be italicized, but the > +following ~user/?variable~ contains =3D/=3D character, which effectively > +stops emphasis there./ > +#+end_example > + > +You can use zero width space to help Org sorting out the ambiguity. > +See [[*Escape Character]] for more details. LGTM! Regards, --=20 Nicolas Goaziou