From mboxrd@z Thu Jan 1 00:00:00 1970 From: Robert Klein Subject: Re: Recent regression in Org master with parsing of verbatim code with spaces Date: Wed, 11 Oct 2017 10:12:23 +0200 Message-ID: <20171011101223.25017f72@lt70.mpip-mainz.mpg.de> References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:52763) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1e2C88-0005ii-TR for emacs-orgmode@gnu.org; Wed, 11 Oct 2017 04:12:42 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1e2C83-0003q4-QV for emacs-orgmode@gnu.org; Wed, 11 Oct 2017 04:12:40 -0400 Received: from mout.kundenserver.de ([212.227.126.135]:55715) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1e2C83-0003p3-GY for emacs-orgmode@gnu.org; Wed, 11 Oct 2017 04:12:35 -0400 In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: "Emacs-orgmode" To: Reuben Thomas Cc: emacs-org list , Kaushal Modi Hi, On Wed, 11 Oct 2017 00:47:04 +0100 Reuben Thomas wrote: > On 10 October 2017 at 18:29, Kaushal Modi > wrote: >=20 > > On Tue, Oct 10, 2017 at 1:08 PM Kaushal Modi > > wrote: > > =20 > >> I scanned through recent commits, but haven't yet figured out which > >> commit affected this. Before I end up with git bisect, does anyone > >> know a master branch commit that could have caused this? > >> =20 > > > > Hello, > > > > I found the commit causing this change: > > > > http://orgmode.org/cgit.cgi/org-mode.git/commit/?id=3D > > 8383f7e08b964e129b175ef1c0297b1f4a8ed235 > > =20 >=20 > =E2=80=8BI haven't seen the rest of this thread, but at a guess is the > problem to do with [:space:] matching vertical space such as newline > and vertical tab? Fixing that is a bit painful, since as far as I > know there's no regex for "horizontal space"; I guess it would be a > matter of enumerating a suitable range of code points.=E2=80=8B I'm fiddling with org-emphasis-regexp-components in my .emacs. I put all horizontal whitespace characters (Wikipedia: "whitespace character" with Unicode character property WSpace=3D"y") in a string and concat org-emphasis-regexp-components. Note, I include some extra characters for pre, post, and borders (typographical double quotes). (setq horizontal-whitespace (string (cdr (assoc-string "CHARACTER TABULATION" (ucs-names))) (cdr (assoc-string "SPACE" (ucs-names))) (cdr (assoc-string "NO-BREAK SPACE" (ucs-names))) (cdr (assoc-string "OGHAM SPACE MARK" (ucs-names))) (cdr (assoc-string "EN QUAD" (ucs-names))) (cdr (assoc-string "EM QUAD" (ucs-names))) (cdr (assoc-string "EN SPACE" (ucs-names))) (cdr (assoc-string "EM SPACE" (ucs-names))) (cdr (assoc-string "THREE-PER-EM SPACE" (ucs-names))) (cdr (assoc-string "FOUR-PER-EM SPACE" (ucs-names))) (cdr (assoc-string "SIX-PER-EM SPACE" (ucs-names))) (cdr (assoc-string "FIGURE SPACE" (ucs-names))) (cdr (assoc-string "PUNCTUATION SPACE" (ucs-names))) (cdr (assoc-string "THIN SPACE" (ucs-names))) (cdr (assoc-string "HAIR SPACE" (ucs-names))) (cdr (assoc-string "NARROW NO-BREAK SPACE" (ucs-names))) (cdr (assoc-string "MEDIUM MATHEMATICAL SPACE" (ucs-names))) (cdr (assoc-string "IDEOGRAPHIC SPACE" (ucs-names))))) (setq org-emphasis-regexp-components `(,(concat horizontal-whitespace "('\"{=E2=80=9E=E2=80=9C") ,(concat "-" horizontal-whitespace ".,:!?;'\")}\\[=E2=80=9C=E2=80=9D") ,(concat horizontal-whitespace "[:space:],\"'=E2=80=9E=E2=80=9C=E2=80=9D") "." 10)) Best regards Robert