From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Juri Linkov Newsgroups: gmane.emacs.bugs Subject: bug#31796: 27.1; dired-do-find-regexp-and-replace fails to find multiline regexps Date: Wed, 16 Dec 2020 22:32:56 +0200 Organization: LINKOV.NET Message-ID: <873605mstj.fsf@mail.linkov.net> References: <10120030-8b8d-b702-add4-8f099f934ed5@chalmers.se> <831rgivl7l.fsf@gnu.org> <83lfequ30g.fsf@gnu.org> <83a6v6tss9.fsf@gnu.org> <08c0bbce-051e-7a49-106a-d6d0629b2224@yandex.ru> <87blffns95.fsf@mail.linkov.net> <8c124412-3bb3-fd92-4c3b-da4b3a8bdcac@yandex.ru> <87blfec4l3.fsf@mail.linkov.net> <00d1c8ef-5601-6445-199e-1590ddfae9e9@yandex.ru> <87eek2902v.fsf@mail.linkov.net> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="39910"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (x86_64-pc-linux-gnu) Cc: abela@chalmers.se, 31796@debbugs.gnu.org To: Dmitry Gutov Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Wed Dec 16 22:08:22 2020 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kpe22-000AIW-4A for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 16 Dec 2020 22:08:22 +0100 Original-Received: from localhost ([::1]:34416 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kpe21-0004EW-1h for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 16 Dec 2020 16:08:21 -0500 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:46274) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kpdzm-00022F-Lp for bug-gnu-emacs@gnu.org; Wed, 16 Dec 2020 16:06:03 -0500 Original-Received: from debbugs.gnu.org ([209.51.188.43]:50707) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kpdzm-0006Vv-EY for bug-gnu-emacs@gnu.org; Wed, 16 Dec 2020 16:06:02 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1kpdzm-00049D-AD for bug-gnu-emacs@gnu.org; Wed, 16 Dec 2020 16:06:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Juri Linkov Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 16 Dec 2020 21:06:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 31796 X-GNU-PR-Package: emacs Original-Received: via spool by 31796-submit@debbugs.gnu.org id=B31796.160815271015833 (code B ref 31796); Wed, 16 Dec 2020 21:06:02 +0000 Original-Received: (at 31796) by debbugs.gnu.org; 16 Dec 2020 21:05:10 +0000 Original-Received: from localhost ([127.0.0.1]:34009 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kpdyw-00047I-8l for submit@debbugs.gnu.org; Wed, 16 Dec 2020 16:05:10 -0500 Original-Received: from relay10.mail.gandi.net ([217.70.178.230]:43105) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kpdyu-00046m-I8 for 31796@debbugs.gnu.org; Wed, 16 Dec 2020 16:05:09 -0500 Original-Received: from mail.gandi.net (m91-129-99-98.cust.tele2.ee [91.129.99.98]) (Authenticated sender: juri@linkov.net) by relay10.mail.gandi.net (Postfix) with ESMTPSA id 58FC8240004; Wed, 16 Dec 2020 21:04:59 +0000 (UTC) In-Reply-To: (Dmitry Gutov's message of "Wed, 16 Dec 2020 05:00:33 +0200") X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:196224 Archived-At: >> Another backup plan is to use ripgrep. Its multiline handling with -U >> also allows to search words ignoring any whitespace, even newlines. >> This is like isearch-lax-whitespace using search-whitespace-regexp >> when it contains a newline, e.g. "[ \t\r\n]+". > > Right. It has a problem of its own, though: it still outputs a file name > per line, even when a match is spread across several lines (unlike > pcregrep). So we're left guessing where a given multiline match ends. > > Also, 'sort' doesn't seem to be able to treat both : and \0 as separators > at the same time. > > Here's a rough patch, for illustration. Thanks, now finally it's possible to search text ignoring whitespace between words, for example: Find regexp: file[ ]+names finds everything correctly, even though current implementation maybe not the most elegant. > It's kind of working, but I'm not loving it. What do you think about using the option `rg --json`? Emacs has the fast JSON parsing library now, so using JSON output would be more reliable.