From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#31796: 27.1; dired-do-find-regexp-and-replace fails to find multiline regexps Date: Wed, 02 Dec 2020 19:39:48 +0200 Message-ID: <83zh2wf6or.fsf@gnu.org> References: <10120030-8b8d-b702-add4-8f099f934ed5@chalmers.se> <831rgivl7l.fsf@gnu.org> <83lfequ30g.fsf@gnu.org> <83a6v6tss9.fsf@gnu.org> <08c0bbce-051e-7a49-106a-d6d0629b2224@yandex.ru> <83k0u1il6c.fsf@gnu.org> <83blfcgst9.fsf@gnu.org> Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="18285"; mail-complaints-to="usenet@ciao.gmane.io" Cc: abela@chalmers.se, rms@gnu.org, 31796@debbugs.gnu.org To: Dmitry Gutov Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Wed Dec 02 18:44:13 2020 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kkWAn-0004cC-40 for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 02 Dec 2020 18:44:13 +0100 Original-Received: from localhost ([::1]:35794 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kkWAm-0008W4-6K for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 02 Dec 2020 12:44:12 -0500 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:37066) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kkW7i-0006Zl-Su for bug-gnu-emacs@gnu.org; Wed, 02 Dec 2020 12:41:02 -0500 Original-Received: from debbugs.gnu.org ([209.51.188.43]:53929) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kkW7i-0002dt-KL for bug-gnu-emacs@gnu.org; Wed, 02 Dec 2020 12:41:02 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1kkW7i-0004tc-HY for bug-gnu-emacs@gnu.org; Wed, 02 Dec 2020 12:41:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 02 Dec 2020 17:41:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 31796 X-GNU-PR-Package: emacs Original-Received: via spool by 31796-submit@debbugs.gnu.org id=B31796.160693080918748 (code B ref 31796); Wed, 02 Dec 2020 17:41:02 +0000 Original-Received: (at 31796) by debbugs.gnu.org; 2 Dec 2020 17:40:09 +0000 Original-Received: from localhost ([127.0.0.1]:37242 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kkW6q-0004sJ-LL for submit@debbugs.gnu.org; Wed, 02 Dec 2020 12:40:08 -0500 Original-Received: from eggs.gnu.org ([209.51.188.92]:46400) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kkW6o-0004ri-KJ for 31796@debbugs.gnu.org; Wed, 02 Dec 2020 12:40:07 -0500 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:54369) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kkW6h-0002Mg-Bm; Wed, 02 Dec 2020 12:39:59 -0500 Original-Received: from [176.228.60.248] (port=1670 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1kkW6Z-00061h-1B; Wed, 02 Dec 2020 12:39:51 -0500 In-Reply-To: (message from Dmitry Gutov on Wed, 2 Dec 2020 19:17:06 +0200) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:194818 Archived-At: > Cc: abela@chalmers.se, 31796@debbugs.gnu.org > From: Dmitry Gutov > Date: Wed, 2 Dec 2020 19:17:06 +0200 > > On 02.12.2020 16:56, Eli Zaretskii wrote: > > The point is that our heuristics for detecting encoding is not > > perfect, so it could fail. > > Do you imagine Grep could use a more reliable detection algorithm? No, I don't. But it could allow the user to specify a different encoding for each file, as in grep --encoding=FOO FILES1* --encoding=BAR FILES2* etc. And even if it just did the job of the same quality as we do, it will do it faster, which is why we use Grep in the first place, right? The important part of the "enhancement" I described is actually the fact that the output gets encoded in a single encoding, no matter what was the encoding of the original files. This makes reading and decoding the output simple and always correct. > Although... since it has to scan the full file anyway, it could first do > a quick detection, and then maybe rescan from the beginning if the > encoding turns out to be something else. That'd be too late, as some matches were already output. Grep does begin by scanning a small portion of the file (at least it did, back when I was familiar with its code), so detection in the same style as Emacs does should be a natural addition, I think.