unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
From: Thibaut Verron <thibaut.verron@gmail.com>
To: Joost Kremers <joostkremers@fastmail.fm>
Cc: help-gnu-emacs <help-gnu-emacs@gnu.org>
Subject: Re: Regex to match lines with a specific number of words
Date: Sat, 23 Apr 2022 22:58:48 +0200	[thread overview]
Message-ID: <CAFsi02Q7mAMkn2pVei2YfbaN0bk5fQ1xaGicxPE=Bi5bTKnYUQ@mail.gmail.com> (raw)
In-Reply-To: <87czh7ttzt.fsf@fastmail.fm>

Hi,

The group [:space:] also matches newline characters. So your search has
exactly one match, spanning many lines.
You can use [:blank:] instead to match spaces and tabs only, for the
separator.

It's probably better to keep [^[:space:]] for the first group, you wouldn't
want to start matching newlines there.

Best wishes,
Thibaut

Le sam. 23 avr. 2022 à 22:39, Joost Kremers <joostkremers@fastmail.fm> a
écrit :

> Hi all,
>
> I've been trying to come up with a regex that will match any line
> containing at
> least 30 words in order to kill them from the buffer (preferably with
> `kill-matching-lines`, because I need to move the lines to another buffer.)
>
> Frustratingly enough, I have not been successful. Since "word" here can be
> interpreted very broadly, I thought this would be easy. Any sequence of
> non-whitespace characters surrounded by whitespace can be considered a
> "word"
> (even if it's a number of some special character such as & or #.) So I did
> this:
>
> \([^[:space:]]+[[:space:]]+\)
>
> This seems to capture a word (in the above sense) plus any following white
> space
> well enough.
>
> But when I try to modify the regex to only match those lines that repeat
> this
> pattern at least 30 times, it fails:
>
> \([^[:space:]]+[[:space:]]+\)\{30,\}
>
> Passing this to `flush-lines` simply deletes everything in the buffer
> starting
> at point, telling me it "[d]eleted 1 matching line", even though (many)
> more
> lines were deleted. Adding ^ and $ around the regex didn't have any effect.
>
> So what am I doing wrong here?
>
>
> --
> Joost Kremers
> Life has its moments
>
>


  reply	other threads:[~2022-04-23 20:58 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-04-23 19:44 Regex to match lines with a specific number of words Joost Kremers
2022-04-23 20:58 ` Thibaut Verron [this message]
2022-04-23 21:20   ` Joost Kremers
2022-04-23 21:46     ` Thibaut Verron
2022-04-23 22:11       ` [External] : " Drew Adams
2022-04-23 22:32         ` Thibaut Verron
2022-04-23 22:21       ` Joost Kremers
2022-04-23 22:46   ` Stefan Monnier via Users list for the GNU Emacs text editor
2022-04-24 14:31     ` Joost Kremers
2022-04-26 23:55 ` Nick Dokos
2022-04-27  7:23   ` Jean Louis

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAFsi02Q7mAMkn2pVei2YfbaN0bk5fQ1xaGicxPE=Bi5bTKnYUQ@mail.gmail.com' \
    --to=thibaut.verron@gmail.com \
    --cc=help-gnu-emacs@gnu.org \
    --cc=joostkremers@fastmail.fm \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).