all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
* Negative occur
@ 2007-11-28 10:15 spamfilteraccount
  2007-11-28 10:21 ` David Kastrup
  0 siblings, 1 reply; 11+ messages in thread
From: spamfilteraccount @ 2007-11-28 10:15 UTC (permalink / raw)
  To: help-gnu-emacs

Is there a command like occur which shows non-matching lines? It would
come in handy for me for the work I'm doing.

It's a useful a feature and not hard to implement, so I'm sure it's
already in Emacs in some form only I'm overlooking it.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
  2007-11-28 10:15 spamfilteraccount
@ 2007-11-28 10:21 ` David Kastrup
  2007-11-28 10:53   ` spamfilteraccount
  0 siblings, 1 reply; 11+ messages in thread
From: David Kastrup @ 2007-11-28 10:21 UTC (permalink / raw)
  To: help-gnu-emacs

"spamfilteraccount@gmail.com" <spamfilteraccount@gmail.com> writes:

> Is there a command like occur which shows non-matching lines? It would
> come in handy for me for the work I'm doing.
>
> It's a useful a feature and not hard to implement, so I'm sure it's
> already in Emacs in some form only I'm overlooking it.

You could try running "occur" with the pattern "^" (which matches
every line), then prune the results with M-x delete-matching-lines RET

-- 
David Kastrup, Kriemhildstr. 15, 44793 Bochum

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
  2007-11-28 10:21 ` David Kastrup
@ 2007-11-28 10:53   ` spamfilteraccount
  2007-11-28 17:03     ` Drew Adams
  2007-11-29 11:44     ` spamfilteraccount
  0 siblings, 2 replies; 11+ messages in thread
From: spamfilteraccount @ 2007-11-28 10:53 UTC (permalink / raw)
  To: help-gnu-emacs

On Nov 28, 11:21 am, David Kastrup <d...@gnu.org> wrote:
> "spamfilteracco...@gmail.com" <spamfilteracco...@gmail.com> writes:
> > Is there a command like occur which shows non-matching lines? It would
> > come in handy for me for the work I'm doing.
>
> > It's a useful a feature and not hard to implement, so I'm sure it's
> > already in Emacs in some form only I'm overlooking it.
>
> You could try running "occur" with the pattern "^" (which matches
> every line), then prune the results with M-x delete-matching-lines RET
>

Yep, there are workarounds. :)

But it seems a very basic feature, so I'd be surprised if Emacs didn't
have it.

We have keep-lines and flush-lines after all, so it would be logical
to have the same for occur.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* RE: Negative occur
  2007-11-28 10:53   ` spamfilteraccount
@ 2007-11-28 17:03     ` Drew Adams
  2007-11-29 11:44     ` spamfilteraccount
  1 sibling, 0 replies; 11+ messages in thread
From: Drew Adams @ 2007-11-28 17:03 UTC (permalink / raw)
  To: spamfilteraccount, help-gnu-emacs

> > > Is there a command like occur which shows non-matching lines? It would
> > > come in handy for me for the work I'm doing.
> >
> > > It's a useful a feature and not hard to implement, so I'm sure it's
> > > already in Emacs in some form only I'm overlooking it.
> >
> > You could try running "occur" with the pattern "^" (which matches
> > every line), then prune the results with M-x delete-matching-lines RET
>
> Yep, there are workarounds. :) But it seems a very basic feature,
> so I'd be surprised if Emacs didn't have it. We have keep-lines
> and flush-lines after all, so it would be logical to have the
> same for occur.

I realize that your suggestion is that this be added to Emacs. I agree.

FYI - In Icicles, just do this:  C-' foobar C-~

That shows and lets you visit all lines that do not match the regexp
"foobar". `C-'' is `icicle-occur'. `C-~' complements the current set of
matches.

You can also then match additional regexps, dynamically, just by typing
them, separated by `S-SPC' or `C-~'. `S-SPC' means AND (set intersection)
and `C-~' means AND NOT (set complement). Each applies to the current set of
matches. Example:

 C-' foobar C-~ toto S-SPC titi C-~ tata

That shows and lets you visit all lines that do not contain "foobar", do
contain "toto", do not contain "titi", and do contain "tata". You get the
idea: you can get what you want by progressively narrowing the search space.

http://www.emacswiki.org/cgi-bin/wiki/Icicles_-_Search_Commands%2c_Overview

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
       [not found] <mailman.4263.1196269422.18990.help-gnu-emacs@gnu.org>
@ 2007-11-28 18:54 ` Ted Zlatanov
  2007-11-28 22:52   ` Drew Adams
  0 siblings, 1 reply; 11+ messages in thread
From: Ted Zlatanov @ 2007-11-28 18:54 UTC (permalink / raw)
  To: help-gnu-emacs

On Wed, 28 Nov 2007 09:03:17 -0800 "Drew Adams" <drew.adams@oracle.com> wrote: 

DA> On Wed, 28 Nov 2007 11:21:43 +0100 David Kastrup <dak@gnu.org> wrote: 
>> > You could try running "occur" with the pattern "^" (which matches
>> > every line), then prune the results with M-x delete-matching-lines RET

DA> I realize that your suggestion is that this be added to Emacs. I agree.

DA> FYI - In Icicles, just do this:  C-' foobar C-~

DA> That shows and lets you visit all lines that do not match the regexp
DA> "foobar". `C-'' is `icicle-occur'. `C-~' complements the current set of
DA> matches.

Both solutions will be slower on a large buffer than they should be.  A
real inversion parameter, either as a predicate function or a variable,
passed lexically or as a parameter to the occur-engine function call, is
necessary.  A predicate function is probably best as it can express
transformations more complex than identity and negation.

Ted

^ permalink raw reply	[flat|nested] 11+ messages in thread

* RE: Negative occur
  2007-11-28 18:54 ` Negative occur Ted Zlatanov
@ 2007-11-28 22:52   ` Drew Adams
  0 siblings, 0 replies; 11+ messages in thread
From: Drew Adams @ 2007-11-28 22:52 UTC (permalink / raw)
  To: Ted Zlatanov, help-gnu-emacs

> >> > You could try running "occur" with the pattern "^" (which matches
> >> > every line), then prune the results with M-x delete-matching-lines
RET

[spamfilteraccount suggested that Emacs should have this as part of
`occur'...]

> DA> I realize that your suggestion is that this be added to
> DA> Emacs. I agree. FYI - In Icicles, just do this: C-' foobar C-~
> DA> That shows and lets you visit all lines that do not match the
> DA> regexp "foobar".
>
> Both solutions will be slower on a large buffer than they should be.

What does "slower than they should be" mean? How slow should they be? How
slow are they in fact? How large is a large buffer? How do you judge that
"they" (two totally different approaches and implementations) are slower
than they should be?

> A real inversion parameter, either as a predicate function or a variable,
> passed lexically or as a parameter to the occur-engine function call, is
> necessary.

Necessary? For what? Why necessary? These are generalizations that don't
help.

Despite the name, `icicle-occur' (C-') doesn't even use "the occur-engine".
If you are suggesting that Emacs `occur' should let you pass a predicate and
incorporate it in a single search pass, that might be worthwhile, but it is
in any case irrelevant for `icicle-occur.

> A predicate function is probably best as it can express
> transformations more complex than identity and negation.

In Icicles, you can also apply a predicate for searching, and you can do
that on the fly too. You can combine any number of predicates that way. You
can mix regexps and predicates for searching. See
http://www.emacswiki.org/cgi-bin/wiki/Icicles_-_Progressive_Completion#Predi
cates.

Your statements are vague, but I'm guessing that what you're really trying
to say is that it is often more efficient to apply a predicate earlier
rather than later (filter promotion), which is true.

Icicles also lets you do that, BTW, to define your own Icicles search
command that incorporates a predicate from the outset (as opposed to using
it as a filter afterward). And this too can be mixed with regexp searching.

An example of this are the definitions of the Icicles Imenu commands that
let you browse definitions of Emacs commands only or non-interactive
functions only: an Imenu regexp to recognize function definitions is
combined with a `commandp' test.
http://www.emacswiki.org/cgi-bin/wiki/Icicles_-_Other_Search_Commands#Define
SearchCommands

However, the real question about efficiency is whether something is
efficient _enough_.

The Icicles approach is designed for interactive use, which is why it
emphasises changing search patterns (and predicates) on the fly. It works
fine with any buffers I've ever used, some of which are pretty darn big.
(How big is big? I just searched a 19MB buffer with no effect on
interactivity.)

As always, the usefulness of a tool depends on what you use it for. If you
want to search a 5 terabyte file, then interactivity might suffer with some
approaches (depending on your hardware... and, especially, depending on your
regexp). But, as always, the devil is in the details.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
  2007-11-28 10:53   ` spamfilteraccount
  2007-11-28 17:03     ` Drew Adams
@ 2007-11-29 11:44     ` spamfilteraccount
  1 sibling, 0 replies; 11+ messages in thread
From: spamfilteraccount @ 2007-11-29 11:44 UTC (permalink / raw)
  To: help-gnu-emacs

On Nov 28, 11:53 am, "spamfilteracco...@gmail.com"
<spamfilteracco...@gmail.com> wrote:
> On Nov 28, 11:21 am, David Kastrup <d...@gnu.org> wrote:
>
> > "spamfilteracco...@gmail.com" <spamfilteracco...@gmail.com> writes:
> > > Is there a command like occur which shows non-matching lines? It would
> > > come in handy for me for the work I'm doing.
>
> > > It's a useful a feature and not hard to implement, so I'm sure it's
> > > already in Emacs in some form only I'm overlooking it.
>
> > You could try running "occur" with the pattern "^" (which matches
> > every line), then prune the results with M-x delete-matching-lines RET
>
> Yep, there are workarounds. :)

Here's a naive implementation of noccur I made for myself:

(defun noccur ()
  (interactive)
  (let ((orig-re-search-forward (symbol-function 're-search-forward)))
    (fset 're-search-forward 'noccur-re-search-forward)
    (unwind-protect
        (call-interactively 'occur)
      (fset 're-search-forward orig-re-search-forward))))


(defun noccur-re-search-forward (regexp bound noerror &optional count)
  (unless (eq (aref regexp 0) ?^)
    (setq regexp (concat ".*" regexp)))

  (while (and (not (eobp))
              (looking-at regexp))
    (forward-line 1))

  (if (eobp)
      nil
    (set-match-data (list (line-beginning-position)
                          (line-end-position)))
    (line-end-position)))


(provide 'nocccur)

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
       [not found] <mailman.4275.1196290389.18990.help-gnu-emacs@gnu.org>
@ 2007-11-29 15:58 ` Ted Zlatanov
  2007-11-29 17:25   ` Drew Adams
  0 siblings, 1 reply; 11+ messages in thread
From: Ted Zlatanov @ 2007-11-29 15:58 UTC (permalink / raw)
  To: help-gnu-emacs

On Wed, 28 Nov 2007 14:52:15 -0800 "Drew Adams" <drew.adams@oracle.com> wrote: 

>> >> > You could try running "occur" with the pattern "^" (which matches
>> >> > every line), then prune the results with M-x delete-matching-lines
DA> RET

DA> [spamfilteraccount suggested that Emacs should have this as part of
DA> `occur'...]

DA> I realize that your suggestion is that this be added to
DA> Emacs. I agree. FYI - In Icicles, just do this: C-' foobar C-~
DA> That shows and lets you visit all lines that do not match the
DA> regexp "foobar".
>> 
>> Both solutions will be slower on a large buffer than they should be.

DA> What does "slower than they should be" mean? How slow should they be? How
DA> slow are they in fact? How large is a large buffer? How do you judge that
DA> "they" (two totally different approaches and implementations) are slower
DA> than they should be?

I'm certain that creating an *occur* buffer on every line of a 100+ MB
buffer and then removing most of them compares poorly in memory usage
and CPU usage to just matching what you need from it.  It's a very
suboptimal approach whose only advantage is that it doesn't require
changes to any internal logic.  A parallel would be (using `sort'
instead of `cat' to account for Emacs' memory usage):

sort file | grep x
grep x file | sort

>> A real inversion parameter, either as a predicate function or a variable,
>> passed lexically or as a parameter to the occur-engine function call, is
>> necessary.

DA> Necessary? For what? Why necessary? These are generalizations that don't
DA> help.

Necessary to implement the solution in such a way that it will satisfy
both the OP and future needs for tuning the occur results.  I'm not
talking about Icicles (that's why I mentioned occur-1 and occur-engine
originally), sorry if I didn't state that clearly.  I just thought that
since you recommended the filter-later approach, Icicles didn't support
predicates, so it made sense to follow up to you.

DA> Your statements are vague, but I'm guessing that what you're really trying
DA> to say is that it is often more efficient to apply a predicate earlier
DA> rather than later (filter promotion), which is true.

Sure.  Reduce the search results as early as possible, as in my earlier
example of sort/grep usage.

DA> The Icicles approach is designed for interactive use, which is why it
DA> emphasises changing search patterns (and predicates) on the fly. It works
DA> fine with any buffers I've ever used, some of which are pretty darn big.
DA> (How big is big? I just searched a 19MB buffer with no effect on
DA> interactivity.)

DA> As always, the usefulness of a tool depends on what you use it for. If you
DA> want to search a 5 terabyte file, then interactivity might suffer with some
DA> approaches (depending on your hardware... and, especially, depending on your
DA> regexp). But, as always, the devil is in the details.

I can see that between a O(n) and O(n log(n)) algorithm for small data
sets, but when the difference is that one approach copies every line and
the other doesn't, while they achieve the same result, it literally
bothers me to recommend the former approach just because the API doesn't
support the latter.  So I'll propose the API change to emacs-devel.

As for hardware, I maintain an Emacs Maemo port, which is for the Nokia
770/800/810 tablets that run GNU/Linux.  There is little memory
available and the CPU is slow, so copying a large buffer unnecessarily
would be terrible for the user experience.

Ted

^ permalink raw reply	[flat|nested] 11+ messages in thread

* RE: Negative occur
  2007-11-29 15:58 ` Ted Zlatanov
@ 2007-11-29 17:25   ` Drew Adams
  0 siblings, 0 replies; 11+ messages in thread
From: Drew Adams @ 2007-11-29 17:25 UTC (permalink / raw)
  To: Ted Zlatanov, help-gnu-emacs

> I'm not talking about Icicles (that's why I mentioned occur-1
> and occur-engine originally), sorry if I didn't state that
> clearly.  I just thought that since you recommended the
> filter-later approach, Icicles didn't support
> predicates, so it made sense to follow up to you.

A misunderstanding, then. I said "In Icicles, just do this..." and you
replied "Both solutions will be slower...".

I didn't recommend any filter-later approach.

In fact, I said "I realize that [spamfilter's] suggestion is that this be
added to Emacs. I agree.". By which I meant that it makes sense to let
`occur' do this itself. And in an efficient way - the first time around (not
via an after-the-fact filter).

> DA> As always, the usefulness of a tool depends on what you use
> DA> it for. If you want to search a 5 terabyte file, then
> DA> interactivity might suffer with some approaches (depending
> DA> on your hardware... and, especially, depending on your
> DA> regexp). But, as always, the devil is in the details.
>
> I can see that between a O(n) and O(n log(n)) algorithm for small data
> sets, but when the difference is that one approach copies every line and
> the other doesn't, while they achieve the same result, it literally
> bothers me to recommend the former approach just because the API doesn't
> support the latter.  So I'll propose the API change to emacs-devel.

Makes sense to me. I thought that Tamas (oops, spamfilter) was already
suggesting that. If `occur' is made to handle non-matches, it should
certainly do that efficiently.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
       [not found] <mailman.4315.1196357177.18990.help-gnu-emacs@gnu.org>
@ 2007-11-29 18:27 ` spamfilteraccount
  2007-12-05 17:44   ` Mathias Dahl
  0 siblings, 1 reply; 11+ messages in thread
From: spamfilteraccount @ 2007-11-29 18:27 UTC (permalink / raw)
  To: help-gnu-emacs

On Nov 29, 6:25 pm, "Drew Adams" <drew.ad...@oracle.com> wrote:
>
> Makes sense to me. I thought that Tamas (oops, spamfilter) was already
> suggesting that.

It's okay to use my name. :)))

I did something wrong when setting up my Google Groups account ages
ago, and I just don't care enough to fix it.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Negative occur
  2007-11-29 18:27 ` spamfilteraccount
@ 2007-12-05 17:44   ` Mathias Dahl
  0 siblings, 0 replies; 11+ messages in thread
From: Mathias Dahl @ 2007-12-05 17:44 UTC (permalink / raw)
  To: help-gnu-emacs

"spamfilteraccount@gmail.com" <spamfilteraccount@gmail.com> writes:

>> Makes sense to me. I thought that Tamas (oops, spamfilter) was already
>> suggesting that.
>
> It's okay to use my name. :)))
>
> I did something wrong when setting up my Google Groups account ages
> ago, and I just don't care enough to fix it.

It's OK Tamas.  At first I though it was annoying, some spammer coming
to the list, but then I found you were really just a nice guy, so now
I think it's just cute... :)

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2007-12-05 17:44 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <mailman.4263.1196269422.18990.help-gnu-emacs@gnu.org>
2007-11-28 18:54 ` Negative occur Ted Zlatanov
2007-11-28 22:52   ` Drew Adams
     [not found] <mailman.4315.1196357177.18990.help-gnu-emacs@gnu.org>
2007-11-29 18:27 ` spamfilteraccount
2007-12-05 17:44   ` Mathias Dahl
     [not found] <mailman.4275.1196290389.18990.help-gnu-emacs@gnu.org>
2007-11-29 15:58 ` Ted Zlatanov
2007-11-29 17:25   ` Drew Adams
2007-11-28 10:15 spamfilteraccount
2007-11-28 10:21 ` David Kastrup
2007-11-28 10:53   ` spamfilteraccount
2007-11-28 17:03     ` Drew Adams
2007-11-29 11:44     ` spamfilteraccount

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.