unofficial mirror of bug-gnu-emacs@gnu.org 
* bug#27560: 26.0.50; seq-uniq is slow
@ 2017-07-03  8:20 Nicolas Petton
  2017-07-03 13:16 ` Tino Calancha
  0 siblings, 1 reply; 6+ messages in thread
From: Nicolas Petton @ 2017-07-03  8:20 UTC (permalink / raw)
  To: 27560; +Cc: ohwoeowho

Oleh (in Cc) did some benchmarks in May 2015, and `seq-uniq' is quite
slow:

    (setq cands (locate-file-completion-table
		 load-path (get-load-suffixes) "" nil t))
    (length cands)
    5357
    (length (cl-remove-duplicates cands :test 'equal))
    2481
    (benchmark-run (cl-remove-duplicates cands :test 'equal))
    (0.67873101 0 0.0)
    (benchmark-run (helm-fast-remove-dups cands :test 'equal))
    (0.001350054 0 0.0)
    (benchmark-run (seq-uniq cands 'equal))
    (5.270219822 27 2.396615401000002)

One easy way to make it faster would be to use `cl-remove-duplicates'
for sequences (sequencep), and default to the current implementation for
other seqp data structures (which means stream.el currently AFAIK).
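
A rough sketch of that dispatch, for illustration only: `my-seq-uniq' is
a made-up name and this is not the seq.el definition.  One caveat is
that `cl-remove-duplicates' keeps the last occurrence of each duplicate
by default, so the order of the result can differ from `seq-uniq'.

```elisp
;; Sketch only: `my-seq-uniq' is a hypothetical name, not part of seq.el.
(require 'seq)
(require 'cl-lib)

(defun my-seq-uniq (sequence &optional testfn)
  "Return a list of the elements of SEQUENCE with duplicates removed.
TESTFN is the equality predicate and defaults to `equal'."
  (if (sequencep sequence)
      ;; Built-in sequence (list, vector, string): delegate to cl-lib.
      ;; Convert to a list first so the result type matches `seq-uniq'.
      (cl-remove-duplicates (seq-into sequence 'list)
                            :test (or testfn #'equal))
    ;; Any other seq.el-compatible structure: use the generic version.
    (seq-uniq sequence testfn)))
```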

Nico


* bug#27560: 26.0.50; seq-uniq is slow
  2017-07-03  8:20 bug#27560: 26.0.50; seq-uniq is slow Nicolas Petton
@ 2017-07-03 13:16 ` Tino Calancha
  2017-07-03 13:50   ` Nicolas Petton
  0 siblings, 1 reply; 6+ messages in thread
From: Tino Calancha @ 2017-07-03 13:16 UTC (permalink / raw)
  To: Nicolas Petton; +Cc: 27560, ohwoeowho, tino.calancha

Nicolas Petton <nicolas@petton.fr> writes:

> Oleh (in Cc) did some benchmarks in May 2015, and `seq-uniq' is quite
> slow:
>
>     (setq cands (locate-file-completion-table
> 		 load-path (get-load-suffixes) "" nil t))
>     (length cands)
>     5357
>     (length (cl-remove-duplicates cands :test 'equal))
>     2481
>     (benchmark-run (cl-remove-duplicates cands :test 'equal))
>     (0.67873101 0 0.0)
>     (benchmark-run (helm-fast-remove-dups cands :test 'equal))
>     (0.001350054 0 0.0)
>     (benchmark-run (seq-uniq cands 'equal))
>     (5.270219822 27 2.396615401000002)
>
> One easy way to make it faster would be to use `cl-remove-duplicates'
> for sequences (sequencep), and default to the current implementation for
> other seqp data structures (which means stream.el currently AFAIK).

The numbers above are Oleh's numbers from 2015.
If I run similar benchmarks now, I don't get such an impressive
difference between cl-lib and seq:

(setq cands (locate-file-completion-table
		 load-path (get-load-suffixes) "" nil t))
("cc-compat" "vi" "longlines" "rcompile" "eudcb-ph" "yow" "fast-lock" "cl-compat" "mouse-sel" "tpu-mapper" "otodo-mode" "levents" ...)
(length cands)
5317
(length (cl-remove-duplicates cands :test 'equal))
1716

;; Each benchmark was run 5 times:

(benchmark-run 1 (helm-fast-remove-dups cands :test 'equal))
(0.003539776 0 0.0)
(0.007002079 0 0.0)
(0.006296864 0 0.0)
(0.002854921 0 0.0)
(0.009388854 0 0.0)

(benchmark-run 1 (cl-remove-duplicates cands :test 'equal))
(0.558590947 0 0.0)
(0.554376583 0 0.0)
(0.554984501 0 0.0)
(0.583015589 0 0.0)
(0.565781774 0 0.0)

(benchmark-run 1 (seq-uniq cands 'equal))
(0.36366049 0 0.0)
(0.37484882 0 0.0)
(0.604267587 1 0.22640233999999282)
(0.360611939 0 0.0)
(0.392288023 0 0.0)
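
The helm function is still around two orders of magnitude faster than
the other two.  A plausible reason, which is an assumption and not
something checked in this thread, is that it records seen elements in a
hash table instead of comparing each element against the others, so the
work grows linearly instead of quadratically with the list length.  A
minimal sketch of that technique, with the hypothetical name
`my-hash-uniq':

```elisp
;; Sketch only: `my-hash-uniq' is a hypothetical helper, not helm's code.
(defun my-hash-uniq (list &optional test)
  "Return LIST without duplicates, keeping the first occurrence of each.
TEST is a hash table test (`eq', `eql' or `equal') and defaults to `equal'."
  (let ((seen (make-hash-table :test (or test #'equal)))
        (result '()))
    (dolist (elt list)
      ;; One hash lookup per element instead of a scan of the result.
      (unless (gethash elt seen)
        (puthash elt t seen)
        (push elt result)))
    (nreverse result)))
```

The trade-off is that the test is limited to what `make-hash-table'
accepts, whereas `seq-uniq' takes an arbitrary predicate.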

Tino






* bug#27560: 26.0.50; seq-uniq is slow
  2017-07-03 13:16 ` Tino Calancha
@ 2017-07-03 13:50   ` Nicolas Petton
  2017-07-03 13:52     ` Nicolas Petton
  0 siblings, 1 reply; 6+ messages in thread
From: Nicolas Petton @ 2017-07-03 13:50 UTC (permalink / raw)
  To: Tino Calancha; +Cc: 27560, ohwoeowho, tino.calancha

Tino Calancha <tino.calancha@gmail.com> writes:

> The numbers above are Oleh's numbers from 2015.

Yes, but AFAIR `seq-uniq' hasn't changed since then.

> If i run similar benchmarks now i don't get such impressive
> difference between cl-lib/seq:

Indeed, I will run some benchmarks as well. In your benchmarks, cl-lib
is still much faster than seq-uniq, even if it's not as impressive as
Oleh's benchmark.

Cheers,
Nico


* bug#27560: 26.0.50; seq-uniq is slow
  2017-07-03 13:50   ` Nicolas Petton
@ 2017-07-03 13:52     ` Nicolas Petton
  2017-07-03 14:09       ` Nicolas Petton
  0 siblings, 1 reply; 6+ messages in thread
From: Nicolas Petton @ 2017-07-03 13:52 UTC (permalink / raw)
  To: Tino Calancha; +Cc: 27560, ohwoeowho, tino.calancha

Nicolas Petton <nicolas@petton.fr> writes:

> Indeed, I will run some benchmarks as well. In your benchmarks, cl-lib
> is still much faster than seq-uniq, even if it's not as impressive as
> Oleh's benchmark.

It looks like I read the benchmark results wrong :)


* bug#27560: 26.0.50; seq-uniq is slow
  2017-07-03 13:52     ` Nicolas Petton
@ 2017-07-03 14:09       ` Nicolas Petton
  2017-07-03 14:19         ` Tino Calancha
  0 siblings, 1 reply; 6+ messages in thread
From: Nicolas Petton @ 2017-07-03 14:09 UTC (permalink / raw)
  To: Tino Calancha; +Cc: 27560, tino.calancha, 27560-done, ohwoeowho

Nicolas Petton <nicolas@petton.fr> writes:

>> Indeed, I will run some benchmarks as well. In your benchmarks, cl-lib
>> is still much faster than seq-uniq, even if it's not as impressive as
>> Oleh's benchmark.
>
> It looks like I read the benchmark results wrong :)

seq-uniq seems to be much faster than in 2015, while its definition
basically hasn't changed.  I won't investigate much further as I'm happy
with the benchmark results.

Nico


* bug#27560: 26.0.50; seq-uniq is slow
  2017-07-03 14:09       ` Nicolas Petton
@ 2017-07-03 14:19         ` Tino Calancha
  0 siblings, 0 replies; 6+ messages in thread
From: Tino Calancha @ 2017-07-03 14:19 UTC (permalink / raw)
  To: Nicolas Petton; +Cc: Tino Calancha, 27560-done, ohwoeowho



On Mon, 3 Jul 2017, Nicolas Petton wrote:

> Nicolas Petton <nicolas@petton.fr> writes:
>
>>> Indeed, I will run some benchmarks as well. In your benchmarks, cl-lib
>>> is still much faster than seq-uniq, even if it's not as impressive as
>>> Oleh's benchmark.
>>
>> It looks like I read the benchmark results wrong :)
>
> seq-uniq seems to be much faster than in 2015, while its definition
> basically hasn't changed.  I won't investigate much further as I'm happy
> with the benchmark results.

I am quite amazed by the 27 garbage collections that Oleh found:
>     (benchmark-run (seq-uniq cands 'equal))
>     (5.270219822 27 2.396615401000002)

I occasionally get 1 GC, typically 0: 27 calls for double-checking.
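
For such a double check, something like the following could help.  It
is only a sketch, and `my-bench-uniq' is a made-up name.  It forces a
garbage collection before each run so that every measurement starts
from a clean heap, and it returns the raw `benchmark-run' results,
each of the form (ELAPSED GC-COUNT GC-ELAPSED).  In Oleh's result the
GC time is about 2.40 s out of 5.27 s, so the second and third numbers
are worth watching.

```elisp
;; Sketch only: `my-bench-uniq' is a hypothetical helper.
(require 'benchmark)
(require 'seq)

(defun my-bench-uniq (fn data &optional runs)
  "Call FN on DATA once per run, for RUNS runs (default 5).
Return a list of `benchmark-run' results, one per run, each of the
form (ELAPSED GC-COUNT GC-ELAPSED)."
  (let ((results '()))
    (dotimes (_ (or runs 5))
      ;; Collect garbage first so leftovers from earlier runs
      ;; are less likely to trigger a GC inside the measurement.
      (garbage-collect)
      (push (benchmark-run 1 (funcall fn data)) results))
    (nreverse results)))

;; Usage, with `cands' defined as earlier in the thread:
;; (my-bench-uniq (lambda (l) (seq-uniq l #'equal)) cands)
```

The GC count also depends on `gc-cons-threshold', so that value should
be noted alongside the timings when comparing across machines.
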
Tino




