unofficial mirror of emacs-devel@gnu.org 
From: David Kastrup <dak@gnu.org>
To: emacs-devel@gnu.org
Subject: Re: Progress report on git-blame
Date: Sat, 25 Jan 2014 11:12:24 +0100
Message-ID: <87eh3wcquv.fsf@fencepost.gnu.org>
In-Reply-To: 87iot8csyi.fsf@fencepost.gnu.org

David Kastrup <dak@gnu.org> writes:

> martin rudalics <rudalics@gmx.at> writes:
>
>>> Well, I finally did "git gc --aggressive" on my Emacs repository and
>>> tried again git-blame src/xdisp.c and the times got considerably _worse_
>>> than before the packing (though the repository went to about a quarter
>>> in size).  So after my fixes the run time in real use cases is probably
>>> dominated by unpacking the various revisions, particularly with a
>>> well-compressed repository.
>>
>> Seems like the second law of thermodynamics still applies ...
>
> Yes, life is still easiest on the peanut galleries.

At any rate, I forgot to set the comparison function on the governing
priority queue, turning it into a FIFO.  So the actual numbers on the
packed repository are not as disappointing as I first thought.
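
To illustrate how a missing comparison function degrades a priority
queue, here is a minimal Python sketch.  It is a toy under stated
assumptions, not git's actual queue: the class WorkQueue, its key
parameter and the helper drain are all hypothetical names.  Without a
key every entry gets the same priority, so entries come back in
insertion order (FIFO); with a key they come back in priority order.

```python
import heapq
import itertools


class WorkQueue:
    """Toy priority queue; hypothetical, not git's implementation."""

    def __init__(self, key=None):
        self._key = key
        self._heap = []
        self._counter = itertools.count()  # insertion sequence number

    def push(self, item):
        seq = next(self._counter)
        # Without a key every entry has the same priority, so the
        # sequence number alone decides the order: plain FIFO.
        prio = self._key(item) if self._key is not None else 0
        heapq.heappush(self._heap, (prio, seq, item))

    def pop(self):
        return heapq.heappop(self._heap)[2]

    def __len__(self):
        return len(self._heap)


def drain(queue, items):
    """Push all items, then pop them all and return the pop order."""
    for item in items:
        queue.push(item)
    return [queue.pop() for _ in range(len(queue))]
```

Draining (name, date) pairs through WorkQueue() returns them in the
order they were pushed, while WorkQueue(key=lambda c: -c[1]) returns
the newest entry first.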

Here are the numbers, first the system version, then my own version:

dak@lola:/usr/local/tmp/emacs$ time git blame HEAD src/xdisp.c >/tmp/blame1

real	3m17.437s
user	2m23.676s
sys	0m52.560s
dak@lola:/usr/local/tmp/emacs$ time ../git/git blame HEAD src/xdisp.c >/tmp/blame2

real	1m24.628s
user	0m32.008s
sys	0m52.044s
dak@lola:/usr/local/tmp/emacs$ cmp /tmp/blame1 /tmp/blame2
dak@lola:/usr/local/tmp/emacs$ 
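
For reference, the speedup factors implied by the two runs above can
be worked out with a short Python sketch.  The helper name seconds and
the two dictionaries are illustrative only; the timings are copied
from the transcript.

```python
def seconds(stamp):
    """Convert a `time` stamp such as '3m17.437s' to seconds."""
    minutes, rest = stamp.split("m")
    return int(minutes) * 60 + float(rest.rstrip("s"))


# Timings copied from the transcript above.
system = {"real": "3m17.437s", "user": "2m23.676s", "sys": "0m52.560s"}
patched = {"real": "1m24.628s", "user": "0m32.008s", "sys": "0m52.044s"}

# Ratio of system-version time to patched-version time per category.
speedup = {k: seconds(system[k]) / seconds(patched[k]) for k in system}
for kind, factor in speedup.items():
    print(f"{kind}: {factor:.2f}x")
```

This gives roughly 2.3x for real time and 4.5x for user time, while
system time is essentially unchanged and now makes up well over half
of the wall-clock time of the patched run.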

But it's obvious that significant further improvements will depend on
one or more of
a) making git-blame respond to interactively determined requirements
(basically, reacting to expose events)
b) significantly decreasing the involved system time by employing a good
caching strategy
c) not letting the basic algorithm and the cached data progress
independently but instead tailoring the workload order to whatever
happens to be in memory at a given point in time.

c) is a very shady approach and would make debugging a nightmare.
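
As a rough illustration of what option b) could look like, here is a
minimal Python sketch of an LRU cache in front of an expensive
unpacking step.  It is an assumption, not what git-blame does: the
function unpack_object, the counter unpack_calls and the cache size
are hypothetical stand-ins.

```python
from functools import lru_cache

unpack_calls = 0  # counts how often the expensive path is taken


@lru_cache(maxsize=256)  # cache size is an arbitrary assumption
def unpack_object(object_id):
    """Hypothetical stand-in for inflating one object from a pack."""
    global unpack_calls
    unpack_calls += 1
    return f"contents of {object_id}"


# Revisiting the same objects only pays the unpacking cost once each.
for object_id in ["a1", "b2", "a1", "a1", "b2"]:
    unpack_object(object_id)
```

Here five requests trigger only two real unpacking calls; whether a
comparable hit rate is achievable on real history depends on the
order in which revisions are visited, which is what option c) tries
to exploit.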

At any rate, the user time already dropped by more than a factor of 4,
and I have not touched the inner work horse (the diff algorithm) at all,
just eliminated all the thrashing around it.  Now the diff algorithm
itself is a separate library by a separate author, with widespread use.
It would be rather optimistic to assume that there is significant
low-hanging fruit to be found there.

Shrug.  I'll go see how I can reimplement the copy/move detection,
which is now defunct (it is not exercised without specific command line
options), in order to get the current work into a shape where upstream
can consider accepting it.

-- 
David Kastrup



