* v1 deletion patches
@ 2021-04-16 12:00 David Bremner
2021-04-16 12:00 ` [PATCH 1/2] perf-tests: add test for removing files David Bremner
` (3 more replies)
0 siblings, 4 replies; 5+ messages in thread
From: David Bremner @ 2021-04-16 12:00 UTC (permalink / raw)
To: notmuch
This version has a better commit message for 2/2 and portability fixes
(thanks to Tomi) for 1/2.
It obsoletes the WIP version [1]
[1]: id:20210414021627.1236560-1-david@tethera.net
^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH 1/2] perf-tests: add test for removing files.
2021-04-16 12:00 v1 deletion patches David Bremner
@ 2021-04-16 12:00 ` David Bremner
2021-04-16 12:00 ` [PATCH 2/2] lib: directly traverse postlists in _n_message_delete David Bremner
` (2 subsequent siblings)
3 siblings, 0 replies; 5+ messages in thread
From: David Bremner @ 2021-04-16 12:00 UTC (permalink / raw)
To: notmuch; +Cc: David Bremner
Without the proposed fix for deletion performance [1], this test is
probably unreasonably slow compared to others at the same
small/medium/large scale.
[1]: id:20210414021627.1236560-3-david@tethera.net
---
performance-test/T00-new.sh | 10 ++++++++++
1 file changed, 10 insertions(+)
diff --git a/performance-test/T00-new.sh b/performance-test/T00-new.sh
index a14dd13f..53de1c27 100755
--- a/performance-test/T00-new.sh
+++ b/performance-test/T00-new.sh
@@ -26,6 +26,16 @@ perl -nle 'rename "$_.renamed", $_' $manifest
time_run "new ($count mv back)" 'notmuch new'
+xargs tar cf backup.tar < $manifest
+
+perl -nle 'unlink $_; unlink $_.copy' $manifest
+
+time_run "new ($count rm)" 'notmuch new'
+
+tar xf backup.tar
+
+time_run "new ($count restore)" 'notmuch new'
+
perl -nle 'link $_, "$_.copy"' $manifest
time_run "new ($count cp)" 'notmuch new'
--
2.30.2
^ permalink raw reply related [flat|nested] 5+ messages in thread
* [PATCH 2/2] lib: directly traverse postlists in _n_message_delete
2021-04-16 12:00 v1 deletion patches David Bremner
2021-04-16 12:00 ` [PATCH 1/2] perf-tests: add test for removing files David Bremner
@ 2021-04-16 12:00 ` David Bremner
2021-04-18 7:10 ` v1 deletion patches Tomi Ollila
2021-04-18 12:57 ` David Bremner
3 siblings, 0 replies; 5+ messages in thread
From: David Bremner @ 2021-04-16 12:00 UTC (permalink / raw)
To: notmuch; +Cc: David Bremner
This is intended to fix the slow behaviour of "notmuch new" (and possibly
"notmuch reindex") when large numbers of files are deleted.
The underlying issue [1] seems to be the Xapian glass backend spending
a large amount of time in db.has_positions when running queries with
large-ish amounts of unflushed changes.
This commit removes two uses of Xapian queries [2], and replaces them with
an approximation of what Xapian would do after optimizing the
queries. This avoids the calls to has_positions (which are in any case
un-needed because we are only using boolean terms here).
[1] Thanks to "andres" on IRC for narrowing down the performance
bottleneck.
[2] Thanks to Olly Betts of Xapian fame for talking me a through a fix
that does not require people to update Xapian.
---
lib/message.cc | 68 +++++++++++++++++++++++++++++---------------------
1 file changed, 39 insertions(+), 29 deletions(-)
diff --git a/lib/message.cc b/lib/message.cc
index 0c2eeab5..42d56acb 100644
--- a/lib/message.cc
+++ b/lib/message.cc
@@ -1356,11 +1356,10 @@ notmuch_status_t
_notmuch_message_delete (notmuch_message_t *message)
{
notmuch_status_t status;
- const char *mid, *tid, *query_string;
+ const char *mid, *tid;
notmuch_message_t *ghost;
notmuch_private_status_t private_status;
notmuch_database_t *notmuch;
- notmuch_query_t *query;
unsigned int count = 0;
bool is_ghost;
@@ -1382,16 +1381,33 @@ _notmuch_message_delete (notmuch_message_t *message)
if (is_ghost)
return NOTMUCH_STATUS_SUCCESS;
- query_string = talloc_asprintf (message, "thread:%s", tid);
- query = notmuch_query_create (notmuch, query_string);
- if (query == NULL)
- return NOTMUCH_STATUS_OUT_OF_MEMORY;
- status = notmuch_query_count_messages (query, &count);
- if (status) {
- notmuch_query_destroy (query);
- return status;
+ /* look for a non-ghost message in the same thread */
+ try {
+ Xapian::PostingIterator thread_doc, thread_doc_end;
+ Xapian::PostingIterator mail_doc, mail_doc_end;
+
+ _notmuch_database_find_doc_ids (message->notmuch, "thread", tid, &thread_doc,
+ &thread_doc_end);
+ _notmuch_database_find_doc_ids (message->notmuch, "type", "mail", &mail_doc, &mail_doc_end);
+
+ while (count == 0 &&
+ thread_doc != thread_doc_end &&
+ mail_doc != mail_doc_end) {
+ thread_doc.skip_to (*mail_doc);
+ if (thread_doc != thread_doc_end) {
+ if (*thread_doc == *mail_doc) {
+ count++;
+ } else {
+ mail_doc.skip_to (*thread_doc);
+ if (mail_doc != mail_doc_end && *thread_doc == *mail_doc)
+ count++;
+ }
+ }
+ }
+ } catch (Xapian::Error &error) {
+ LOG_XAPIAN_EXCEPTION (message, error);
+ return NOTMUCH_STATUS_XAPIAN_EXCEPTION;
}
-
if (count > 0) {
/* reintroduce a ghost in its place because there are still
* other active messages in this thread: */
@@ -1410,27 +1426,21 @@ _notmuch_message_delete (notmuch_message_t *message)
notmuch_message_destroy (ghost);
status = COERCE_STATUS (private_status, "Error converting to ghost message");
} else {
- /* the thread is empty; drop all ghost messages from it */
- notmuch_messages_t *messages;
- status = _notmuch_query_search_documents (query,
- "ghost",
- &messages);
- if (status == NOTMUCH_STATUS_SUCCESS) {
- notmuch_status_t last_error = NOTMUCH_STATUS_SUCCESS;
- while (notmuch_messages_valid (messages)) {
- message = notmuch_messages_get (messages);
- status = _notmuch_message_delete (message);
- if (status) /* we'll report the last failure we see;
- * if there is more than one failure, we
- * forget about previous ones */
- last_error = status;
- notmuch_message_destroy (message);
- notmuch_messages_move_to_next (messages);
+ /* the thread now contains only ghosts: delete them */
+ try {
+ Xapian::PostingIterator doc, doc_end;
+
+ _notmuch_database_find_doc_ids (message->notmuch, "thread", tid, &doc, &doc_end);
+
+ for (; doc != doc_end; doc++) {
+ message->notmuch->writable_xapian_db->delete_document (*doc);
}
- status = last_error;
+ } catch (Xapian::Error &error) {
+ LOG_XAPIAN_EXCEPTION (message, error);
+ return NOTMUCH_STATUS_XAPIAN_EXCEPTION;
}
+
}
- notmuch_query_destroy (query);
return status;
}
--
2.30.2
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: v1 deletion patches
2021-04-16 12:00 v1 deletion patches David Bremner
2021-04-16 12:00 ` [PATCH 1/2] perf-tests: add test for removing files David Bremner
2021-04-16 12:00 ` [PATCH 2/2] lib: directly traverse postlists in _n_message_delete David Bremner
@ 2021-04-18 7:10 ` Tomi Ollila
2021-04-18 12:57 ` David Bremner
3 siblings, 0 replies; 5+ messages in thread
From: Tomi Ollila @ 2021-04-18 7:10 UTC (permalink / raw)
To: David Bremner, notmuch
On Fri, Apr 16 2021, David Bremner wrote:
> This version has a better commit message for 2/2 and portability fixes
> (thanks to Tomi) for 1/2.
Looks safe enough to be pushed in.
Tomi
>
> It obsoletes the WIP version [1]
>
> [1]: id:20210414021627.1236560-1-david@tethera.net
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: v1 deletion patches
2021-04-16 12:00 v1 deletion patches David Bremner
` (2 preceding siblings ...)
2021-04-18 7:10 ` v1 deletion patches Tomi Ollila
@ 2021-04-18 12:57 ` David Bremner
3 siblings, 0 replies; 5+ messages in thread
From: David Bremner @ 2021-04-18 12:57 UTC (permalink / raw)
To: notmuch
David Bremner <david@tethera.net> writes:
> This version has a better commit message for 2/2 and portability fixes
> (thanks to Tomi) for 1/2.
>
> It obsoletes the WIP version [1]
>
> [1]: id:20210414021627.1236560-1-david@tethera.net
Applied to master.
d
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2021-04-18 12:57 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2021-04-16 12:00 v1 deletion patches David Bremner
2021-04-16 12:00 ` [PATCH 1/2] perf-tests: add test for removing files David Bremner
2021-04-16 12:00 ` [PATCH 2/2] lib: directly traverse postlists in _n_message_delete David Bremner
2021-04-18 7:10 ` v1 deletion patches Tomi Ollila
2021-04-18 12:57 ` David Bremner
Code repositories for project(s) associated with this public inbox
https://yhetil.org/notmuch.git/
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).