unofficial mirror of meta@public-inbox.org
From: "Eric Wong (Contractor, The Linux Foundation)" <e@80x24.org>
To: meta@public-inbox.org
Subject: [PATCH 00/27] v2 public-inbox-watch support
Date: Mon, 19 Mar 2018 08:14:32 +0000
Message-ID: <20180319081459.10645-1-e@80x24.org>

Barrier support in v2writable makes checking for duplicates during
large imports faster, as it no longer requires tearing down
subprocesses.  This also makes checking deletes a bit faster
(we check deletes to avoid blindly writing the same deleted
files over and over again).
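
To illustrate the idea only (this is not the V2Writable code, and
every name below is hypothetical), here is a minimal Python sketch of
a barrier: the writer asks a long-lived worker to flush what it has
queued and waits for an acknowledgement, instead of shutting the
worker down and starting a new one.  A thread stands in for the
indexing subprocess, and a set of Message-IDs stands in for the index.

```python
import queue
import threading


class Worker:
    """Long-lived indexing worker (hypothetical); a thread stands in
    for a subprocess here."""

    def __init__(self):
        self.inbox = queue.Queue()
        self.indexed = set()  # Message-IDs the worker has committed
        self.thread = threading.Thread(target=self._run, daemon=True)
        self.thread.start()

    def _run(self):
        pending = []
        while True:
            cmd, arg = self.inbox.get()
            if cmd == "add":
                pending.append(arg)
            elif cmd == "barrier":
                # Flush everything queued so far, then acknowledge.
                self.indexed.update(pending)
                pending.clear()
                arg.set()
            elif cmd == "done":
                self.indexed.update(pending)
                return

    def add(self, mid):
        self.inbox.put(("add", mid))

    def barrier(self):
        """Block until all prior adds are committed; the worker keeps running."""
        ack = threading.Event()
        self.inbox.put(("barrier", ack))
        ack.wait()

    def done(self):
        self.inbox.put(("done", None))
        self.thread.join()


def import_messages(worker, mids):
    """Import mids, skipping duplicates; return the Message-IDs added."""
    added = []
    for mid in mids:
        worker.barrier()           # make earlier writes visible, no respawn
        if mid in worker.indexed:  # duplicate check against committed state
            continue
        worker.add(mid)
        added.append(mid)
    return added
```

The sketch issues a barrier before every duplicate check to keep the
logic obvious; a real importer would presumably batch more.  The point
is only that the worker survives each check.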

On the subject of subprocesses for v2, they're now optional
(per-invocation), so -init obviously won't need to spawn
subprocesses, and normal -watch usage shouldn't, either, as mail
doesn't tend to arrive quickly.

public-inbox-watch seems to be working well enough on my desktop
with my Maildir for LKML (synched via offlineimap).

Eric Wong (Contractor, The Linux Foundation) (27):
  content_id: use Sender header if From is not available
  v2writable: support "barrier" operation to avoid reforking
  use string ref for Email::Simple->new
  v2writable: remove unnecessary idx_init call
  searchidx: do not delete documents while iterating
  search: allow ->reopen to be chainable
  v2writable: implement remove correctly
  skeleton: barrier init requires a lock
  import: (v2) delete writes the blob into history in subdir
  import: (v2): write deletes to a separate '_' subdirectory
  import: implement barrier operation for v1 repos
  mid: mid_mime uses v2-compatible mids function
  watchmaildir: use content_digest to generate Message-Id
  import: force Message-ID generation for v1 here
  import: switch to URL-safe Base64 for Message-IDs
  v2writable: test for idempotent removals
  import: enable locking under v2
  index: s/GIT_DIR/REPO_DIR/
  Lock: new base class for writable lockers
  t/watch_maildir: note the reason for FIFO creation
  v2writable: ensure ->done is idempotent
  watchmaildir: support v2 repositories
  searchidxpart: s/barrier/remote_barrier/
  v2writable: allow disabling parallelization
  scripts/import_vger_from_mbox: filter out same headers as MDA
  v2writable: add DEBUG_DIFF env support
  v2writable: remove "resent" message for duplicate Message-IDs

 Documentation/public-inbox-index.pod |   8 +-
 MANIFEST                             |   2 +
 lib/PublicInbox/ContentId.pm         |  16 +++-
 lib/PublicInbox/Import.pm            | 132 ++++++++++++++++++-----------
 lib/PublicInbox/Lock.pm              |  31 +++++++
 lib/PublicInbox/MID.pm               |   3 +-
 lib/PublicInbox/Msgmap.pm            |   8 ++
 lib/PublicInbox/Search.pm            |   1 +
 lib/PublicInbox/SearchIdx.pm         | 159 +++++++++++++++++++++++------------
 lib/PublicInbox/SearchIdxPart.pm     |  62 +++++++++-----
 lib/PublicInbox/SearchIdxSkeleton.pm | 112 +++++++++++++++++-------
 lib/PublicInbox/SearchMsg.pm         |   4 +-
 lib/PublicInbox/V2Writable.pm        | 136 ++++++++++++++++++++++++------
 lib/PublicInbox/WatchMaildir.pm      |  54 ++++--------
 script/public-inbox-index            |  12 +--
 script/public-inbox-init             |   3 +-
 scripts/import_vger_from_mbox        |   2 +
 t/v2writable.t                       |  51 +++++++++--
 t/watch_maildir.t                    |   3 +-
 t/watch_maildir_v2.t                 | 125 +++++++++++++++++++++++++++
 20 files changed, 683 insertions(+), 241 deletions(-)
 create mode 100644 lib/PublicInbox/Lock.pm
 create mode 100644 t/watch_maildir_v2.t

-- 
EW

