From: Eric Wong
To: meta@public-inbox.org
Subject: [PATCH 0/2] extindex: administrative stuffs
Date: Sun, 29 Nov 2020 05:09:33 +0000
Message-Id: <20201129050935.3110-1-e@80x24.org>

Welcome to episode #968 of "Deletes Are Hard".

Since inboxes may be removed from the config file and made
inaccessible unbeknownst to the public-inbox indexing code, we
need a way to remove the stale indexed data associated
with them.

PATCH 2/2 makes it happen with garbage collection.  Lightly-tested,
but it seems to work reasonably well.

PATCH 1/2 is a bug fix :x

Eric Wong (2):
  v2writable: detect shard count for ExtSearchIdx properly
  extindex: support `--gc' to remove dead inboxes

 lib/PublicInbox/ExtSearchIdx.pm   | 90 ++++++++++++++++++++++++++++++-
 lib/PublicInbox/MiscIdx.pm        | 16 ++++++
 lib/PublicInbox/SearchIdx.pm      | 23 +++++++-
 lib/PublicInbox/SearchIdxShard.pm | 21 ++++++--
 lib/PublicInbox/V2Writable.pm     | 16 ++++--
 script/public-inbox-extindex      | 19 +++++--
 t/extsearch.t                     | 19 ++++++-
 7 files changed, 190 insertions(+), 14 deletions(-)