From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS27357 104.130.224.0/20 X-Spam-Status: No, score=-3.6 required=3.0 tests=AWL,BAYES_00,RP_MATCHES_RCVD, SPF_PASS shortcircuit=no autolearn=ham autolearn_force=no version=3.4.0 Received: from cloud.peff.net (cloud.peff.net [104.130.231.41]) by dcvr.yhbt.net (Postfix) with SMTP id 5A6FF20899 for ; Wed, 23 Aug 2017 20:06:54 +0000 (UTC) Received: (qmail 24992 invoked by uid 109); 23 Aug 2017 20:06:54 -0000 Received: from Unknown (HELO peff.net) (10.0.1.2) by cloud.peff.net (qpsmtpd/0.94) with SMTP; Wed, 23 Aug 2017 20:06:54 +0000 Authentication-Results: cloud.peff.net; auth=none Received: (qmail 3527 invoked by uid 111); 23 Aug 2017 20:07:21 -0000 Received: from Unknown (HELO sigill.intra.peff.net) (10.0.1.3) by peff.net (qpsmtpd/0.94) with SMTP; Wed, 23 Aug 2017 16:07:21 -0400 Authentication-Results: peff.net; auth=none Received: by sigill.intra.peff.net (sSMTP sendmail emulation); Wed, 23 Aug 2017 16:06:51 -0400 Date: Wed, 23 Aug 2017 16:06:51 -0400 From: Jeff King To: Stefan Beller Cc: Eric Wong , meta@public-inbox.org Subject: Re: Nonlinear history? Message-ID: <20170823200651.gb7e5h3dywduxkrk@sigill.intra.peff.net> References: <20170823014239.GA4113@starla> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: List-Id: On Wed, Aug 23, 2017 at 11:29:24AM -0700, Stefan Beller wrote: > Note that Peff seems to have build tooling around public-inbox > (https://public-inbox.org/git/20170823154747.vxtyy2v2ofkxwrkx@sigill.intra.peff.net/) > that would produce this precise lookup already. It's not really built around public-inbox. I just like public-inbox URLs because they use global identifiers, which means I can index them into other systems. My setup is basically maildir (backfilled from gmane long ago and kept up to date with my subscription), indexed by mairix, and a script that does m{https?:/public-inbox.org/git/(\S+)} on mail contents and and runs "mairix m:$1" on the result. It also looks for gmane.org URLs and runs gunzip -c ~/.gmane-to-mid.gz | grep "^${id}\$" Not exactly high-tech, but it was easy to write and linear search is good enough for personal use. I used to do the same thing when gmane was up by resolving the article numbers at gmane. But doing it without hitting the network is nicer anyway, and of course the online method doesn't work anymore. :) -Peff