From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-Status: No, score=-2.9 required=3.0 tests=AWL,BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_BL_SPAMCOP_NET, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS shortcircuit=no autolearn=no autolearn_force=no version=3.4.2 Received: from mail-ej1-x635.google.com (mail-ej1-x635.google.com [IPv6:2a00:1450:4864:20::635]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id 55C011F86C for ; Tue, 1 Dec 2020 14:00:42 +0000 (UTC) Received: by mail-ej1-x635.google.com with SMTP id qw4so4237360ejb.12 for ; Tue, 01 Dec 2020 06:00:42 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linuxfoundation.org; s=google; h=date:from:to:cc:subject:message-id:mail-followup-to:references :mime-version:content-disposition:in-reply-to:user-agent; bh=h4CyVl5c5rT8SmRxkb00zOgGy5ivSzk3vbpEpVo8E7Y=; b=SpHoGTfqjd4HLPKEV0Pdk1t78pVputzGh4FOk5PupT2rey344SeGeTOc4F+ZphLysb 4JfaTwq4Ud1VVejGNnvgk+PAhkA+/1p5hYJg4PxNY6di6UF91wjg/nhnsjgbvYDwReDw 2+djiEamx/7ebOC6jfBaMTxSl8eXmJmYNayMk= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id :mail-followup-to:references:mime-version:content-disposition :in-reply-to:user-agent; bh=h4CyVl5c5rT8SmRxkb00zOgGy5ivSzk3vbpEpVo8E7Y=; b=L14TcY55SwOoNirD5nHTiV/wnebCJm0UczgpF3Fa8AkpLLE2dBcHxHK0Fx5DxDqeAT 8hroT9Wlt4Z9+pnSG6FTvdkQN3rPxCqECdOb2eoBLniTT3d5NgU4D6tfLtG534Q5jbPk LY7X6MbJbTUjacQ50EQzZZhc2mApuU7wa8uPsAgNk0r9mSO7GgdEtFxC9hpl75XSWNId EKtjVP89uFHrUR/8dz3UXRZ1YW8rircJ/i4I61vCHjUzRLJgDM48Oays+qa6WsDAymQg sGt+ye+zCeVARgS3JYgAbAwqvy/YYy8vhX45ly1AtsfpzZ7cRlNGKeU4pdoxPLzmPVlN 5jeQ== X-Gm-Message-State: AOAM531IOfWjvuqQ5swNEnlRkUesQPgsacMxT515k/O888rYlDvsWPN+ wvgzEspRRXNA4iNlDW/ATup10A== X-Google-Smtp-Source: ABdhPJxYMaYkjOxGxLmJX3fIL7Bvh0EmQIunDHHaAUtxaaIIU6WQFdgB9VHsFMrctpDAcrhq/YXxnA== X-Received: by 2002:a17:906:2e16:: with SMTP id n22mr3196135eji.477.1606831240873; Tue, 01 Dec 2020 06:00:40 -0800 (PST) Received: from nitro.local (tor-exit-14.zbau.f3netze.de. [185.220.100.241]) by smtp.gmail.com with ESMTPSA id u23sm887039ejy.87.2020.12.01.06.00.38 (version=TLS1_2 cipher=ECDHE-ECDSA-CHACHA20-POLY1305 bits=256/256); Tue, 01 Dec 2020 06:00:39 -0800 (PST) Date: Tue, 1 Dec 2020 09:00:33 -0500 From: Konstantin Ryabitsev To: Eric Wong Cc: workflows@vger.kernel.org, meta@public-inbox.org Subject: Re: WIP: searching all of lore Message-ID: <20201201140033.gyxmaejay2ddpiz3@nitro.local> Mail-Followup-To: Eric Wong , workflows@vger.kernel.org, meta@public-inbox.org References: <20201126194543.GA30337@dcvr> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20201126194543.GA30337@dcvr> User-Agent: NeoMutt/20171215 List-Id: On Thu, Nov 26, 2020 at 07:45:43PM +0000, Eric Wong wrote: > Requires Tor, for now: > > http://rskvuqcfnfizkjg6h5jvovwb3wkikzcwskf54lfpymus6mxrzw67b5ad.onion/all/ > http://lore.czquwvybam4bgbro.onion/all/ Thanks for this work, Eric, things are looking good in my tests, though I uncovered a bunch of problems with b4 when used with torsocks. :) When grabbing t.mbox.gz threads from /all, it appears to properly reconstitute follow-ups from multiple mailing lists, correct? Is there a way to "weight" different sources, so that when the same message-id exist in multiple places, we can prefer one source over another? For example, this is useful when we're trying to do DKIM validation and some lists are known to mess that up, while others do the right thing. Thanks again, Konstantin