From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-Status: No, score=-3.4 required=3.0 tests=AWL,BAYES_00,SPF_HELO_NONE, SPF_PASS shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from metis.ext.pengutronix.de (metis.ext.pengutronix.de [IPv6:2001:67c:670:201:290:27ff:fe1d:cc33]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id 0D62B1F4B4 for ; Tue, 22 Dec 2020 07:37:13 +0000 (UTC) Received: from ptx.hi.pengutronix.de ([2001:67c:670:100:1d::c0]) by metis.ext.pengutronix.de with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1krcED-0000hA-KO; Tue, 22 Dec 2020 08:37:05 +0100 Received: from ukl by ptx.hi.pengutronix.de with local (Exim 4.92) (envelope-from ) id 1krcEC-0001AD-Ln; Tue, 22 Dec 2020 08:37:04 +0100 Date: Tue, 22 Dec 2020 08:37:04 +0100 From: Uwe =?utf-8?Q?Kleine-K=C3=B6nig?= To: meta@public-inbox.org Cc: Konstantin Ryabitsev Subject: About header filtering Message-ID: <20201222073704.u7hacjk5m7mpuc52@pengutronix.de> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="hzcdlilmmbwmvrgy" Content-Disposition: inline X-SA-Exim-Connect-IP: 2001:67c:670:100:1d::c0 X-SA-Exim-Mail-From: ukl@pengutronix.de X-SA-Exim-Scanned: No (on metis.ext.pengutronix.de); SAEximRunCond expanded to false X-PTX-Original-Recipient: meta@public-inbox.org List-Id: --hzcdlilmmbwmvrgy Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Hello, I'm trying to setup a public-inbox instance to archive some mailing lists using a regular subscription (so I'm not collecting the mails directly at the mailing list address, but rely on the mailing list software (here: mailman) to forward to the archiver). One thing I want to have is that some headers that are relevant for the path between the mailing list host and the subscribed mail account only are filtered out. That's things like: Received: from $mailinglistserver ([2001:....]) by $publicinboxmachine with esmtp (Exim 4.92) (envelope-from ) id 23487275432 for $publicinboxaccount; Fri, 18 Dec 2020 15:48:54 +0100 Envelope-to: $publicinboxaccount Return-path: Errors-To: listname-bounces+something@mailinglistdomain I found that Konstantin Ryabitsev's tool to prepare an initial archive =66rom an already existing mailing list[1] filters some of these out, but the instance on kernel.org has some of these details, too. (See for example https://lore.kernel.org/lkml/20201013082132.661993-1-u.kleine-koenig@pengut= ronix.de/raw; there are Return-Path: and also some Received: headers that I consider not-so-nice as they were added after the mail was processed by the mailing list tool on vger.kernel.org.) Is it considerd bad to filter these out? Or is it just that nobody wanted this kind of cleanliness before in such a setup? I could handcraft a preprocessor[2] but I assume that a solution in public-inbox itself would find some users?! Best regards Uwe [1] https://git.kernel.org/pub/scm/linux/kernel/git/mricon/korg-helpers.git= /plain/list-archive-maker.py [2] something like formail -I Envelope-to -I Return-path -I Errors-To but filtering Received: is a bit harder if you want to keep the lines describing the path from the sender to the mailing list. --=20 Pengutronix e.K. | Uwe Kleine-K=F6nig | Industrial Linux Solutions | https://www.pengutronix.de/ | --hzcdlilmmbwmvrgy Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQEzBAABCgAdFiEEfnIqFpAYrP8+dKQLwfwUeK3K7AkFAl/hohwACgkQwfwUeK3K 7Akpmwf8Czsdhw4gqcRuPPvQWCFwj+P9JDbmMxqYfpULAsUEpHzWuMGD9d5tTj/T W4iKHWsoN5FtnCTSXrH4KKyenAaIczN1q75ff2RvMdMdex8ylzECmXtxhNDc6Z+L DWXZb+YV/27dAqHYY+jNf6m7VyexXzFvbG2Ttv7uZtDx+gpqoEEaVedEQ7S1QjnS OsWBw7+LJ8yRrf5Fw72vTamwKfpaekwq6viQZOYfIF+0WUSuMuqpNzmlIqdPjhZk KvFCIJbsAQP7LjCvorGea4EmElEuw2qbAWFG3K/IRnDHOC2USf+GMmUASYXDU3oo YbDp3RjoOmdz7Sr+7sYL4zxGKaoQpg== =AkI2 -----END PGP SIGNATURE----- --hzcdlilmmbwmvrgy--