From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS11403 173.228.157.0/24 X-Spam-Status: No, score=-3.8 required=3.0 tests=AWL,BAYES_00,DKIM_SIGNED, DKIM_VALID,RCVD_IN_DNSWL_LOW,SPF_HELO_NONE,SPF_PASS shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from pb-smtp20.pobox.com (pb-smtp20.pobox.com [173.228.157.52]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id 24E4E1F45A for ; Mon, 20 Apr 2020 00:13:42 +0000 (UTC) Received: from pb-smtp20.pobox.com (unknown [127.0.0.1]) by pb-smtp20.pobox.com (Postfix) with ESMTP id BDA61CAEBB; Sun, 19 Apr 2020 20:13:40 -0400 (EDT) (envelope-from kyle@kyleam.com) DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=pobox.com; h=from:to:cc :subject:date:message-id:in-reply-to:references:mime-version :content-transfer-encoding; s=sasl; bh=DCS1D4c4OVgPEDX/8znkbNAbN tM=; b=i2ejRifFCtSv57FVTPNznKSld/6MT7WsdS8nOnoq+LFL5CSkNjJRp4QQQ jYAUXbfV1rfZBifSju9R3E5egysrCSt4jiwlX2lVmhHVBU8Zptm1mpNVO372uE2B Cz7zH8RrSewY7Jkxon3hlkvwV1ir3V0rAXcpKaAs14PohjLhas= Received: from pb-smtp20.sea.icgroup.com (unknown [127.0.0.1]) by pb-smtp20.pobox.com (Postfix) with ESMTP id A25A8CAEBA; Sun, 19 Apr 2020 20:13:40 -0400 (EDT) (envelope-from kyle@kyleam.com) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed; d=kyleam.com; h=from:to:cc:subject:date:message-id:in-reply-to:references:mime-version:content-transfer-encoding; s=mesmtp; bh=BLfjSzWszNyipHRTMWsyufeirPEyncy5lOmgxIK6wnY=; b=eXXd3vpe6MITQYSi8c9R+yitB5ce/OULkYUAhrgI/yob+gJbf6DV0As1wKzO6DO0qy1se6yBStMtdw66mViIg2Au8O2aEyDuufrD1SbJjiiswozyRKhYwcFoneyzuXNmodmGRLxnJt9e5uOX9NsqgjEwDOyRv5TPU53CHQFoAR8= Received: from localhost (unknown [45.33.91.115]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by pb-smtp20.pobox.com (Postfix) with ESMTPSA id 147B9CAEB9; Sun, 19 Apr 2020 20:13:38 -0400 (EDT) (envelope-from kyle@kyleam.com) From: Kyle Meyer To: Eric Wong Cc: Kyle Meyer , meta@public-inbox.org Subject: [PATCH v2] watchmaildir: support multiple watchheader values Date: Sun, 19 Apr 2020 20:13:32 -0400 Message-Id: <20200420001332.14769-1-kyle@kyleam.com> X-Mailer: git-send-email 2.26.0 In-Reply-To: <20200412215933.GA63705@dcvr> References: <20200412215933.GA63705@dcvr> MIME-Version: 1.0 X-Pobox-Relay-ID: C8584E7C-829B-11EA-B7C9-B0405B776F7B-24757444!pb-smtp20.pobox.com Content-Transfer-Encoding: quoted-printable List-Id: The watchheader key supports only a single value. Supporting multiple watchheader values was mentioned in discussion [1] of 8d3e3bd8 (doc: explain publicinbox..watchheader, 2019-10-09), and it wasn't clear if there was a need. One scenario in which matching multiple headers would be convenient is when someone wants to set up public-inbox archives for some small projects but does _not_ want to run mailing lists for them, instead allowing others to follow the project by any of the pull mechanisms. Using a common underlying address, an address alias for each project is configured via a third-party email provider, with messages for each alias being exposed as a separate public-inbox archive. In this setup, messages for an inbox cannot be selected by a List-ID header but can be identified by the inbox's address in either the To or Cc header. To support such a use case, update the watchheader handling to consider multiple values, accepting a message if it matches any value. While selecting a message based on matching _any_ rather than _all_ values is motivated by the above scenario, it's worth noting that the "any" behavior is consistent with how multiple listid config values are handled. [1] https://public-inbox.org/meta/20191010085118.r3amey4cayazfycb@dcvr/ --- Changes since v1: * Add a test for matching multiple watchheader values. * Adjust WwwText::inbox_config to handle watchheader as an array. * Condense commit message. Documentation/public-inbox-config.pod | 3 +- MANIFEST | 1 + lib/PublicInbox/Config.pm | 4 +- lib/PublicInbox/WatchMaildir.pm | 8 +-- lib/PublicInbox/WwwText.pm | 4 +- t/watch_muliple_headers.t | 76 +++++++++++++++++++++++++++ 6 files changed, 88 insertions(+), 8 deletions(-) create mode 100644 t/watch_muliple_headers.t diff --git a/Documentation/public-inbox-config.pod b/Documentation/public= -inbox-config.pod index 4c9994dc..708a785f 100644 --- a/Documentation/public-inbox-config.pod +++ b/Documentation/public-inbox-config.pod @@ -88,7 +88,8 @@ Default: none; only for L users watchheader =3D List-Id: =20 If specified, L will only process mail matching -the given header. Multiple values are not currently supported. +the given header. If specified multiple times, mail will be processed +if it matches any of the values. =20 Default: none; only for L users =20 diff --git a/MANIFEST b/MANIFEST index 8b724352..bfcf26b4 100644 --- a/MANIFEST +++ b/MANIFEST @@ -307,6 +307,7 @@ t/view.t t/watch_filter_rubylang.t t/watch_maildir.t t/watch_maildir_v2.t +t/watch_muliple_headers.t t/www_altid.t t/www_listing.t t/www_static.t diff --git a/lib/PublicInbox/Config.pm b/lib/PublicInbox/Config.pm index 917939ca..458f29b2 100644 --- a/lib/PublicInbox/Config.pm +++ b/lib/PublicInbox/Config.pm @@ -367,7 +367,7 @@ sub _fill { my $ibx =3D {}; =20 foreach my $k (qw(inboxdir filter newsgroup - watch watchheader httpbackendmax + watch httpbackendmax replyto feedmax nntpserver indexlevel)) { my $v =3D $self->{"$pfx.$k"}; $ibx->{$k} =3D $v if defined $v; @@ -388,7 +388,7 @@ sub _fill { # TODO: more arrays, we should support multi-value for # more things to encourage decentralization foreach my $k (qw(address altid nntpmirror coderepo hide listid url - infourl)) { + infourl watchheader)) { if (defined(my $v =3D $self->{"$pfx.$k"})) { $ibx->{$k} =3D _array($v); } diff --git a/lib/PublicInbox/WatchMaildir.pm b/lib/PublicInbox/WatchMaild= ir.pm index bea2ed2a..7b9e8915 100644 --- a/lib/PublicInbox/WatchMaildir.pm +++ b/lib/PublicInbox/WatchMaildir.pm @@ -59,9 +59,11 @@ sub new { my $watch =3D $ibx->{watch} or return; if (is_maildir($watch)) { my $watch_hdrs =3D []; - if (my $wh =3D $ibx->{watchheader}) { - my ($k, $v) =3D split(/:/, $wh, 2); - push @$watch_hdrs, [ $k, qr/\Q$v\E/ ]; + if (my $whs =3D $ibx->{watchheader}) { + for (@$whs) { + my ($k, $v) =3D split(/:/, $_, 2); + push @$watch_hdrs, [ $k, qr/\Q$v\E/ ]; + } } if (my $list_ids =3D $ibx->{listid}) { for (@$list_ids) { diff --git a/lib/PublicInbox/WwwText.pm b/lib/PublicInbox/WwwText.pm index 2008ba09..b23a415e 100644 --- a/lib/PublicInbox/WwwText.pm +++ b/lib/PublicInbox/WwwText.pm @@ -151,7 +151,7 @@ sub inbox_config ($$$) { url =3D https://example.com/$name/ url =3D http://example.onion/$name/ EOS - for my $k (qw(address listid infourl)) { + for my $k (qw(address listid infourl watchheader)) { defined(my $v =3D $ibx->{$k}) or next; $$txt .=3D "\t$k =3D $_\n" for @$v; } @@ -171,7 +171,7 @@ EOF } } =20 - for my $k (qw(filter newsgroup obfuscate replyto watchheader)) { + for my $k (qw(filter newsgroup obfuscate replyto)) { defined(my $v =3D $ibx->{$k}) or next; $$txt .=3D "\t$k =3D $v\n"; } diff --git a/t/watch_muliple_headers.t b/t/watch_muliple_headers.t new file mode 100644 index 00000000..8ea4b306 --- /dev/null +++ b/t/watch_muliple_headers.t @@ -0,0 +1,76 @@ +# Copyright (C) 2020 all contributors +# License: AGPL-3.0+ +use strict; +use Test::More; +use PublicInbox::Config; +use PublicInbox::TestCommon; +require_git(2.6); +require_mods(qw(Search::Xapian DBD::SQLite Filesys::Notify::Simple)); +my ($tmpdir, $for_destroy) =3D tmpdir(); +my $inboxdir =3D "$tmpdir/v2"; +my $maildir =3D "$tmpdir/md"; +use_ok 'PublicInbox::WatchMaildir'; +use_ok 'PublicInbox::Emergency'; +my $cfgpfx =3D "publicinbox.test"; +my $addr =3D 'test-public@example.com'; +my @cmd =3D ('-init', '-V2', 'test', $inboxdir, + 'http://example.com/list', $addr); +local $ENV{PI_CONFIG} =3D "$tmpdir/pi_config"; +ok(run_script(\@cmd), 'public-inbox init OK'); + +my $msg_to =3D < +Date: Sat, 18 Apr 2020 00:00:00 +0000 + +content1 +EOF + +my $msg_cc =3D < +Date: Sat, 18 Apr 2020 00:01:00 +0000 + +content2 +EOF + +my $msg_none =3D < +Date: Sat, 18 Apr 2020 00:02:00 +0000 + +content3 +EOF + +PublicInbox::Emergency->new($maildir)->prepare(\$msg_to); +PublicInbox::Emergency->new($maildir)->prepare(\$msg_cc); +PublicInbox::Emergency->new($maildir)->prepare(\$msg_none); + +my $cfg =3D <new(\$cfg); +PublicInbox::WatchMaildir->new($config)->scan('full'); +my $ibx =3D $config->lookup_name('test'); +ok($ibx, 'found inbox by name'); + +my $num =3D $ibx->mm->num_for('to@a.com'); +ok(defined $num, 'Matched for address in To:'); +my $num =3D $ibx->mm->num_for('cc@a.com'); +ok(defined $num, 'Matched for address in Cc:'); +$num =3D $ibx->mm->num_for('none@a.com'); +is($num, undef, 'No match without address in To: or Cc:'); + +done_testing; base-commit: b3f81ce0c71d5d4eca347f259b5ae69660a2cb13 --=20 2.26.0