unofficial mirror of meta@public-inbox.org
 help / color / mirror / Atom feed
From: Eric Wong <e@80x24.org>
To: "Robin H. Johnson" <robbat2@gentoo.org>
Cc: meta@public-inbox.org
Subject: Re: more debugging for gentoo usage & supporting feature requests
Date: Mon, 22 Jul 2024 19:40:55 +0000	[thread overview]
Message-ID: <20240722194055.M603030@dcvr> (raw)
In-Reply-To: <robbat2-20240722T011301-042174550Z@orbis-terrarum.net>

"Robin H. Johnson" <robbat2@gentoo.org> wrote:
> Hi,
> 
> I moved the Gentoo instance to a much beefier machine & newer kernel,
> ingest is a lot faster; but there's still some hiccups.
> 
> 1. Request for more debugging details about mails: Seems that many of
> our oldest mails don't get ingested - and there's no output about why.
> I don't know if -watch actually scanned that folder or not.

I started working on it, but got sidetracked with some bugs in the
FakeInotify implementation on low-time-resolution FS :x

> 1.1. Possibly related:
> Intended config is that the mail should be ingested regardless of the
> email address on the headers. Way back in time, the Gentoo lists were
> renamed a few times, and the files are sorted into the correct folders.
> I think this impacted any attempted ingest via -mda because there's no
> other way to override what list a given mail on stdin should be
> associated with.
> 
> The headers may be inconsistent, changed style, name, or even be absent
> in a few cases.

Fwiw, there can be multiple publicinbox.*.address directives for
a given inbox.  You can also use publicinbox.*.watchheader to
match arbitrary headers (e.g. List-Id, X-BeenThere, etc...)
I think "public-inbox-ctl import" will be needed to handle
odd messages without any matching headers

> 2.
> What's the intended way for public-inbox-mda to function with no
> SpamAssassin installed at all? "spamcheck = " doesn't seem to do it.

spamcheck=none

You can also use --no-precheck to disabl some builtin rules.

> 3.
> As a formal feature request:
> Change the arguments of: public-inbox-watch 
> - Add --all to mean all lists in the config
> - no arguments => implicit --all
> - $LISTNAME/$INBOXPATH => one *OR* more inboxes manually specified.
> 
> I did a hacky split of the configuration for Gentoo, and things are a
> LOT more stable with 120 instances; but it's a little wasteful: I'd like
> to give the high-traffic lists their own instance, and group the
> low-traffic instances together.

Fwiw, the IMAP code for watch is already 1:1
process:IMAP-mailbox because of the Mail::IMAPClient API.

How about making that an option for Maildirs, too and at least get
some benefit from copy-on-write memory savings...

> 4.
> Make public-inbox-init NOT attempt to write to any configuration files.
> 
> Trying to implement segregation of roles:
> - config files owned by root only; readable by public-inbox users.

OK, it'd probably have to write a $INBOX_DIR/config.snippet.sample
file with comments, then..

> - source maildirs read-only to user running public-inbox-watch
> - public-inbox dirs writable to user running public-inbox-watch
> - public-inbox dirs readable to user running public-inbox-httpd

The last 3 has been what I've been doing since the beginning.

      parent reply	other threads:[~2024-07-22 19:41 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-07-22  1:27 more debugging for gentoo usage & supporting feature requests Robin H. Johnson
2024-07-22  6:23 ` more debugging for gentoo usage & supporting feature requests - infinite loop on disk full Robin H. Johnson
2024-07-22 19:44   ` Eric Wong
2024-07-23 21:30   ` Eric Wong
2024-07-22 19:40 ` Eric Wong [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://public-inbox.org/README

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20240722194055.M603030@dcvr \
    --to=e@80x24.org \
    --cc=meta@public-inbox.org \
    --cc=robbat2@gentoo.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).