* Statistics about senders with notmuch possible? @ 2021-12-27 22:57 signmeup1337 2021-12-28 10:47 ` Michael J Gruber 2022-01-07 17:32 ` Carl Worth 0 siblings, 2 replies; 3+ messages in thread From: signmeup1337 @ 2021-12-27 22:57 UTC (permalink / raw) To: notmuch I would like to create some kind of statistics to see from which sender I've got the most emails, having something like a TOP 10 list. In the end of every year I delete all my emails, which are not marked as archive and this year I got over 4.000 and most of them are just notifications (like ebay, if someone buys something). But I really would like to figure out where else the most mails are coming from to avoid them in the next year. Is something like this possible with notmuch? Thanks a lot for you help in advance :) ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Statistics about senders with notmuch possible? 2021-12-27 22:57 Statistics about senders with notmuch possible? signmeup1337 @ 2021-12-28 10:47 ` Michael J Gruber 2022-01-07 17:32 ` Carl Worth 1 sibling, 0 replies; 3+ messages in thread From: Michael J Gruber @ 2021-12-28 10:47 UTC (permalink / raw) To: notmuch signmeup1337@cock.li venit, vidit, dixit 2021-12-27 23:57:18: > I would like to create some kind of statistics to see from which sender > I've got the most emails, having something like a TOP 10 list. > > In the end of every year I delete all my emails, which are not marked as > archive and this year I got over 4.000 and most of them are just > notifications (like ebay, if someone buys something). But I really would > like to figure out where else the most mails are coming from to avoid > them in the next year. > > Is something like this possible with notmuch? `notmuch address` should provide everything you want (bar the sorting): `notmuch address --output=count --deduplicate=address date:2021 | sort -nr | head -n 10` Leave out the dedup option if you want to distinguish senders by the full mailbox address. And, of course, amend the query any way you want. That's the beauty of notmuch's modular approach. Cheers Michael ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Statistics about senders with notmuch possible? 2021-12-27 22:57 Statistics about senders with notmuch possible? signmeup1337 2021-12-28 10:47 ` Michael J Gruber @ 2022-01-07 17:32 ` Carl Worth 1 sibling, 0 replies; 3+ messages in thread From: Carl Worth @ 2022-01-07 17:32 UTC (permalink / raw) To: signmeup1337, notmuch [-- Attachment #1.1: Type: text/plain, Size: 2159 bytes --] On Mon, Dec 27 2021, signmeup wrote: > I would like to create some kind of statistics to see from which sender > I've got the most emails, having something like a TOP 10 list. > [ 4 more citation lines. Click/Enter to show. ] > In the end of every year I delete all my emails, which are not marked as > archive and this year I got over 4.000 and most of them are just > notifications (like ebay, if someone buys something). But I really would > like to figure out where else the most mails are coming from to avoid > them in the next year. > > Is something like this possible with notmuch? Yes, it's definitely possible. Being able to script up operations like this that act on your email store has always been a primary design goal of notmuch, so this isn't actually hard to do. Given any sender address you can ask notmuch to count the emails sent from that address. Such as: notmuch count from:cworth@cworth.org And you can also use notmuch to generate a list of all sender email addresses across all of your email, with: notmuch address --output=sender --output=address '*' So you can take the output from that second command and then run the first command with email address in the output. Doing that all combined, and the sorting the result might look something like the following (assuming you are using bash as your shell): for sender in $(notmuch address --output=sender --output=address '*'); do count=$(notmuch count from:$sender); echo "$count $sender"; done | sort -n If you wanted to run this on a subset of your email, just change the '*' to a search specification for the messages you want to match, (where '*' will match all messages). And of course, this is running a separate notmuch search for every sender email address you have in all of your emails, (not only the top 10 like you really want), so this could take some time if you have a lot of messages. I just tested with a small subset of my own email messages (the 21k+ messages I have matching 'subject:notmuch') and the above script took about 30 seconds on my laptop. I hope that helps, -Carl [-- Attachment #1.2: signature.asc --] [-- Type: application/pgp-signature, Size: 832 bytes --] [-- Attachment #2: Type: text/plain, Size: 0 bytes --] ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2022-01-07 17:41 UTC | newest] Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2021-12-27 22:57 Statistics about senders with notmuch possible? signmeup1337 2021-12-28 10:47 ` Michael J Gruber 2022-01-07 17:32 ` Carl Worth
Code repositories for project(s) associated with this public inbox https://yhetil.org/notmuch.git/ This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).