From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from localhost (localhost [127.0.0.1]) by arlo.cworth.org (Postfix) with ESMTP id 7EE546DE0352 for ; Sun, 18 Aug 2019 20:29:47 -0700 (PDT) X-Virus-Scanned: Debian amavisd-new at cworth.org X-Spam-Flag: NO X-Spam-Score: -1.314 X-Spam-Level: X-Spam-Status: No, score=-1.314 tagged_above=-999 required=5 tests=[AWL=1.187, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001] autolearn=disabled Received: from arlo.cworth.org ([127.0.0.1]) by localhost (arlo.cworth.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id wM-mNh07AXGZ for ; Sun, 18 Aug 2019 20:29:45 -0700 (PDT) Received: from che.mayfirst.org (che.mayfirst.org [162.247.75.118]) by arlo.cworth.org (Postfix) with ESMTPS id 54DB16DE0279 for ; Sun, 18 Aug 2019 20:29:44 -0700 (PDT) DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/simple; d=fifthhorseman.net; i=@fifthhorseman.net; q=dns/txt; s=2019; t=1566185383; h=from : to : subject : in-reply-to : references : date : message-id : mime-version : content-type : from; bh=3SJqcfiEiwdZcuLKpUuedy82RoQHJwJHKePmF1aoGqk=; b=wx4DNdHSKHKz6KnxW5rq5uu1JiwQWQnSGx/5CfeiPlu+CLBvnMeQmr/p 5hL274Uu2Hiob7ovLZJrQOSIfrlfDA== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=fifthhorseman.net; i=@fifthhorseman.net; q=dns/txt; s=2019rsa; t=1566185382; h=from : to : subject : in-reply-to : references : date : message-id : mime-version : content-type : from; bh=3SJqcfiEiwdZcuLKpUuedy82RoQHJwJHKePmF1aoGqk=; b=1G8rQpZg/8y3TrzW6z4QmDoB672IayJF1Qc9KjHwbnVLWBhrvxk+oYIV 8myOvs8y+J+H5yDzEXPve8GTfuqLQKktTwZ44G4R1Hdnb9wsAk0RMJGuHt yz0i6/KiKQcX2A1XIBJi7VeaTPZe3lsWYlvZWehHvg5FF55ulVgeR7HuFn 9v1RJqfOQsvJtXxa8LN+JGycwFgFD2siwFrft0waEGeB1Nb2PEpFoBKRDF frB+vyU05eWghvRcPA7cAzYlHKdG/TVzDotCS8BiIG6ifYQUSX1nHd1/2/ Oqz32/3SVcz2Da07iZ7J6RTlNhRFco8PI00gUsmMTLFDmstpKPi++g== Received: from fifthhorseman.net (ool-6c3a0662.static.optonline.net [108.58.6.98]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by che.mayfirst.org (Postfix) with ESMTPSA id CC189F99D; Sun, 18 Aug 2019 23:29:41 -0400 (EDT) Received: by fifthhorseman.net (Postfix, from userid 1000) id 2FEA72058E; Sun, 18 Aug 2019 23:20:09 +0300 (EEST) From: Daniel Kahn Gillmor To: "Jorge P. de Morais Neto" , Notmuch Mail Subject: Re: viewing duplicate messages In-Reply-To: <87k1bb8mr9.fsf@disroot.org> References: <877ea7zyss.fsf@caltech.edu> <87pnnztwxh.fsf@disroot.org> <87h89bilzy.fsf@fifthhorseman.net> <87k1bb8mr9.fsf@disroot.org> Autocrypt: addr=dkg@fifthhorseman.net; prefer-encrypt=mutual; keydata= mDMEXEK/AhYJKwYBBAHaRw8BAQdAr/gSROcn+6m8ijTN0DV9AahoHGafy52RRkhCZVwxhEe0K0Rh bmllbCBLYWhuIEdpbGxtb3IgPGRrZ0BmaWZ0aGhvcnNlbWFuLm5ldD6ImQQTFggAQQIbAQUJA8Jn AAULCQgHAgYVCgkICwIEFgIDAQIeAQIXgBYhBMS8Lds4zOlkhevpwvIGkReQOOXGBQJcQsbzAhkB AAoJEPIGkReQOOXG4fkBAO1joRxqAZY57PjdzGieXLpluk9RkWa3ufkt3YUVEpH/AP9c+pgIxtyW +FwMQRjlqljuj8amdN4zuEqaCy4hhz/1DbgzBFxCv4sWCSsGAQQB2kcPAQEHQERSZxSPmgtdw6nN u7uxY7bzb9TnPrGAOp9kClBLRwGfiPUEGBYIACYWIQTEvC3bOMzpZIXr6cLyBpEXkDjlxgUCXEK/ iwIbAgUJAeEzgACBCRDyBpEXkDjlxnYgBBkWCAAdFiEEyQ5tNiAKG5IqFQnndhgZZSmuX/gFAlxC v4sACgkQdhgZZSmuX/iVWgD/fCU4ONzgy8w8UCHGmrmIZfDvdhg512NIBfx+Mz9ls5kA/Rq97vz4 z48MFuBdCuu0W/fVqVjnY7LN5n+CQJwGC0MIA7QA/RyY7Sz2gFIOcrns0RpoHr+3WI+won3xCD8+ sVXSHZvCAP98HCjDnw/b0lGuCR7coTXKLIM44/LFWgXAdZjm1wjODbg4BFxCv50SCisGAQQBl1UB BQEBB0BG4iXnHX/fs35NWKMWQTQoRI7oiAUt0wJHFFJbomxXbAMBCAeIfgQYFggAJhYhBMS8Lds4 zOlkhevpwvIGkReQOOXGBQJcQr+dAhsMBQkB4TOAAAoJEPIGkReQOOXGe/cBAPlek5d9xzcXUn/D kY6jKmxe26CTws3ZkbK6Aa5Ey/qKAP0VuPQSCRxA7RKfcB/XrEphfUFkraL06Xn/xGwJ+D0hCw== Date: Sun, 18 Aug 2019 16:20:08 -0400 Message-ID: <87mug6kyyv.fsf@fifthhorseman.net> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 19 Aug 2019 03:29:47 -0000 --=-=-= Content-Type: text/plain On Sat 2019-08-17 19:12:26 -0300, Jorge P. de Morais Neto wrote: > I have attached a tarball with three homonymous messages from Dell. The > last (most recent) two have the same subject and bodies, but the first > (earliest) one is different and yet they all have Message-Id 1. I have > included the Notmuch list as a recipient because the tarball is a mere > 11252B. thanks for this. Looking at the headers, it occurs to me that the problem might actually be that Dell ("idd_messaging_email@dell.com") might not including a message-id header at all, and it is being added their IronPort/Sophos AV client as it passes through their mail system. I suspect this possibility because the placement of the Message-ID header itself is supiciiously high up in the list of headers (it looks like it might have been placd there by the initial relaying MTA, rather than the MUA). If this is the case, it could be solved in one of two ways: they could inject a proper unique Message-ID before handing the message off to IronPort; or they could fix their IronPort appliance to inject a proper unique Message-ID header. That's all about fixing it on the sender side though. Are there possible fixes on the receiving side? one thought is that notmuch could treat an obviously low-entropy message-ID the same way that it treats a message with no Message-ID at all. Of course, that raises the question: what is a low-entropy message ID? A single-character message-id is pretty clearly too low-entropy to be useful, but if we said "1-character long" was too short, it would at least avoid this particular mistake. i also note that NEWS claims (in the section for notmuch 0.17) that notmuch treats "overlong" message-ids in the same way as missing message-ids, but i don't see where that distinction is done in the code. It doesn't appear to be in lib/message-file.c, where the notmuch-sha1-* generation is done. But anyway, if we are treating "overlong" message-ids as missing, it's nicely symmetric to treat "overshort" message-ids in the same way. --dkg --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iHUEARYKAB0WIQTJDm02IAobkioVCed2GBllKa5f+AUCXVmy+AAKCRB2GBllKa5f +G+2AP0UC5OhCNZiPOHknC9fd/IeI1+5ORZvodncwP83mMJzMgD/bgAYClxvsDde +52c4u7DtTX08y4u3K+pw56lAG84Ig4= =33dA -----END PGP SIGNATURE----- --=-=-=--