From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from localhost (localhost [127.0.0.1]) by arlo.cworth.org (Postfix) with ESMTP id 32D4F6DE026D for ; Wed, 18 Apr 2018 03:18:13 -0700 (PDT) X-Virus-Scanned: Debian amavisd-new at cworth.org X-Spam-Flag: NO X-Spam-Score: -0.7 X-Spam-Level: X-Spam-Status: No, score=-0.7 tagged_above=-999 required=5 tests=[AWL=0.000, RCVD_IN_DNSWL_LOW=-0.7] autolearn=disabled Received: from arlo.cworth.org ([127.0.0.1]) by localhost (arlo.cworth.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id HDOrl2QtokR8 for ; Wed, 18 Apr 2018 03:18:11 -0700 (PDT) Received: from mx0a-001b2d01.pphosted.com (mx0a-001b2d01.pphosted.com [148.163.156.1]) by arlo.cworth.org (Postfix) with ESMTPS id C3AFC6DE026C for ; Wed, 18 Apr 2018 03:18:11 -0700 (PDT) Received: from pps.filterd (m0098409.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.22/8.16.0.22) with SMTP id w3IAEif3088135 for ; Wed, 18 Apr 2018 06:18:09 -0400 Received: from e06smtp12.uk.ibm.com (e06smtp12.uk.ibm.com [195.75.94.108]) by mx0a-001b2d01.pphosted.com with ESMTP id 2he2mc4n2s-1 (version=TLSv1.2 cipher=AES256-SHA256 bits=256 verify=NOT) for ; Wed, 18 Apr 2018 06:18:08 -0400 Received: from localhost by e06smtp12.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Wed, 18 Apr 2018 11:18:06 +0100 Received: from b06cxnps3074.portsmouth.uk.ibm.com (9.149.109.194) by e06smtp12.uk.ibm.com (192.168.101.142) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; Wed, 18 Apr 2018 11:18:03 +0100 Received: from d06av26.portsmouth.uk.ibm.com (d06av26.portsmouth.uk.ibm.com [9.149.105.62]) by b06cxnps3074.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id w3IAI3ii11796942; Wed, 18 Apr 2018 10:18:03 GMT Received: from d06av26.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 7FD45AE057; Wed, 18 Apr 2018 11:07:52 +0100 (BST) Received: from d06av26.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 587F4AE045; Wed, 18 Apr 2018 11:07:52 +0100 (BST) Received: from localhost (unknown [9.124.35.28]) by d06av26.portsmouth.uk.ibm.com (Postfix) with ESMTP; Wed, 18 Apr 2018 11:07:52 +0100 (BST) Date: Wed, 18 Apr 2018 15:48:00 +0530 From: "Naveen N. Rao" Subject: Re: 'notmuch search thread:<>' lists multiple threads To: David Bremner , notmuch@notmuchmail.org References: <1523007700.l8xm6nm6af.naveen@linux.ibm.com> <87sh86v1oc.fsf@tethera.net> <878t9wvbmu.fsf@tethera.net> In-Reply-To: <878t9wvbmu.fsf@tethera.net> User-Agent: astroid/0.11.1 (https://github.com/astroidmail/astroid) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: quoted-printable X-TM-AS-GCONF: 00 x-cbid: 18041810-0008-0000-0000-000004EC9F0C X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 18041810-0009-0000-0000-00001E80A8A5 Message-Id: <1524045467.a0aq8zermb.naveen@linux.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:, , definitions=2018-04-18_02:, , signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1011 lowpriorityscore=0 impostorscore=0 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1709140000 definitions=main-1804180095 X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.26 Precedence: list List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 18 Apr 2018 10:18:13 -0000 David Bremner wrote: > David Bremner writes: >=20 >> At least some of this mail data is public, but I'm not sure if the bad >> threading is reproducible or not; I want to run a complete census >> overnight before I reindex. >> >> Even if the bug is non-deterministic, it probably lives in lib/add-messa= ge.cc >=20 > I have a reproducible test for this bug now >=20 > http://pivot.cs.unb.ca/git?p=3Dnotmuch.git;a=3Dshortlog;h=3Drefs/heads/= fix/thread-search Thanks for looking into this. >=20 > I still need to analyze the mails a bit more, but it looks like at least > one of the strange results is caused by multiple mail files sharing the > same message-id, but with different References headers (and no > In-Reply-To headers). In my case, I seem to be having the In-Reply-To headers. I end up with=20 two files per message: one from my inbox and one from the gmane archive=20 that I pull in. All the messages from the gmane archive seem to have a=20 re-written 'In-Reply-To' header, but 'Message-Id' and 'References' are=20 the same. In the problematic email thread, all other files/messages get allotted a=20 single thread except for one of the messages. The offending message has=20 3 references compared to 1 or 2 references for the rest, but I don't=20 know if that's relevant here. - Naveen =