From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from localhost (localhost [127.0.0.1]) by arlo.cworth.org (Postfix) with ESMTP id A33B56DE0B2B for ; Sun, 2 Apr 2017 07:52:49 -0700 (PDT) X-Virus-Scanned: Debian amavisd-new at cworth.org X-Spam-Flag: NO X-Spam-Score: -0.005 X-Spam-Level: X-Spam-Status: No, score=-0.005 tagged_above=-999 required=5 tests=[AWL=0.006, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=disabled Received: from arlo.cworth.org ([127.0.0.1]) by localhost (arlo.cworth.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id EY9RKEJ32j9U for ; Sun, 2 Apr 2017 07:52:48 -0700 (PDT) Received: from fethera.tethera.net (fethera.tethera.net [198.245.60.197]) by arlo.cworth.org (Postfix) with ESMTPS id 72FFC6DE0B25 for ; Sun, 2 Apr 2017 07:52:48 -0700 (PDT) Received: from remotemail by fethera.tethera.net with local (Exim 4.84_2) (envelope-from ) id 1cugrK-0003L7-GK; Sun, 02 Apr 2017 10:52:02 -0400 Received: (nullmailer pid 12427 invoked by uid 1000); Sun, 02 Apr 2017 14:52:45 -0000 From: David Bremner To: Daniel Kahn Gillmor , Notmuch Mail Subject: Re: [PATCH] WIP: remove all non-prefixed-terms (and stemmed versions) In-Reply-To: <1471178598-9639-1-git-send-email-david@tethera.net> References: <1467970047-8013-16-git-send-email-dkg@fifthhorseman.net> <1471178598-9639-1-git-send-email-david@tethera.net> Date: Sun, 02 Apr 2017 11:52:45 -0300 Message-ID: <871stahneq.fsf@tethera.net> MIME-Version: 1.0 Content-Type: text/plain X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.22 Precedence: list List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 02 Apr 2017 14:52:49 -0000 David Bremner writes: > The testing here is not really suitable for production, since we export > a function just for testing. It would be possible to modify the test > framework to test functions in notmuch-private.h, but this was the quick > and dirty solution. On looking at the problem a second time I think this should really drop all of the non-(tag|property) terms, so including some prefixed terms as well. I think that would be doable; it's probably worth having some performance benchmark before introducing the extra complication versus dkg's approach. d