From: David Bremner
To: notmuch@notmuchmail.org
Subject: v5 of regexp searching
Date: Thu, 16 Feb 2017 23:07:48 -0400
Message-Id: <20170217030754.32069-1-david@tethera.net>

The first 3 patches are general cleanup of the way that the query parser is configured.

[patch v5 1/6] lib: merge internal prefix tables
[patch v5 2/6] lib: Let Xapian manage the memory for FieldProcessors
[patch v5 3/6] lib: create field processors from prefix table

This is id:20170121135917.22062-1-david@tethera.net, revised with Tomi's comments, and error handling for missing trailing delimiter.
For the moment a missing initial delimiter is ignored, for future compatibility with paths.

[patch v5 4/6] lib: regexp matching in 'subject' and 'from'

The next two are RFC. We discussed several options for dealing with the fact that / is a valid message-id character. Here the idea is to use a separate prefix, mid:, for potential regexp searches. Jani had previously suggested trying both the regexp and non-regexp form of the query. That might be a more DWIM UI, but it would have the potentially surprising side effect of having id: match more than one message.

The first four patches can be merged earlier, if we want to continue thinking about this UI (or just drop the last two patches). Also, the id case needs doc and tests.

[patch v5 5/6] lib: add mid: as a synonym for id:
[patch v5 6/6] lib: Add regexp searching for mid: prefix
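
To make the delimiter rule above concrete, here is a rough sketch in Python. It is not the code from the patches (the library itself is C/C++ on top of Xapian); the names parse_field_value and UnmatchedDelimiterError are made up for illustration, and Python's re module stands in for whatever regex engine the library uses. It only shows the three cases described: /.../ is a regexp, a leading / with no trailing / is an error, and a value without a leading / is left as a plain term.

```python
import re
from typing import Pattern, Union


class UnmatchedDelimiterError(ValueError):
    """Hypothetical error: value opens with '/' but has no closing '/'."""


def parse_field_value(value: str) -> Union[str, Pattern[str]]:
    """Classify a field value as a regexp or a plain search term.

    Illustrative only; the real handling lives in the C++ query parser.
    """
    if value.startswith("/"):
        # A leading delimiter must be matched by a trailing one.
        if len(value) < 2 or not value.endswith("/"):
            raise UnmatchedDelimiterError(
                f"unmatched regex delimiter in {value!r}"
            )
        # Strip the delimiters and compile what is between them.
        return re.compile(value[1:-1])
    # No leading delimiter: treat as an ordinary term, even if the
    # value happens to end in '/' (e.g. something path-like).
    return value


if __name__ == "__main__":
    print(parse_field_value("/^bre.*er$/"))  # compiled pattern
    print(parse_field_value("plain/"))       # plain term, unchanged
```

Under this rule a message-id that itself begins and ends with / would be read as a regexp, which is the ambiguity the separate mid: prefix in the last two patches is meant to avoid.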