unofficial mirror of notmuch@notmuchmail.org
 help / color / mirror / code / Atom feed
From: Austin Clements <amdragon@MIT.EDU>
To: Jani Nikula <jani@nikula.org>
Cc: notmuch@notmuchmail.org
Subject: Re: [PATCH 0/5] lib: make folder: prefix literal
Date: Thu, 30 Jan 2014 17:02:34 -0500	[thread overview]
Message-ID: <20140130220234.GI4375@mit.edu> (raw)
In-Reply-To: <87iot8f4vg.fsf@nikula.org>

Quoth Jani Nikula on Jan 25 at  5:38 pm:
> On Sat, 25 Jan 2014, Jani Nikula <jani@nikula.org> wrote:
> > Perhaps we need to have two prefixes, one of which is the literal
> > filesystem folder and another which hides the implementation details,
> > like I mentioned in my mail to Peter [1]. But consider this: my proposed
> > implementation does cover *all* use cases.
> 
> Here's a thought. With boolean prefix folder:, we can devise a scheme
> where the folder: query defines what is to be matched.
> 
> For example:
> 
> folder:foo	match files in foo, foo/new, and foo/cur.
> folder:foo/	match all files in all subdirectories under foo (this
> 		would handle Tomi's use case), including foo/new and foo/cur.
> folder:foo/.	match in foo only, and specifically not in foo/cur or foo/new.
> folder:foo/new  match in foo/new, and specifically not in foo/cur (this
> 		allows distinguishing between messages in cur and new).
> folder:/	match everything.
> folder:/.	match in top level maildir only.
> folder:""	match in top level maildir, including cur/new.
> 
> This requires indexing all the path components with suitable
> suffixes. For example, a file "foo/new/baz" would get terms "/", "foo",
> "foo/", "foo/new", and "foo/new/.". A file foo/bar would get terms "/",
> "foo", "foo/", and "foo/.".
> 
> It's obviously a concern this increases the database size; not sure how
> it would compare with the current stemmed probabilistic prefix.
> 
> Opinions on this? This would really cover all use cases, and address
> Austin's interface and backward compatibility concerns.

I like this idea in general, though I agree with others that the
specific syntax seems a little wanting.  The concept of adding several
boolean terms seems powerful, and I would be surprised if the extra
terms had any substantive effect on database size.

However, it seems like this is overloading one prefix for two
meanings.  And I think that's because people want two similar but
distinct things.  Several of us want a simple, natural Maildir-aware
folder search (the Maildir folder of "a/b/cur/x:2," is "a/b").  Others
want file system search.  It's easy to conflate these because Maildir
represents folders as directory paths, but maybe they need to be
treated as distinct things.

What if we introduce two prefixes, say folder: and path: (maybe dir:?)
to address both use cases, each as naturally as possible?  Both would
be boolean prefixes because of the limitations of probabilistic
prefixes, but we could take advantage of Jani's idea of generating
several boolean terms.

folder: could work the way I suggested (simply the path to the file,
with {cur,new} stripped off).  path: would support file system search
uses.  These seem more varied, but I think fall into exact match and
recursive match.  Since I don't have this use case, I can't have any
strong opinions about syntax, but I'll throw out an idea: many shells
support "**" for recursive path matching and people are already quite
familiar with glob patterns for paths, so why not simply adopt this?
In other words, when adding the path "a/b/cur/x:2," add path: terms
"a/b/cur" and "a/b/**" and "a/**" and "**".

> BR,
> Jani.

  parent reply	other threads:[~2014-01-30 22:02 UTC|newest]

Thread overview: 31+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-01-09 22:18 [PATCH 0/5] lib: make folder: prefix literal Jani Nikula
2014-01-09 22:18 ` [PATCH 1/5] " Jani Nikula
2014-01-24 21:18   ` Austin Clements
2014-01-09 22:18 ` [PATCH 2/5] test: fix insert folder: searches Jani Nikula
2014-01-24 21:20   ` Austin Clements
2014-01-25 19:32     ` Rob Browning
2014-01-09 22:18 ` [PATCH 3/5] test: fix test for literal folder: search Jani Nikula
2014-01-09 22:18 ` [PATCH 4/5] test: add test database in format version 1 Jani Nikula
2014-01-09 22:18 ` [PATCH 5/5] test: add database upgrade test from " Jani Nikula
2014-01-24 21:17 ` [PATCH 0/5] lib: make folder: prefix literal Austin Clements
2014-01-24 23:21   ` David Bremner
2014-01-25  9:33   ` Jani Nikula
2014-01-25 10:47     ` Tomi Ollila
2014-01-25 11:06       ` Jani Nikula
2014-01-25 11:52         ` Tomi Ollila
2014-01-25 15:38     ` Jani Nikula
2014-01-25 16:58       ` David Bremner
2014-01-25 18:22         ` Jani Nikula
     [not found]       ` <874n4rvcvo.fsf@yoom.home.cworth.org>
2014-01-29 19:05         ` Jani Nikula
     [not found]           ` <87k3dir3ci.fsf@yoom.home.cworth.org>
2014-01-29 20:46             ` Austin Clements
     [not found]               ` <87bnyuqw60.fsf@yoom.home.cworth.org>
2014-01-30  6:34                 ` Jani Nikula
2014-01-30 21:15                   ` Mark Walters
2014-01-30 22:02       ` Austin Clements [this message]
2014-01-31 19:19         ` Rob Browning
2014-02-04 20:14           ` Austin Clements
2014-02-04 20:17             ` Rob Browning
2014-01-31 19:24         ` Rob Browning
2014-02-01 14:54         ` Jani Nikula
2014-02-04 20:02           ` Austin Clements
2014-02-05 13:12             ` Tomi Ollila
2014-02-05 21:12               ` Tomi Ollila

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://notmuchmail.org/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140130220234.GI4375@mit.edu \
    --to=amdragon@mit.edu \
    --cc=jani@nikula.org \
    --cc=notmuch@notmuchmail.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://yhetil.org/notmuch.git/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).