From: zimoun
Subject: Re: Inverted index to accelerate guix package search
Date: Thu, 16 Jan 2020 21:08:30 +0100
References: <87a76r68u6.fsf@ambrevar.xyz> <87muaqnmod.fsf@ambrevar.xyz>
To: Arun Isaac
Cc: Guix Devel

Hi,

On Thu, 16 Jan 2020 at 20:06, Arun Isaac wrote:

> I looked up xapian's features at https://xapian.org/features and it is
> quite impressive. I was introduced to xapian through notmuch. notmuch
> does not utilize xapian to the fullest and I therefore ended up
> underestimating its value. Of particular importance might be the
> following.
>
> - Relevance feedback - given one or more documents, Xapian can suggest
>   the most relevant index terms to expand a query, suggest related
>   documents, categorise documents, etc.
> - Phrase and proximity searching - users can search for words occurring
>   in an exact phrase or within a specified number of words, either in a
>   specified order, or in any order.
> - Supports stemming of search terms (e.g. a search for "football" would
>   match documents which mention "footballs" or "footballer")

Yes, Xapian rocks! :-)

> For the time being, since we don't have xapian bindings, I think we
> should settle for sqlite's full text search capabilities.
>
> https://www.sqlite.org/fts5.html

Thank you for the pointer. I am looking at it.
Maybe a good ol' SQL query could improve things. :-)

All the best,
simon
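
For illustration only, here is a rough sketch of how SQLite's FTS5 could
cover part of the feature list quoted above (stemming, phrase search,
proximity search, relevance-ordered results). It is not what Guix does.
It assumes a Python sqlite3 module linked against an SQLite build with
FTS5 enabled; the table name, the column names, the helper functions and
the sample rows are all hypothetical and are not real Guix metadata.

```python
import sqlite3

# Hypothetical sample rows: (name, synopsis, description).
# These strings are made up for the example, not taken from Guix.
PACKAGES = [
    ("xapian", "Search engine library",
     "Probabilistic full text search with stemming and relevance ranking."),
    ("notmuch", "Thread-based email index and search",
     "Fast tagging and search of mail, built on Xapian."),
    ("sqlite", "Embedded SQL database",
     "Self-contained SQL engine with optional full text search extension."),
    ("hello", "Example GNU package",
     "Prints a friendly greeting."),
]


def build_index(packages):
    """Create an in-memory FTS5 table and load the package rows into it."""
    conn = sqlite3.connect(":memory:")
    # The 'porter' tokenizer wraps 'unicode61' and adds English stemming,
    # so "searching" and "search" are indexed as the same term.
    conn.execute(
        "CREATE VIRTUAL TABLE packages USING fts5("
        "name, synopsis, description, tokenize = 'porter unicode61')"
    )
    conn.executemany(
        "INSERT INTO packages(name, synopsis, description) VALUES (?, ?, ?)",
        packages,
    )
    return conn


def search(conn, query, limit=10):
    """Return package names matching an FTS5 query, best match first."""
    # 'rank' is FTS5's built-in relevance column (bm25 by default).
    rows = conn.execute(
        "SELECT name FROM packages WHERE packages MATCH ? "
        "ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return [name for (name,) in rows]


if __name__ == "__main__":
    conn = build_index(PACKAGES)
    print(search(conn, "searching"))              # stemmed term
    print(search(conn, '"full text search"'))     # exact phrase
    print(search(conn, "NEAR(email search, 3)"))  # proximity
```

Xapian's relevance feedback (query expansion, related documents) has no
direct counterpart in this sketch; FTS5 only supplies the ranking of
matches.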