Subject: [bug#39258] [PATCH v2 0/3] Xapian for Guix package search
References: <20200307133116.11443-1-arunisaac@systemreboot.net> <87sgijgb1v.fsf@gnu.org>
In-Reply-To: <87sgijgb1v.fsf@gnu.org>
From: zimoun
Date: Mon, 9 Mar 2020 13:34:40 +0100
To: Ludovic Courtès
Cc: Arun Isaac, Pierre Neidhardt, 39258@debbugs.gnu.org

Hi,

On Sat, 7 Mar 2020 at 21:33, Ludovic Courtès wrote:
> Arun Isaac skribis:

> > Why not use a simpler package search results format like Arch Linux or Debian
> > does? We could just display the package name, version and synopsis like so.
> >
> > inkscape 0.92.4
> > Vector graphics editor
> > inklingreader 0.8
> > Wacom Inkling sketch format conversion and manipulation
> >
> > Why do we need the entire recutils format? If the user is interested, they can
> > always use `guix package --show` to get the full recutils formatted
> > info.
> > Having shorter search results will make everything even faster and much
> > more readable. WDYT?
>
> What I like about the recutils format in this context is that it’s both
> human- and machine-readable. The examples in the manual show how it can
> be useful to select the information displayed or to refine the search
> (info "(guix) Invoking guix package").
>
> Also: I’d recommend tackling one thing at a time. :-)

I agree with Ludo.

And IMHO, we should add "guix search --format=", mimicking how "git log"
works. By default, it would display results as Arun proposes. Using
"--format=full" would give the output that is currently the default. And
we could imagine "--format=%name \t %version \n %description", etc.

> I think we should look at a profile of ‘package->recutils’, there’s
> probably room for improvement there.

Interesting.

Note that speed was the initial motivation, but accuracy is another
important one, as we discussed earlier when I showed an example with
TF-IDF. And Xapian implements the state of the art (BM25) for scoring.

All the best,
simon
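
For illustration only, here is a minimal Python sketch of how a
"--format" template like the one suggested above could be expanded. The
flag does not exist in Guix; the function name `format_package`, the
`%synopsis` placeholder and the dictionary representation of a package
are all assumptions made up for this example (the message itself only
mentions %name, %version and %description).

```python
import re


def format_package(template: str, package: dict) -> str:
    """Expand %field placeholders in a hypothetical --format template.

    `package` is an assumed dict such as {"name": ..., "version": ...};
    this is not how Guix represents packages.
    """
    # Expand the escapes written literally in the template first, so that
    # backslashes inside field values are never reinterpreted.
    text = template.replace(r"\t", "\t").replace(r"\n", "\n")

    def expand(match: re.Match) -> str:
        field = match.group(1)
        # Unknown placeholders are left untouched rather than raising.
        return str(package.get(field, match.group(0)))

    return re.sub(r"%([a-z]+)", expand, text)


# Example data taken from the search results quoted in this thread.
inkscape = {
    "name": "inkscape",
    "version": "0.92.4",
    "synopsis": "Vector graphics editor",
}

# A short, two-line layout in the spirit of Arun's proposal.
print(format_package(r"%name %version\n%synopsis", inkscape))
```

A "full" format could then simply be a predefined template name that
maps to the existing recutils output, so the current behaviour would
remain available.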