From: zimoun
Date: Mon, 11 May 2020 13:05:21 +0200
Subject: "guix search" needs your help
To: Guix Devel

Dear,

I am lacking imagination and I need your help with:

 - the query you would type,
 - then the packages you expect from that query, in order.

Example:

  guix search source control management   # expect: git, mercurial, svn, then others
  guix search r cytometry                 # expect: r-flowcore, r-flowsom, then others

With your suggestions, I can benchmark some ranking functions.

As reported, the experience with "guix search" is sometimes poor. Aside from the issue of the pager or the output format, two directions for improvement are discussed in this bug report [1]: faster and more accurate. The benchmark for "faster" is straightforward and the exploration is ongoing. :-)

Currently, the packages are ranked according to 'relevance'. There is good, bad and ugly in this strategy, and the state-of-the-art Okapi BM25 ranking function has to be evaluated to know whether such an information retrieval strategy is worthwhile for searching packages. Thanks to Arun's guile-xapian bindings, the BM25 ranking is easy to benchmark since all the tough work is delegated to Xapian.

[1] http://issues.guix.gnu.org/39258

From my point of view, such pairs (query terms, expected packages) would help to improve the tools:

 - maybe "guix search",
 - maybe "guix lint", by establishing some metrics indicating whether the synopsis and description are relevant, compared to all the other packages.

Thank you in advance for helping me explore these ideas.

Best regards,
simon
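
PS: To make the idea of a (query terms, expected packages) pair concrete, here is a minimal sketch in Python of how one pair could be scored against a BM25 ranking. It is only an illustration under stated assumptions: the toy corpus, the names `PACKAGES`, `bm25_rank` and `reciprocal_rank`, and the parameters `k1=1.2`, `b=0.75` are all hypothetical, and this is neither what "guix search" does today nor how Xapian implements BM25.

```python
import math
from collections import Counter

# Hypothetical toy corpus: package name -> synopsis-like text.
# These strings are invented for illustration, not real Guix metadata.
PACKAGES = {
    "git": "distributed source control management system",
    "mercurial": "decentralized source control management",
    "subversion": "centralized revision control system",
    "emacs": "extensible text editor",
}


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bm25_rank(query, docs, k1=1.2, b=0.75):
    """Return document names sorted by decreasing Okapi BM25 score."""
    tokens = {name: tokenize(text) for name, text in docs.items()}
    n_docs = len(tokens)
    avg_len = sum(len(t) for t in tokens.values()) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for t in tokens.values() for term in set(t))
    scores = {}
    for name, terms in tokens.items():
        tf = Counter(terms)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            # Smoothed inverse document frequency, always positive.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(terms) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores[name] = score
    # Ties are broken alphabetically so the output is deterministic.
    return sorted(scores, key=lambda name: (-scores[name], name))


def reciprocal_rank(ranking, expected):
    """Return 1 / position of the first expected package, or 0.0 if none."""
    for position, name in enumerate(ranking, start=1):
        if name in expected:
            return 1.0 / position
    return 0.0


ranking = bm25_rank("source control management", PACKAGES)
print(ranking)
print(reciprocal_rank(ranking, {"git", "mercurial"}))
```

Averaging such a score over many pairs would give one number per ranking function, which is why the pairs are useful. The reciprocal rank only looks at the first expected package; a metric that also rewards the expected order would probably fit the examples above better.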