From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp2 ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms11 with LMTPS id wIZnBuuyR19zLgAA0tVLHw (envelope-from ) for ; Thu, 27 Aug 2020 13:19:39 +0000 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp2 with LMTPS id SIVEAuuyR18LEAAAB5/wlQ (envelope-from ) for ; Thu, 27 Aug 2020 13:19:39 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 911959404D3 for ; Thu, 27 Aug 2020 13:19:38 +0000 (UTC) Received: from localhost ([::1]:52602 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kBHoX-0000Xe-Be for larch@yhetil.org; Thu, 27 Aug 2020 09:19:37 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:42834) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kBHoO-0000XG-6V for guix-devel@gnu.org; Thu, 27 Aug 2020 09:19:28 -0400 Received: from relay2-d.mail.gandi.net ([217.70.183.194]:58561) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kBHoL-0003Yh-Pc for guix-devel@gnu.org; Thu, 27 Aug 2020 09:19:27 -0400 X-Originating-IP: 90.92.160.122 Received: from bababa (lfbn-idf2-1-1094-122.w90-92.abo.wanadoo.fr [90.92.160.122]) (Authenticated sender: mail@ambrevar.xyz) by relay2-d.mail.gandi.net (Postfix) with ESMTPSA id AA99440007; Thu, 27 Aug 2020 13:19:22 +0000 (UTC) From: Pierre Neidhardt To: zimoun , guix-devel@gnu.org Subject: Re: File search progress: database review and question on triggers In-Reply-To: <865z94dz83.fsf@gmail.com> References: <87sgcuh8rb.fsf@ambrevar.xyz> <86imd4e7cr.fsf@gmail.com> <87eenspcf8.fsf@ambrevar.xyz> <865z94dz83.fsf@gmail.com> Date: Thu, 27 Aug 2020 15:19:22 +0200 Message-ID: <87zh6gns4l.fsf@ambrevar.xyz> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha256; protocol="application/pgp-signature" Received-SPF: pass client-ip=217.70.183.194; envelope-from=mail@ambrevar.xyz; helo=relay2-d.mail.gandi.net X-detected-operating-system: by eggs.gnu.org: First seen = 2020/08/27 09:19:23 X-ACL-Warn: Detected OS = Linux 3.11 and newer X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, FROM_SUSPICIOUS_NTLD=0.499, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_PDS_OTHER_BAD_TLD=0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: guix-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Development of GNU Guix and the GNU System distribution." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-devel-bounces+larch=yhetil.org@gnu.org Sender: "Guix-devel" X-Scanner: scn0 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of guix-devel-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=guix-devel-bounces@gnu.org X-Spam-Score: -3.11 X-TUID: vVTaGyc6iqTV --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable zimoun writes: > I am not sure to see how. One needs all the database to search inside > and cannot know in advance which packages etc.. Contrary to =E2=80=9Cgui= x size=E2=80=9D > which manipulates the graph and then download the missing parts. > > Therefore, your suggestion is to download all the database the first > time the user run =E2=80=9Cguix filesearch=E2=80=9D, i.e., download ~10Mi= B. Yes. If the user is not using substitutes, they can also compute the database from their local store items (like I'm doing in the graft). It's less useful but still a bit more convenient than running find/locate on /gnu/store. > Then each time the user runs =E2=80=9Cguix pull=E2=80=9D then =E2=80=9Cgu= ix filesearch=E2=80=9D, two > options, either download the new database for this last Guix generation, > or either download a diff (not sure the complexity is worth for ~10MiB). > > Right? Yes. > And what about the channels? Because if I read correctly, =E2=80=9Cguix = size=E2=80=9D > fails when =E2=80=9Cno available substitute information=E2=80=9C. Just like `guix size', it would work with local items. But if there is no substitute server for channels, then there is no way around it. >> I think this is a bit beyond the scope of this patch set. I'd rather >> focus on files exclusively for now and proceed one step at a time :) > > I do not think it is beyond the scope because Arun introduced an SQLite > database for improving =E2=80=9Cguix search=E2=80=9D. But this path had = been stopped > because of =E2=80=9Cintroducing complexity=E2=80=9D [1]. Therefore, if = =E2=80=9Cguix > filesearch=E2=80=9D introduces a SQL cache, then it seems a good idea to = be also > usable by =E2=80=9Cguix search=E2=80=9D. > > Well, if the table is extended with the fields =E2=80=9Csynopsis=E2=80=9D= and > =E2=80=9Cdescription=E2=80=9D then what is the size of the database? Doe= s it kill the > lookup performance? Good point, I'll test and report my measures. Cheers! =2D-=20 Pierre Neidhardt https://ambrevar.xyz/ --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQEzBAEBCAAdFiEEUPM+LlsMPZAEJKvom9z0l6S7zH8FAl9HstoACgkQm9z0l6S7 zH/6HwgAn+zNv7r10O5lsLZB5bMgFHlx7zpti3JcsjTTreoIHWOs+k3IWAxGcm0z /Z7pIiGWUFbyycZ95wpR+Nt6QiFUH3lRtHeRPbrzurXd3A1wqo25eHaBjkzPrrRH 56BD2MuVRQ+69iaUy1v+kgF+VQ2eh4Hj0as1FInRUJpo4bvus/QHH0fIbWdUDmlF xbkIoUl5BuTIpkVwsxPpKIK+ztdio/IOgWyGwSC+0pwTFjzmZ2lrHyEgeFYm1x9o JJx3S+nEGJ32I2DeoPnfG0ajFjw38GTvLk03UV4EbvMg0KnAZQC5cjoNs8GL3MVV ElQ3Al/LKnQCWu4WS1ZjNruRQtfThg== =X6W0 -----END PGP SIGNATURE----- --=-=-=--