From: Adam Porter
Newsgroups: gmane.emacs.bugs
Subject: bug#50686: Show number of downloads on packages on GNU ELPA/NonGNU ELPA
Date: Mon, 11 Mar 2024 15:07:04 -0500
References: <985acef0-69f1-39c3-1354-9a49149c9df9@alphapapa.net> <1f2a10bf-c135-480d-9b79-17b64090fc7e@alphapapa.net>
To: Stefan Monnier
Cc: 50686@debbugs.gnu.org, stefan@marxist.se, larsi@gnus.org

Hi Stefan,

On 3/9/24 08:37, Stefan Monnier wrote:

>>> If you go to http://elpa.gnu.org/packages/ you'll now see a new column
>>> "Rank" which shows a percentile ranking for each package.
>> That's very cool.  I guess it's not looking very far back in the download
>> data (yet?),
>
> I had the logs only for two weeks or so (plus some old logs from
> many years ago, actually), indeed.

I see.  Are the rest of the logs still available on the ELPA server, or
is that all we have for historical data?

>> a list of downloads per version, etc.
>
> Currently I count the "interest" in the package, so I don't distinguish
> the version of the package, nor whether the access is for the tarball or
> the package's web page, or the package's readme.txt, or the package's badge.

That seems like a very different kind of data than the number of times a
package has been downloaded (i.e. by an Emacs instance).  In my
experience, only a small fraction of hits to a package's GitHub repo
seems to result in installations; "interest" tends to be far more than
"interested enough to install."

> I'd like to keep the stats database reasonably small (it's currently
> around 150kB, and I expect it'll take a year before it reaches 1MB), so
> I'd rather not segregate per version.

Is there a way that I could change your mind about that?  Having the
actual download counts per version would be very useful.

As for database size, the download counts per version (i.e. per tarball
filename) could be stored in a table like:

   FILENAME | DOWNLOAD_COUNT | LAST_UPDATED

The table could be updated when the logs are processed (omitting any
logged download from before the LAST_UPDATED timestamp).  And while that
wouldn't show when the downloads occurred, it would still be useful to
get an idea of how many users a package has (i.e. ones that actually
install updates to it), and it would be a very small amount of data to
store.

--Adam
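
To illustrate the per-filename table proposed above, here is a minimal Python sketch of the update step.  Everything in it is an assumption made for illustration: the log entries are taken to be already parsed into (timestamp, filename) pairs, and the names `Row` and `update_counts` are hypothetical, not part of the actual ELPA scripts.  It also assumes "before LAST_UPDATED" means "at or before".

```python
from dataclasses import dataclass
from datetime import datetime, timezone

# Sentinel meaning "never updated" (hypothetical choice for this sketch).
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


@dataclass
class Row:
    """One row of the FILENAME | DOWNLOAD_COUNT | LAST_UPDATED table."""
    download_count: int = 0
    last_updated: datetime = EPOCH


def update_counts(table, entries):
    """Fold (timestamp, filename) log entries into `table` (a dict).

    Entries at or before a row's LAST_UPDATED are skipped, so feeding
    an already-processed log in again does not count downloads twice.
    """
    # Snapshot the cutoffs first, so that entries within this batch
    # do not cause later entries of the same batch to be skipped.
    cutoffs = {name: row.last_updated for name, row in table.items()}
    for timestamp, filename in entries:
        if timestamp <= cutoffs.get(filename, EPOCH):
            continue  # already counted in an earlier pass
        row = table.setdefault(filename, Row())
        row.download_count += 1
        row.last_updated = max(row.last_updated, timestamp)
    return table


if __name__ == "__main__":
    def at(hour):
        return datetime(2024, 3, 11, hour, tzinfo=timezone.utc)

    log = [(at(1), "foo-1.0.tar"), (at(2), "foo-1.0.tar"), (at(3), "foo-1.1.tar")]
    table = update_counts({}, log)
    # Re-reading the old log plus one new entry only adds the new entry.
    update_counts(table, log + [(at(4), "foo-1.0.tar")])
    for name, row in sorted(table.items()):
        print(name, row.download_count, row.last_updated.isoformat())
```

The table holds one row per tarball filename, so its size grows with the number of released versions rather than with the number of downloads.  One caveat of this sketch: two downloads logged with the same timestamp but processed in separate passes would be counted only once.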