From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Tim Cross Newsgroups: gmane.emacs.devel Subject: Re: Gathering data on user preferences Date: Wed, 08 Sep 2021 23:02:55 +1000 Message-ID: <87ilzbfccb.fsf@gmail.com> References: <87h7exkphw.fsf@gmail.com> <20210907064208.GB4097@tuxteam.de> <874kavh5cb.fsf@posteo.net> <87r1dzfp14.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="25984"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: mu4e 1.7.0; emacs 27.2.50 Cc: emacs-devel@gnu.org To: Daniel Fleischer Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Wed Sep 08 15:06:53 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mNxHu-0006Rg-Rb for ged-emacs-devel@m.gmane-mx.org; Wed, 08 Sep 2021 15:06:50 +0200 Original-Received: from localhost ([::1]:48930 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mNxHt-0006zm-Ol for ged-emacs-devel@m.gmane-mx.org; Wed, 08 Sep 2021 09:06:49 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:51210) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mNxFL-0004iA-UC for emacs-devel@gnu.org; Wed, 08 Sep 2021 09:04:12 -0400 Original-Received: from mail-pg1-x530.google.com ([2607:f8b0:4864:20::530]:43599) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mNxFK-00033V-3l for emacs-devel@gnu.org; Wed, 08 Sep 2021 09:04:11 -0400 Original-Received: by mail-pg1-x530.google.com with SMTP id r2so2511516pgl.10 for ; Wed, 08 Sep 2021 06:04:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=references:user-agent:from:to:cc:subject:date:in-reply-to :message-id:mime-version; bh=ob1IcESwwHymR9sxaErQy0wQ8hnqSxfvQCFRfnFgYt8=; b=p98gdNM5EE8Vkg7VlgSucS4eSiY3Ujk4zgLb7b86E2Iox0u2hLpsiJ1Y2E5DPRVal0 Yg7QFlOStA0I0ejzZ46NNBX32OV0UJVa0vWHhcjrO0gKqV8fWzs7VmnhmybzajmMjwh2 umuYSkn16FeKCaRs8JcWcZejMZUsKoa1Rlp2lqxCoH5UoDwCMVL8yP2sqoWwz139HyD2 HgRccWq8X9v5JQ0dsSCR++QuNqvFxu2SdXgx5lWlClE/Mr7oIUXR38iNSCk5Ed4RYFKX ZfslQjLrzc3UIL6eJrzTFamTPm5d+8OSM6ID9ii3AJUyCpwsHZOfgBVKG8lUjOc+L+V4 CWcA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:references:user-agent:from:to:cc:subject:date :in-reply-to:message-id:mime-version; bh=ob1IcESwwHymR9sxaErQy0wQ8hnqSxfvQCFRfnFgYt8=; b=p9YFsMydF/17iLGtMsf92IALPiP9WWyXUD0pVNJSci2KpCiinVJ/I+UjFK4iWZIQDY EyOTuqvqjmfzw9dUK8Qbvpv7IHbh44yjWc7FtBjdfwar3pWNQgZ6R1b8SA3nGN+qyIVu rBIj9yINmp32ewvIcMR64Hn74guxNwhed6iLsVoTdJiyC+Mf+2J2+uS745pYGG6oecQD ZZys1CwSGUN/dziSYgJBn3YwSgLAQ0JvYplf9PKw6t7m/Q11R/5/oFJe/iu8NhWZiTcE 6NwU/A7zROFVKuVAgHz/wt+6AyOnh05izRPETpVrpDcv+8jrj4vAEnfIRNCJI627y9lC osaA== X-Gm-Message-State: AOAM533a1XMm72+6eOFpkduFJq6uMqCVBEE/N5gPNuzVIvht6oObhoxE QgMw63EkpKOy/kZGqd+4+32aerc0WE0= X-Google-Smtp-Source: ABdhPJyPsGVZsh3SLt7j2eleETrLQQR1LGTlM76gZohRn20g7GJe4C40u7d3Y3LBWJsfPPYaV1c4FQ== X-Received: by 2002:a65:5845:: with SMTP id s5mr3596896pgr.227.1631106248594; Wed, 08 Sep 2021 06:04:08 -0700 (PDT) Original-Received: from tim-desktop (106-69-152-50.dyn.iinet.net.au. [106.69.152.50]) by smtp.gmail.com with ESMTPSA id q29sm3059480pgc.91.2021.09.08.06.04.06 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 08 Sep 2021 06:04:08 -0700 (PDT) In-reply-to: Received-SPF: pass client-ip=2607:f8b0:4864:20::530; envelope-from=theophilusx@gmail.com; helo=mail-pg1-x530.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:274335 Archived-At: Daniel Fleischer writes: > Tim Cross [2021-09-08 Wed 18:14] wrote: > >> I would imagine the package I'm proposing would likely leverage off some >> of the code which generates the data attached to bug reports. I also >> think it would be good to be able to mine the data which exists in bug >> reports, but have no idea how easily that could be done. >> >> I do think there is probably some really valuable data currently sitting >> in the Emacs bug tracker and it would be an interesting project to try >> and mine that repository to see what we could get out of it. However, I >> have no idea what API the system provides or how easy this would be to >> do (without significant human intervention). > > I can download the mbox files for each month, going back a few years and > then aggregate them according to month or even week. I'll do it in > python and can share the data and code if I get something interesting. It will be interesting to see what you get and how hard it was to extract. I suspect the bug reports could be a gold mine of valuable data, but it may be difficult to process.