Re: [NonGNU ELPA] New package: llm

From: Jim Porter <jporterbugs@gmail.com>
To: Andrew Hyatt <ahyatt@gmail.com>
Cc: Daniel Fleischer <danflscr@gmail.com>,
	Richard Stallman <rms@gnu.org>,
	emacs-devel@gnu.org
Subject: Re: [NonGNU ELPA] New package: llm
Date: Sun, 20 Aug 2023 23:03:30 -0700
Message-ID: <8c8d1109-d6f3-70b5-010b-31042b5baa18@gmail.com>
In-Reply-To: <CAM6wYYKiodexr6C4M1HyMznXDmZCMfDRWmh5fcSfGE2n1736eQ@mail.gmail.com>

On 8/20/2023 10:12 PM, Andrew Hyatt wrote:
> The training of these is fairly straightforward, at least if you are
> familiar with the area. ... the LLMs we are talking about here use this
> technique to train and execute, changing some parameters and adding
> things like more attention heads, but keeping the fundamental
> architecture the same.

I think the parameters would be a key part of this (or potentially all 
of the code they used for the training, if it does something unique), as 
well as the *actual* training datasets. That's why I'm especially 
concerned about the line in their docs saying "great efforts have been 
taken to clean the pretraining data". I couldn't find out whether they 
provided the cleaned data or only the "raw" data. From my understanding, 
properly cleaning the data is labor-intensive, and you wouldn't be able 
to reproduce another team's efforts in that area unless they gave you a 
diff or something equivalent.

> I'm not an expert, but I believe that due to the use of stochastic 
> processes in training, even if you had the exact code, parameters and 
> data used in training, you would never be able to reproduce the model 
> they make available.  It should be equivalent in quality, perhaps, but 
> not the same.

This is a problem for reproducibility (it would be nice if you could 
*verify* that a model was built the way its makers said it was), but I 
don't think it's a critical problem for freedom.

> To me, I believe it should be about freedom.  Not absolute freedom, but 
> relative freedom: do you, the user, have the same amount of freedom as 
> anyone else, including the creator?  For the LLMs like huggingface and 
> many other research LLMs, the answer is yes.

So long as the creators provide all the necessary parameters to retrain 
the model from "scratch", I think I'd agree. If some of these aren't 
provided (cleaned datasets, training parameters, any direct human 
intervention if applicable, etc.), then I think the answer is no. For 
example, the creator could decide that one data source is bad for some 
reason, and retrain their model without it. Would I be able to do that 
work independently with just what the creator has given me?

I see that there was a presentation at LibrePlanet 2023 (or maybe 
shortly after) by Leandro von Werra of HuggingFace on the ethics of 
code-generating LLMs[1]. It says that it hasn't been published online 
yet, though. This might not be the final answer on all the concerns 
about incorporating LLMs into Emacs, but hopefully it would help.

In practice though, I think if Emacs were to support communicating with 
LLMs, it would be good if - at minimum - we could direct users to an 
essay explaining the potential ethical/freedom issues with them. On that 
note, maybe we could also take a bit of inspiration from Emacs dynamic 
modules. They require a GPL compatibility symbol[2] in order to load, 
and perhaps a hypothetical 'llm-foobar' package that interfaces with the 
'foobar' LLM could announce whether it respects users' freedom via some 
variable/symbol. Freedom-respecting LLMs wouldn't need a warning message 
then. We could even forbid packages that talk to particularly "bad" 
LLMs. (I suppose we can't stop users from writing their own packages and 
just lying about whether they're ok, but we could prevent their 
inclusion in ELPA.)
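
To make the idea a bit more concrete, here is a rough sketch of the kind of 
check I have in mind. It is written in Python purely for illustration; it is 
not Emacs Lisp and not an existing API. The names `Provider`, 
`respects_freedom`, and `check_provider` are all made up, and treating a 
missing declaration as an error is just one possible policy (loosely modeled 
on how a dynamic module without the GPL compatibility symbol is refused).

```python
import warnings
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Provider:
    """A hypothetical LLM backend, e.g. what an 'llm-foobar' package registers."""

    name: str
    # Loose analogue of the module GPL compatibility symbol.
    # None means the package never declared anything either way.
    respects_freedom: Optional[bool] = None


def check_provider(provider: Provider) -> bool:
    """Return True if the provider can be used without a warning.

    A provider with no declaration is refused outright; a provider that
    declares itself non-free is allowed, but the user gets a warning.
    """
    if provider.respects_freedom is None:
        raise ValueError(
            f"{provider.name}: no freedom declaration; refusing to load"
        )
    if not provider.respects_freedom:
        warnings.warn(
            f"{provider.name} may not respect your freedom; "
            "please read the essay on the ethics of LLMs first",
            UserWarning,
            stacklevel=2,
        )
        return False
    return True
```

With something like this, `check_provider(Provider("foobar", False))` would 
emit the warning, while a provider declared as freedom-respecting would pass 
silently. Nothing stops a package from declaring `True` dishonestly, which is 
why the review for inclusion in ELPA would still matter.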

[1] https://www.fsf.org/bulletin/2023/spring/trademarks-volunteering-and-code-generating-llm

[2] https://www.gnu.org/software/emacs/manual/html_node/elisp/Module-Initialization.html
