From mboxrd@z Thu Jan  1 00:00:00 1970
Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail
From: Debanjum Singh Solanky <debanjum@khoj.dev>
Newsgroups: gmane.emacs.tangents
Subject: Re: Collaborative training of Libre LLMs (was: Is ChatGTP SaaSS?
 (was: [NonGNU ELPA] New package: llm))
Date: Sat, 9 Sep 2023 19:18:44 -0700
Message-ID: <CAM_WPJStUsCM0jSyPEBjnjFwg-aV=fL7JHkSEGFT=_Xir_qRGA@mail.gmail.com>
References: <CAM6wYYJHa+tCUKO_SsnT77g-4MUM0x4FrkoCekr=T9-UF1ADDA@mail.gmail.com>
 <E1qTaA2-00038O-UA@fencepost.gnu.org>
 <CAM6wYY+E=z5VqV2xXMbhbpN7vn+-tyzfOGKFAuG0s+croRmEPA@mail.gmail.com>
 <E1qV08g-0001mb-11@fencepost.gnu.org>
 <CAM6wYYLZ26E4rpo2Ae2PyxKSBYQKAXQ6U5_QGMoGx5SQy7AMSA@mail.gmail.com>
 <87v8d0iqa5.fsf@posteo.net> <E1qaR6l-00012I-VP@fencepost.gnu.org>
 <CAM6wYYLYrQL9+3cgUELYavUdHQg5m0bqdW89_qJFvk050-sGNQ@mail.gmail.com>
 <fd98dcaf-5016-1a84-f281-36ef6eb108c5@gmail.com>
 <E1qbX8C-0004EP-3M@fencepost.gnu.org>
 <87cyz3vaws.fsf@localhost> <E1qcyN3-0001al-5t@fencepost.gnu.org>
 <87a5tzsbvl.fsf@localhost> <E1qelzp-0003lR-Nq@fencepost.gnu.org>
 <87fs3n7i98.fsf@localhost> <E1qf8Dj-0000ay-02@fencepost.gnu.org>
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary="000000000000ea21360604f7d4bb"
Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214";
	logging-data="33412"; mail-complaints-to="usenet@ciao.gmane.io"
Cc: Ihor Radchenko <yantar92@posteo.net>, emacs-tangents@gnu.org,
 jporterbugs@gmail.com, ahyatt@gmail.com, team@khoj.dev
To: rms@gnu.org
Original-X-From: emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org Sun Sep 10 06:56:56 2023
Return-path: <emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org>
Envelope-to: get-emacs-tangents@m.gmane-mx.org
Original-Received: from lists.gnu.org ([209.51.188.17])
	by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
	(Exim 4.92)
	(envelope-from <emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org>)
	id 1qfCVD-0008VZ-M6
	for get-emacs-tangents@m.gmane-mx.org; Sun, 10 Sep 2023 06:56:55 +0200
Original-Received: from localhost ([::1] helo=lists1p.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.90_1)
	(envelope-from <emacs-tangents-bounces@gnu.org>)
	id 1qfCUn-0001UO-4L; Sun, 10 Sep 2023 00:56:29 -0400
Original-Received: from eggs.gnu.org ([2001:470:142:3::10])
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <debanjum@khoj.dev>) id 1qfA2S-0005eg-Jy
 for emacs-tangents@gnu.org; Sat, 09 Sep 2023 22:19:04 -0400
Original-Received: from mail-wm1-x32c.google.com ([2a00:1450:4864:20::32c])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.90_1) (envelope-from <debanjum@khoj.dev>) id 1qfA2M-0004gN-7N
 for emacs-tangents@gnu.org; Sat, 09 Sep 2023 22:19:04 -0400
Original-Received: by mail-wm1-x32c.google.com with SMTP id
 5b1f17b1804b1-401da71b85eso36157105e9.1
 for <emacs-tangents@gnu.org>; Sat, 09 Sep 2023 19:18:57 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=khoj.dev; s=google; t=1694312336; x=1694917136; darn=gnu.org;
 h=cc:to:subject:message-id:date:from:in-reply-to:references
 :mime-version:from:to:cc:subject:date:message-id:reply-to;
 bh=QTgogHPPQi+zdG+pWrYl3mNtseFGiyiieTstWyyPfSI=;
 b=FepXHLMQs82ZHYfp9k7Vwrse8n9nru993yQrri36XwN6LfCaQVjxY8Dq60m4Ye5o5h
 S/oUsMU+OBc85+c8ktlcS0+jB2JpLUnfKX7E4j4z/vOt24OGcOHm/V/zmGEZ0O+eSOpe
 7eD0u47mPvsnqmI7ICopYN+L00xNep5AZgQTuTkLykZOCnEOP+mrbxGPPe7Cp5Vtn+fI
 k+9ff3qF7Ax+4Oyh950r6Bzj9WPWR3sSvi3zg8iI+cbkZYpur5NqiCEHueqB2NfHyJx6
 Q/BtLC6klGvitcu9ztcfwHqt3xJzolz6RFivLrzU55eEjGnLP4yzrguECuKyevrF/2dH
 QtuQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1694312336; x=1694917136;
 h=cc:to:subject:message-id:date:from:in-reply-to:references
 :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id
 :reply-to;
 bh=QTgogHPPQi+zdG+pWrYl3mNtseFGiyiieTstWyyPfSI=;
 b=jOXpzfoGAHYgZkXWk/XYFQWX0GUGttvjfajXfT+Y7Us33dWX30N265g/MfPDrcL0Wf
 /WfDhraXXhTVQJZmL/uEmF19bzbB7QRyaJPJIuoKFtcZ+hC5rmtJ5YtZ9a9iUfT3epz7
 tK2HzKs/4Feva4CtDr9yWY0g84PPKBOWay/5sqX8SmYiVfmvdJT57bGcNbUFgiIBFtma
 vsYK5yTKbVVO9+uoiKYiMqIjhXnrVtEzr2Io8tzkx8s8J1UrW722Pa500Dv8vbSAypvW
 Q7vrnvMG2kCgHBM3a7I35/p9T4JDsulsd7np5y4byohIzEqYJXLKSeAnZQnjB+j2JOR4
 MLYQ==
X-Gm-Message-State: AOJu0YwyUjKHog/JpCKyieo8JmqUyc0r6uVW1D1AZBC4m1JjFEAQBGwc
 XN9sI00WQMBfxtF84XJyA5c4x4WNbmB8WJnOQl1aLQ==
X-Google-Smtp-Source: AGHT+IGKx3y7Okh4MLDaRYsXtoIi+JfIbtwtKA9aPH4yetWPlGbkas0FPdTJ4PGhS2Fi8aWQJbDuvQrEoAL+99gEUAU=
X-Received: by 2002:a1c:f70b:0:b0:3fe:22fd:1b23 with SMTP id
 v11-20020a1cf70b000000b003fe22fd1b23mr5195043wmh.34.1694312335804; Sat, 09
 Sep 2023 19:18:55 -0700 (PDT)
In-Reply-To: <E1qf8Dj-0000ay-02@fencepost.gnu.org>
Received-SPF: pass client-ip=2a00:1450:4864:20::32c;
 envelope-from=debanjum@khoj.dev; helo=mail-wm1-x32c.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
 T_SPF_TEMPERROR=0.01 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-Mailman-Approved-At: Sun, 10 Sep 2023 00:56:27 -0400
X-BeenThere: emacs-tangents@gnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Emacs news and miscellaneous discussions outside the scope of other
 Emacs mailing lists <emacs-tangents.gnu.org>
List-Unsubscribe: <https://lists.gnu.org/mailman/options/emacs-tangents>,
 <mailto:emacs-tangents-request@gnu.org?subject=unsubscribe>
List-Archive: <https://lists.gnu.org/archive/html/emacs-tangents>
List-Post: <mailto:emacs-tangents@gnu.org>
List-Help: <mailto:emacs-tangents-request@gnu.org?subject=help>
List-Subscribe: <https://lists.gnu.org/mailman/listinfo/emacs-tangents>,
 <mailto:emacs-tangents-request@gnu.org?subject=subscribe>
Errors-To: emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org
Original-Sender: emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org
Xref: news.gmane.io gmane.emacs.tangents:1062
Archived-At: <http://permalink.gmane.org/gmane.emacs.tangents/1062>

--000000000000ea21360604f7d4bb
Content-Type: text/plain; charset="UTF-8"

>   > However, if the "patching" technology can only serve a single "patch" +
>   > main model, there is a problem. Improving libre neural networks will
>   > become difficult, unless people utilize collaborative server to
>   > continuously improve a model.
>
>   > Such collaborative server, similar to ChatGPT, will combine "editing"
>   > (training) and "consulting" together. And, unlike Wikipedia, these
>   > activities are hard to separate.
>
> If the users in this "community" can't move their work outside of a
> private "collaborative server", they are in effect prisoners of that
> server.  Whoever keeps them stuck there will have power, and that will
> tempt per to mistreat them with it.
>

Versus traditional software, AI systems rely critically on the usage
data generated to improve the original model. Using copyleft licensed
models maybe enough to prevent a server owner from being able
to train a better closed model? This would prevent them from holding
users hostage on their server.


>   > This raises a moral question about practical ways to improve libre
>   > neural networks without falling into SaaSS practices.
>
> From the example above, I conclude it is crucial that people who use a
> particular platform to modify and run the model have the feasible
> freedom of copying their modified versions off that platform and onto
> any other platform that satisfies the specs needed to run these models.
>

Platform portability does not solve for how to improve libre
neural networks in an open, community guided way.

To collaboratively develop better open models we'd need the generated
usage data to be publically shareable. Attempts like open-assistant
(https://open-assistant.io) that share usage data under cc-by-sa maybe
a good enough solution for this. But it'll fall on the server owners
to get explicit user consent and clean sensitive usage data to share
this data publically without liability.

--
Debanjum Singh Solanky
Founder, Khoj (https://khoj.dev/)

--000000000000ea21360604f7d4bb
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><div dir=3D"ltr"><div dir=3D"ltr"><div cl=
ass=3D"gmail_quote"><div>=C2=A0</div><blockquote class=3D"gmail_quote" styl=
e=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);paddin=
g-left:1ex">
=C2=A0 &gt; However, if the &quot;patching&quot; technology can only serve =
a single &quot;patch&quot; +<br>
=C2=A0 &gt; main model, there is a problem. Improving libre neural networks=
 will<br>
=C2=A0 &gt; become difficult, unless people utilize collaborative server to=
<br>
=C2=A0 &gt; continuously improve a model.<br>
<br>
=C2=A0 &gt; Such collaborative server, similar to ChatGPT, will combine &qu=
ot;editing&quot;<br>
=C2=A0 &gt; (training) and &quot;consulting&quot; together. And, unlike Wik=
ipedia, these<br>
=C2=A0 &gt; activities are hard to separate.<br>
<br>
If the users in this &quot;community&quot; can&#39;t move their work outsid=
e of a<br>
private &quot;collaborative server&quot;, they are in effect prisoners of t=
hat<br>
server.=C2=A0 Whoever keeps them stuck there will have power, and that will=
<br>
tempt per to mistreat them with it.<br></blockquote><div><br></div><div><di=
v class=3D"gmail_default" style=3D"font-family:tahoma,sans-serif"><div clas=
s=3D"gmail_default">Versus traditional software, AI systems rely critically=
 on the usage</div><div class=3D"gmail_default">data generated to improve t=
he original model. Using copyleft licensed</div><div class=3D"gmail_default=
">models maybe enough to prevent a server owner from being able</div><div c=
lass=3D"gmail_default">to train a better closed model? This would prevent t=
hem from holding</div><div class=3D"gmail_default">users hostage on their s=
erver.</div></div></div><div class=3D"gmail_default" style=3D"font-family:t=
ahoma,sans-serif"><br></div><div>=C2=A0</div><blockquote class=3D"gmail_quo=
te" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204=
);padding-left:1ex">
=C2=A0 &gt; This raises a moral question about practical ways to improve li=
bre<br>
=C2=A0 &gt; neural networks without falling into SaaSS practices.<br>
<br>
>From the example above, I conclude it is crucial that people who use a<br>
particular platform to modify and run the model have the feasible<br>
freedom of copying their modified versions off that platform and onto<br>
any other platform that satisfies the specs needed to run these models.<br>=
</blockquote><div><span style=3D"font-family:tahoma,sans-serif"></span><br>=
</div><div class=3D"gmail_default"><span style=3D"font-family:tahoma,sans-s=
erif">Platform portability does not solve for how to improve libre</span><b=
r></div><div class=3D"gmail_default"><font face=3D"tahoma, sans-serif">neur=
al networks in an open, community guided way.</font></div><div class=3D"gma=
il_default"><font face=3D"tahoma, sans-serif"><br></font></div><div class=
=3D"gmail_default"><div class=3D"gmail_default">To collaboratively develop =
better open models we&#39;d need the generated</div><div class=3D"gmail_def=
ault">usage data to be publically shareable. Attempts like open-assistant</=
div><div class=3D"gmail_default">(<a href=3D"https://open-assistant.io">htt=
ps://open-assistant.io</a>) that share usage data under cc-by-sa maybe</div=
><div class=3D"gmail_default">a good enough solution for this. But it&#39;l=
l fall on the server owners</div><div class=3D"gmail_default">to get explic=
it user consent and clean sensitive usage data to share</div><div class=3D"=
gmail_default">this data publically without liability.</div></div></div></d=
iv></div></div><br clear=3D"all"><div><div dir=3D"ltr" class=3D"gmail_signa=
ture" data-smartmail=3D"gmail_signature"><div dir=3D"ltr"><font color=3D"#6=
66666">--</font><div><font color=3D"#444444" face=3D"tahoma, sans-serif">De=
banjum Singh Solanky</font><div><font color=3D"#444444" face=3D"tahoma, san=
s-serif">Founder, Khoj<span class=3D"gmail_default" style=3D"font-family:ta=
homa,sans-serif"> (</span></font><a href=3D"https://khoj.dev/" target=3D"_b=
lank" style=3D"font-family:tahoma,sans-serif">https://khoj.dev/</a><span cl=
ass=3D"gmail_default" style=3D"font-family:tahoma,sans-serif">)</span></div=
></div></div></div></div></div>

--000000000000ea21360604f7d4bb--