From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Philip Kaludercic Newsgroups: gmane.emacs.tangents Subject: Re: Help building Pen.el (GPT for emacs) Date: Sat, 24 Jul 2021 14:49:02 +0000 Message-ID: <871r7n22e9.fsf@posteo.net> References: <83im1948mj.fsf@gnu.org> <83lf642jeh.fsf@gnu.org> <83r1fp1es9.fsf@gnu.org> <837dhg1a1l.fsf@gnu.org> <87czr89n1a.fsf@posteo.net> <83y29wywfg.fsf@gnu.org> <87a6mc12it.fsf@posteo.net> <83mtqcyn6h.fsf@gnu.org> <875ywz23vs.fsf@posteo.net> <83czr7zsjw.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="15897"; mail-complaints-to="usenet@ciao.gmane.io" Cc: stefan@marxist.se, emacs-tangents@gnu.org, mullikine@gmail.com, rms@gnu.org, bugs@gnu.support To: Eli Zaretskii Original-X-From: emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org Sat Jul 24 17:13:08 2021 Return-path: Envelope-to: get-emacs-tangents@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1m7JKu-0003zk-Ce for get-emacs-tangents@m.gmane-mx.org; Sat, 24 Jul 2021 17:13:08 +0200 Original-Received: from localhost ([::1]:43400 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1m7JKt-00012h-FO for get-emacs-tangents@m.gmane-mx.org; Sat, 24 Jul 2021 11:13:07 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:44514) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1m7Ixi-0007OP-Hw for emacs-tangents@gnu.org; Sat, 24 Jul 2021 10:49:10 -0400 Original-Received: from mout02.posteo.de ([185.67.36.66]:44917) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1m7Ixe-0007OU-Ak for emacs-tangents@gnu.org; Sat, 24 Jul 2021 10:49:10 -0400 Original-Received: from submission (posteo.de [89.146.220.130]) by mout02.posteo.de (Postfix) with ESMTPS id AD1BF24010A for ; Sat, 24 Jul 2021 16:49:03 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=posteo.net; s=2017; t=1627138143; bh=TPtjWVF08VYe7S1pg7kzSWeqmZrJs/Xg/qkHThEYbEU=; h=From:To:Cc:Subject:Autocrypt:Date:From; b=GC0amVmbBWSoNLyNe7htXV0uN6DCuyvyA1F4ErdH52Tx2+1qMNiPTwW+iBxcPIiN4 DZGsJRXzI6XpvlDdbxB0VSe8vhtOmoEHf5zwbAVb97Kx2EK7LY4cEsTeK748uiMKJy dfQq6U21Rl/Y/W5XKXgaNuHWtCovXXKk5B0B8Uyl67qNliVQJM6gUSxw4Rk3Hl7x+Y DsIVTr/EDIFbMKdnL5hzqz/EvcylxWRsa0+aOeYYZK2menhV5urUkAwjhsPiR+tET9 u8kK7KLU1uc1VqnM04kblrwB5RpeadeoZgfHQcyRhGREhSE9iMxGygEYpu758LwMsR BrFmdlCX0eScQ== Original-Received: from customer (localhost [127.0.0.1]) by submission (posteo.de) with ESMTPSA id 4GX8G70RcNz6tmX; Sat, 24 Jul 2021 16:49:02 +0200 (CEST) Autocrypt: addr=philipk@posteo.net; prefer-encrypt=mutual; keydata= mDMEYHHqUhYJKwYBBAHaRw8BAQdAp3GdmYJ6tm5McweY6dEvIYIiry+Oz9rU4MH6NHWK0Ee0QlBo aWxpcCBLYWx1ZGVyY2ljIChnZW5lcmF0ZWQgYnkgYXV0b2NyeXB0LmVsKSA8cGhpbGlwa0Bwb3N0 ZW8ubmV0PoiQBBMWCAA4FiEEDM2H44ZoPt9Ms0eHtVrAHPRh1FwFAmBx6lICGwMFCwkIBwIGFQoJ CAsCBBYCAwECHgECF4AACgkQtVrAHPRh1FyTkgEAjlbGPxFchvMbxzAES3r8QLuZgCxeAXunM9gh io0ePtUBALVhh9G6wIoZhl0gUCbQpoN/UJHI08Gm1qDob5zDxnIHuDgEYHHqUhIKKwYBBAGXVQEF AQEHQNcRB+MUimTMqoxxMMUERpOR+Q4b1KgncDZkhrO2ql1tAwEIB4h4BBgWCAAgFiEEDM2H44Zo Pt9Ms0eHtVrAHPRh1FwFAmBx6lICGwwACgkQtVrAHPRh1Fw1JwD/Qo7kvtib8jy7puyWrSv0MeTS g8qIxgoRWJE/KKdkCLEA/jb9b9/g8nnX+UcwHf/4VfKsjExlnND3FrBviXUW6NcB In-Reply-To: <83czr7zsjw.fsf@gnu.org> (Eli Zaretskii's message of "Sat, 24 Jul 2021 17:37:39 +0300") Received-SPF: pass client-ip=185.67.36.66; envelope-from=philipk@posteo.net; helo=mout02.posteo.de X-Spam_score_int: -27 X-Spam_score: -2.8 X-Spam_bar: -- X-Spam_report: (-2.8 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H4=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=unavailable autolearn_force=no X-Spam_action: no action X-Mailman-Approved-At: Sat, 24 Jul 2021 11:12:02 -0400 X-BeenThere: emacs-tangents@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Emacs news and miscellaneous discussions outside the scope of other Emacs mailing lists List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-tangents-bounces+get-emacs-tangents=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-tangents" Xref: news.gmane.io gmane.emacs.tangents:691 Archived-At: Eli Zaretskii writes: >> From: Philip Kaludercic >> Cc: rms@gnu.org, mullikine@gmail.com, emacs-tangents@gnu.org, >> stefan@marxist.se, bugs@gnu.support >> Date: Sat, 24 Jul 2021 14:16:55 +0000 >> >> > Sorry, I don't understand what you have in mind. Can you show an >> > example of useful code that could be copied verbatim into a program >> > without at least some renaming, without breaking the program? >> >> To take the example from the article I mentioned above >> >> public static String humanReadableByteCount(long bytes, boolean si) { >> int unit = si ? 1000 : 1024; >> if (bytes < unit) return bytes + " B"; >> int exp = (int) (Math.log(bytes) / Math.log(unit)); >> String pre = (si ? "kMGTPE" : "KMGTPE").charAt(exp-1) + (si ? "" : "i"); >> return String.format("%.1f %sB", bytes / Math.pow(unit, exp), pre); >> } >> >> can be copied into a Java program, and assuming that there is no other >> method called humanReadableByteCount in the same class, it should >> compile and run without renaming or re-typing. > > How would one know it's 'long' and not some other data type? I am not sure what you mean? "long" makes sense here because Java will automatically up-cast any other type to fit. >> CoPilot might generate this from a comment like, >> >> // Convert a byte count to a human-readable string >> >> since it is mentioned over 6000 times on GitHub (and this method even >> has a bug, as the article explains -- but that is a totally different >> issue). > > That's not how AI works: it doesn't just count the number of times > something is mentioned. That usually leads to unsatisfactory results. Of course, that would be oversimplifying. At the same time, if the training samples have common patterns, a model is more likely to reproduce that behaviour. But since these are neural networks we are talking about, it is hard to determine causality to begin with, which probably makes the whole situation even more difficult (speaking as a non-lawyer). -- Philip Kaludercic