From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: Rocky Bernstein Newsgroups: gmane.emacs.devel Subject: Re: GNU is looking for Google Summer of Code Projects Date: Thu, 19 Mar 2020 14:19:24 -0400 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="00000000000010249305a1393661" Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="17544"; mail-complaints-to="usenet@ciao.gmane.io" Cc: Stefan Monnier , emacs-devel To: Andrea Corallo Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Thu Mar 19 19:20:23 2020 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1jEzmJ-0004SW-Ex for ged-emacs-devel@m.gmane-mx.org; Thu, 19 Mar 2020 19:20:23 +0100 Original-Received: from localhost ([::1]:41632 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jEzmI-0004BZ-EK for ged-emacs-devel@m.gmane-mx.org; Thu, 19 Mar 2020 14:20:22 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:44606) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jEzlb-0003ig-Kh for emacs-devel@gnu.org; Thu, 19 Mar 2020 14:19:41 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1jEzlZ-0001iF-PJ for emacs-devel@gnu.org; Thu, 19 Mar 2020 14:19:39 -0400 Original-Received: from mail-lf1-f41.google.com ([209.85.167.41]:42729) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1jEzlZ-0001hr-HI for emacs-devel@gnu.org; Thu, 19 Mar 2020 14:19:37 -0400 Original-Received: by mail-lf1-f41.google.com with SMTP id t21so2438838lfe.9 for ; Thu, 19 Mar 2020 11:19:37 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=NnFpt4vP10IKx2rOoiWdz4S5+EmdXDCgotq5TBRQ5r8=; b=pZd0vZuCwYLM8w4ZgT0IBq6ZrjzXBEBHzYIak6tHJgoP6yS/m6ryWUEu/V5SD76D7b mLa54dWokMDy3Ww8wPP7Vr8ZIshv4T8YUNzxBtwP4wtYY+Bj7XOwHnYka66VXdPR0IhE K1qD6PVhDAE/BFNLGL4cKnq2+dG3zAPnMypJ7/nOuoR60Pkp0EH57R/m2NB8Mg1rYEBn mEzhMY5wciwXqM+14sBmjFlc+o9itxYN69HbfOp59IqES5UTmpHi1XOZwCXINGZ1i5MP eKUWzy8sktww/v1Tv7qVjq323PK0PSAVD9jVs1hI6tsXDJL3OB+J7aHOfPz2CVh9VDFc xnKA== X-Gm-Message-State: ANhLgQ10VPk1MD5PEf+etffDOYVLsqGW9PDswk+dlXM0dOIPTBjoh4na gmJMs2BWwos+5fQFAMdp1ILBoRkoGf1mwFliVsY= X-Google-Smtp-Source: ADFU+vu2fCqmG78YoK7c2VelgDyKCUciQOu45fGFX/tp0vtTXj2h032pCqydaKgkRGAIWeMLu/H9m+U9uC4RXIWhQH8= X-Received: by 2002:a19:6711:: with SMTP id b17mr2823192lfc.140.1584641975787; Thu, 19 Mar 2020 11:19:35 -0700 (PDT) In-Reply-To: X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] [fuzzy] X-Received-From: 209.85.167.41 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:245586 Archived-At: --00000000000010249305a1393661 Content-Type: text/plain; charset="UTF-8" On Thu, Mar 19, 2020 at 1:56 PM Andrea Corallo wrote: > Stefan Monnier writes: > > > It should be easy (much smaller than a summer project) to change the > > C code so that a bytecode offset can be extracted from the backtrace. > > > > The harder and more interesting part is how to propagate source > > information (line numbers and/or lexical variable names and location) to > > byte-code. There are many parts to this, so it's definitely possible to > > get some summer project(s) out of it. E.g. one such project is to change > > the reader so it outputs "fat cons cells" (i.e. cons-cells with line-num > > info), then arrange for that info to survive `macroexpand-all` and > > `cconv.el`. That could already be used to give more precise line > > numbers in bytecompiler warnings. > > > > Another is to devise a way to annotate bytecode objects with a map from > > byte-offsets to information about the lexical vars in-scope at that point > > and their location (i.e. position in the stack or in the closure). > > And then teach Emacs's debugger to use that info. > > > >> But enough about me. What is most in need of help in GNU Emacs that a > >> summer student might reasonably make progress on? > > > > I'm sure there are lots of desires. One I'd suggest is to introduce an > > "object description" that can be used both by the GC and pdump code (and > > maybe also by `equal` and `print--preprocess`?), so that when changing > > the representation of objects or introducing new types we don't have to > > make corresponding changes in so many different places. XEmacs had such > > a thing, so there's previous experience on which we can build. > > It could also be a step towards replacing our GC with one that's > > incremental such the one in XEmacs (or even better: concurrent, unlike > > that of XEmacs). > > > > > > Stefan > > It's probably definitely early to discuss but can't resist. > > Do we really need some dedicated low level object? This should be all > overhead that disappears with compilation anyway. > > Also wanted to ask, am I wrong or something has been attempted in this > field? > In the bit that I have come across looking over byteocde work and history e,g. see http://rocky.github.io/elisp-bytecode.pdf it has become extremely clear that there are precious few who understand how the bytecode and runtime system work. And the people who wrote this initially, e.g. rms, and later jwz, no longer do so. No slight to Stefan, Jim Blandy, Paul Eggert or Tom Tromey, but if nothing else we need a new generation of people to pick up the torch and carry on. > > I'm quite curious on this because the day we get source locations > crossing byte-code we could use the native compiler also as a diagnostic > tool. > > Andrea > > -- > akrl@sdf.org > --00000000000010249305a1393661 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable


=
On Thu, Mar 19, 2020 at 1:56 PM Andre= a Corallo <akrl@sdf.org> wrote:
Stefan Monnier &l= t;monnier@iro= .umontreal.ca> writes:

> It should be easy (much smaller than a summer project) to change the > C code so that a bytecode offset can be extracted from the backtrace.<= br> >
> The harder and more interesting part is how to propagate source
> information (line numbers and/or lexical variable names and location) = to
> byte-code.=C2=A0 There are many parts to this, so it's definitely = possible to
> get some summer project(s) out of it.=C2=A0 E.g. one such project is t= o change
> the reader so it outputs "fat cons cells" (i.e. cons-cells w= ith line-num
> info), then arrange for that info to survive `macroexpand-all` and
> `cconv.el`.=C2=A0 That could already be used to give more precise line=
> numbers in bytecompiler warnings.
>
> Another is to devise a way to annotate bytecode objects with a map fro= m
> byte-offsets to information about the lexical vars in-scope at that po= int
> and their location (i.e. position in the stack or in the closure).
> And then teach Emacs's debugger to use that info.
>
>> But enough about me. What is most in need of help in GNU Emacs tha= t a
>> summer student might reasonably make progress on?
>
> I'm sure there are lots of desires.=C2=A0 One I'd suggest is t= o introduce an
> "object description" that can be used both by the GC and pdu= mp code (and
> maybe also by `equal` and `print--preprocess`?), so that when changing=
> the representation of objects or introducing new types we don't ha= ve to
> make corresponding changes in so many different places.=C2=A0 XEmacs h= ad such
> a thing, so there's previous experience on which we can build.
> It could also be a step towards replacing our GC with one that's > incremental such the one in XEmacs (or even better: concurrent, unlike=
> that of XEmacs).
>
>
>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0Stefan

It's probably definitely early to discuss but can't resist.

Do we really need some dedicated low level object?=C2=A0 This should be all=
overhead that disappears with compilation anyway.

Also wanted to ask, am I wrong or something has been attempted in this
field?

In the bit that I have come acro= ss looking over byteocde=C2=A0work and history e,g. see=C2=A0http://rocky.github.io/elisp-byteco= de.pdf=C2=A0it has become extremely clear that there are precious few w= ho understand how the bytecode and runtime system work. And the people who = wrote this initially, e.g. rms, and later jwz, no longer do so.=C2=A0
=

No slight to Stefan, Jim Blandy, Paul Eggert or Tom Tro= mey, but if nothing else we need a new generation of people to pick up the = torch and carry on.=C2=A0
=C2=A0

I'm quite curious on this because the day we get source locations
crossing byte-code we could use the native compiler also as a diagnostic tool.

=C2=A0 Andrea

--
akrl@sdf.org
--00000000000010249305a1393661--