From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Daniel Colascione Newsgroups: gmane.emacs.devel Subject: Re: Dynamic loading progress Date: Mon, 16 Feb 2015 12:01:40 -0800 Message-ID: <54E24CA4.9020601@dancol.org> References: <838ugdf251.fsf@gnu.org> <87bnl1vmqf.fsf@lifelogs.com> <87vbj8tow4.fsf@lifelogs.com> <87r3twtagf.fsf@lifelogs.com> <85siebl7ws.fsf@stephe-leake.org> <85a90ilwmm.fsf@stephe-leake.org> <83386a6f7z.fsf@gnu.org> <85h9upjz7v.fsf@stephe-leake.org> <83wq3k3kl4.fsf@gnu.org> <85bnkwil1c.fsf@stephe-leake.org> <83pp9cwky8.fsf@gnu.org> <85a90ggf2d.fsf@stephe-leake.org> <54E0A40F.5080603@dancol.org> <83sie7un20.fsf@gnu.org> <54E0D181.2080802@dancol.org> <83r3trulse.fsf@gnu.org> <54E0D7E0.305@[87.69.4.28]> <83h9unukbg.fsf@gnu.org> <54E0DEF8.7020901@dancol> <83egpruiyp.fsf@gnu.org> <54E0FF93.2000104@dancol.org> <833865vp4d.fsf@gnu.org> <54E2355A.90@[87.69.4.28]> <83vbj1u020.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="WqQ7Lj2iO22UJD9Mot10w0tof9Q6VjOGr" X-Trace: ger.gmane.org 1424116937 27035 80.91.229.3 (16 Feb 2015 20:02:17 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 16 Feb 2015 20:02:17 +0000 (UTC) Cc: stephen_leake@stephe-leake.org, emacs-devel@gnu.org To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Feb 16 21:02:07 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YNRrq-0005G4-Tw for ged-emacs-devel@m.gmane.org; Mon, 16 Feb 2015 21:02:07 +0100 Original-Received: from localhost ([::1]:42014 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YNRrq-0005HN-DS for ged-emacs-devel@m.gmane.org; Mon, 16 Feb 2015 15:02:06 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:55021) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YNRra-0005GN-I9 for emacs-devel@gnu.org; Mon, 16 Feb 2015 15:01:52 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YNRrX-0007rx-9h for emacs-devel@gnu.org; Mon, 16 Feb 2015 15:01:50 -0500 Original-Received: from dancol.org ([2600:3c01::f03c:91ff:fedf:adf3]:44045) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YNRrW-0007rj-Rw; Mon, 16 Feb 2015 15:01:47 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=dancol.org; s=x; h=Content-Type:In-Reply-To:References:Subject:CC:To:MIME-Version:From:Date:Message-ID; bh=WGm9gf3YBiRPilTuKWHG/BJq5zUDmJrwvu3IY+Bsx18=; b=N/zVpd5OPacDy71Ocv57x0uh3eCMP9G1bcUipRrV4+2iTDcDjYql5Vl7hPxjgsJC8xdWXWJX6gS+CA7Cx2z9wguCCmpsBcYoc4wFk9cx/DAwMGC1MFDdgB7MHUH1utye1WflrHb+2CCan2nh7kv+gjxoTJ9mGCHbslFzZ0nLFSk4L5tyaGHOn8VN7wiruVVoifCqQeGZV3ijOJUXB0jsEYThr6HDTi9LsXhyYnoUpIc22MJoc4V2sajuMYu2ANIB9XWICw7Q/c82J6S+5F3QlFupMgNSi1/qpctdWYuf9zh3OCj1zlmGBpap5dCcSQAfgmIpOH5g90DjmpqHHQR8KA==; Original-Received: from c-73-221-38-18.hsd1.wa.comcast.net ([73.221.38.18] helo=[192.168.1.210]) by dancol.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.84) (envelope-from ) id 1YNRrU-0000Us-RJ; Mon, 16 Feb 2015 12:01:44 -0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.4.0 In-Reply-To: <83vbj1u020.fsf@gnu.org> X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2600:3c01::f03c:91ff:fedf:adf3 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:183177 Archived-At: This is an OpenPGP/MIME signed message (RFC 4880 and 3156) --WqQ7Lj2iO22UJD9Mot10w0tof9Q6VjOGr Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable On 02/16/2015 11:29 AM, Eli Zaretskii wrote: >> Date: Mon, 16 Feb 2015 10:22:18 -0800 >> From: Daniel Colascione >> CC: stephen_leake@stephe-leake.org, emacs-devel@gnu.org >> >>>> The `size' member tells modules how long the emacs_runtime structure= >>>> is. (It's better to use size than an explicit version field: this wa= y, >>>> .size =3D sizeof(struct emacs_runtime) is always correct.) >>> >>> This approach requires us to change the size each time we change the >>> layout, even if the change itself leaves the size intact. Using a >>> version field doesn't have this disadvantage. >> >> Emacs is allowed to pass a new version of the runtime and context >> structures to modules expecting old versions. That can only work if th= e >> structures are append-only, so `size' is adequate here. >=20 > I understand that, but appending members does not necessarily increase > the size of the struct, due to alignment, bit fields, etc. If it's just a table of function pointers and everything has natural alignment, adding members will increase the size. >>> You say "we don't >>> lock ourselves into conservative stack-scanning GC", which I interpre= t >>> as saying you don't want to rely on stack scanning to avoid a >>> destructive GC in this case. But if we don't rely on that, where's >>> the guarantee that such emacs_value will survive GC? >> >> Emacs stores emacs_value values in a local reference table before >> handing them to module code. >=20 > That's something that wasn't in your original writeup. Sounds like a > significant part of the design. I thought I'd made that clear. I'm glad I've been able to clarify it. Stefan doesn't seem to think the indirection is necessary, but I'm not sure I agree with him, especially because his scheme has a space and speed cost on 32-bit platforms. >>>> We'll represent all Lisp values as an opaque pointer typedef >>>> emacs_value. >>> >>> This doesn't play well with --with-wide-int, where a value can be >>> wider than a pointer. I think we should instead go with intmax_t or >>> inptr_t, whichever is wider on the host. >> >> emacs_value objects are not literally Lisp_Object values. They're >> indirected through a reference table (either local or global) so that = we >> can GC them. A pointer value can address all possible memory locations= , >> so it's fine here. (In the wide-int case, the pointed-to table entries= >> are wider than pointers, but modules don't need to know that.) >=20 > How this will work when some Lisp calls a primitive implemented by a > module. From the Lisp interpreter POV, it's just a call to a > primitive, so how will this special handling come into existence? > (And why? just to save us from picking up a wider data type?) I think there's some confusion about how function calls work. =46rom the Lisp interpreter perspective, we're calling a regular Lisp function. That Lisp function closes over the information needed to call a module function and then calls into an Emacs core function that actually makes the module call. Say a module wants to provide a function "frobricate" taking two arguments. It defines a C-level function (call it do_frobricate) with a signature matching emacs_subr. It then calls `make_function' like this: emacs_value frobricate =3D env->make_function(env, 2, 2, do_frobricate)= if (frobricate =3D=3D NULL) { FAILURE(); } Now `frobricate' is a Lisp value usable in Lisp funcall; if we bind this value to the function slot of some symbol, Lisp code can call frobricate like any other function. Say do_frobricate has address 0xA3B41234. Implementation-wise, the value `frobricate` that `make_function' returned is a Lisp closure that essentially looks like this: (lambda (&rest args) (%do-module-call frobricate-module 0xA3B41234 2 2 args)) %do-module-call then clears the pending-error information, packs its arguments into a C array, and calls 0xA3B41234 like this: emacs_subr module_function =3D (module_function) 0xA3B41234; result =3D module_function(some_env, args, nargs); (I've left out the part where we transform emacs_value to Lisp_Object and vice versa.) >>> What about the doc string? >> >> We'll set that at the lisp level. >=20 > I'm not following: what Lisp level? What I had in mind was a module > that implements a primitive. Are you saying that every primitive a > module wants to implement will need a Lisp part and a C part? That > sounds cumbersome, just to save us from passing yet another argument > to make_function. No: that's silly. I'm saying that creating a globally-named function in Emacs has two parts: 1) create the actual _function object_. 2) bind that function object to a name by setting some symbol's function slot to that function object. We can do step #2 entirely in Lisp code that the module calls, and we can set a documentation property at this time. It's only step #1 that requires a magical incantation. I've only described the magical, ABI-relevant bits of the interface. In order to make a module actually usable, we'll need glue, but since we can implement this glue in Lisp and call it from module code using the narrow interface I've described here, I'm not worried about it. >>>> emacs_value (*funcall)( >>>> emacs_env* env, >>>> emacs_value function, >>>> int nargs, >>>> emacs_value args[]); >>> >>> Shouldn't funcall use emacs_subr? >> >> Why? >=20 > Because otherwise I see no point in having emacs_subr, it's almost > unused. emacs_subr is a typedef for a pointer to a module-provided function. Every module-provided function is an emacs_subr. >>>> If Lisp signals or throws, `funcall' returns NULL. >>> >>> I suggest some other value or indication of that. NULL is a valid >>> return value, so usurping it for errors might be too harsh. >> >> No it isn't. Qnil is distinct from NULL in this model because >> emacs_value is not a Lisp_Object in disguise. Qnil is not special here= =2E >=20 > I wasn't thinking about Qnil. I simply don't see why it is a good > idea to forbid funcall from returning NULL in a normal case. We're not preventing functions from returning NULL in the general case. We're deciding that NULL is not a valid value of emacs_value type. This property makes NULL a good candidate for an error sentinel return value for functions that would otherwise return valid emacs_value values. >>>> `intern' also does the obvious thing. >>> >>> Do we need 'unintern' as well? >> >> Modules can call unintern through Lisp. >=20 > Then why provide 'intern'? It, too, can be called from Lisp. Modules have no way other than `intern' of getting an emacs_value that corresponds to a known symbol. They can call `unintern' by first interning the symbol "unintern", then calling it. >>>> emacs_value (*type_of)( >>>> emacs_env* env, >>>> emacs_value value); >>>> >>>> Like Lisp type-of: returns a symbol. >>> >>> What is a "symbol", from the module's C code POV? >> >> It's an emacs_value. You can compare symbols (by calling `eq'), call >> them as functions, or use them as function arguments. That's sufficien= t. >> >> Come to think of it, though, we do need a C-level `eq'. >=20 > Why not simply return a C enumeration type? Why make the set of elisp primitive types part of the ABI? We already have a way to refer to symbols. There's no need to an enum, which would just be a redundant way of naming types. >>> . direct access to buffer text (using buffer-substring means consing= >>> a lot of strings) >> >> Is that really a problem for the use cases we want to support? >=20 > It could be a problem for a module that processes buffer text. >=20 >>> . not sure how will a module "provide" its feature >> >> It doesn't. You don't require a module: you load it. Most modules will= >> come with a small Lisp wrapper, just like Python modules. >=20 > So do we _require_ a Lisp wrapper? Otherwise, how would Emacs know > whether the module is loaded or not? By path, I imagine. Why don't you have this problem with (load "myfile.el")? I'm not opposed to hooking modules into the provide-and-require machinery, but I don't see a particular need either. >>> One thing that's inconvenient is the need to drag the environment >>> pointer through all the calls. Why exactly is that needed? >> >> Modules have no other way of accessing Emacs internals. Dragging one >> parameter through the system isn't that bad and lets us sanely track >> which module is doing what. >=20 > The environment must be known to the module, that's for sure. But why > does it have to be plugged into every call to every interface > function? How else are the interface functions supposed to know the module for which they're being called? >>> Also, the buffer returned by find-file when it returns normally is >>> lost here, isn't it? >> >> Sure. It's just a tiny example, and the returned buffer itself is stil= l >> GCed. >=20 > That's not what bothered me. I don't understand how will the module > receive that return value in the first place. An additional argument > to funcall, perhaps? funcall returns an emacs_value corresponding to the value the called function returned. Here's an alternate implementation of emacs_find_file:= emacs_value emacs_find_file(emacs_env* env, const char* filename) { emacs_value e_filename =3D env->make_string(env, filename); if(env->error_check(env)) return NULL; emacs_value e_find_file =3D env->intern(env, "find-file"); if(env->error_check(env)) return NULL; return env->funcall(env, e_find_file, &e_filename, 1); } Now emacs_find_file returns NULL on error (with error information set in the environment) and an emacs_value on success. The returned value corresponds to whatever find-file returned, so in this case, it's a buffe= r. --WqQ7Lj2iO22UJD9Mot10w0tof9Q6VjOGr Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iQIcBAEBCAAGBQJU4kykAAoJEN4WImmbpWBlhAkP/0pkFPDVwlIbwGYmvN2Th+7q AY2jgg8Tz+/atFvsdB7Jt+jk+WZlpKn7zybD0L1VFU7p+B5YTY+JuWUyMtUubDKr 3NkTNGR/CIN1zpiCRn157w7pDwXWSOjoGvVEW/ky/nw/2NJUOPCf0YoLx7LWJyuR VMy+1S1wIaf6BqZMMr3UzmwY9B871h+v7XrLwIpkV1M/1WKCv0ZDwK+W2OvmopCr 9bUvxPe7a+RqeU9aUthj/KQYtkvTOdZvM6bPS7SZil43bzWgK/6MXJw9mQlhsIWs rk9C7GGaqebXB+JelemDjN3KLNmhbP4f8tN/OCkBsNs51TZrtoWze1lvWLTpEXfy UyM3Kf7gWdL4pb7R6E2Hlndh3ci9ZS6x3U17J3LLU3jaKAvnRU49jo1wRhx84/2h u/sN+C9jiF4LsrkfOsl15duLeDZZzAauxX5xam4mSg9cZdoOrJOw0oI0YOWispPr 6gt/eUYusR9QXyMc48YnSc4N4r38AYX2jYmWd4gJ2VbtWs6h971BzeemyhPA0HWx TgKAZLtisJE345zyBVhOGzf/D9XkAXXTFGj72CXLsOT+y1HsuDe5BbsQo6tjkaYs I6nOsVPkhF1JjgPsYrVyfd9PH6ipQS3C43KmvX9up6TfOGjO0eZSQnqATEWZOpsI 1Mrsi28t4m/WKYJDGWfZ =Yw7q -----END PGP SIGNATURE----- --WqQ7Lj2iO22UJD9Mot10w0tof9Q6VjOGr--