unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#41955: 28.0.50; Monorepos and project.el
@ 2020-06-19 20:42 Theodor Thornhill
  2020-06-20  7:17 ` Eli Zaretskii
                   ` (2 more replies)
  0 siblings, 3 replies; 10+ messages in thread
From: Theodor Thornhill @ 2020-06-19 20:42 UTC (permalink / raw)
  To: 41955


Hi!

At work I've had the following issue. Assume we have some sort of
gigarepo, like this maybe:

gigarepo/
├── clients
│   ├── client1
│   ├── client2
│   ├── client3
│   └── client4
└── services
    ├── service1
    ├── service2
    ├── service3
    └── service4

The services are made in different languages, say, some in c#, f#, java -
as are clients, some are Typescript projects, some are JS etc etc.

Everything is rooted in a ".git" in "gigarepo/", and there are no
submodules or any fancyness.

As project.el works now, there are several issues arising. I'll just
note them down here, and probably split things up later, if that is ok.

* Lsp server/client
  In most of the projects, the lsp servers are indexing from what they
  consider root. Typically a tsconfig.json, elm.json etc. They get
  confused, eglot a bit more than lsp-mode (it has its own root finding
  algorithm). What happens is they look in root (gigarepo/), and it has
  no executable for lsp-server. One solution is then to install the
  server in root, but then it indexes the whole thing, and gets super
  slow, and indexes a lot of unrelated stuff (I'm not even working on
  client 2-4.)
  
* Buffer switching
  Lets say several of the clients uses a module called AuthService.ts.
  If I'm working on several of these projects you get a lot of identical
  files, so it is a bit hit and miss.
  
* Grepping
  This one was the worst for me, since grepping was very slow given the
  size of the project, and grepping loads of unrelated files returns a
  lot of noise. 


What would be nice is to be able to get the benefits of the vc-dir
version of project.el, but not having to "git init" inside the child
projects. More specifically, to be able to choose "project context",
one as the closest project-root and one as the gigarepo project-root.

Also, If I've worked on both a service and a client, running git or
magit should probably be done from root rather than the subproject I am
currently in, to get the whole context.

Is there a way to do this? I realize this may be an odd situation, but
it came up nonetheless.

This report is getting long, and I think I'm already rambling a bit (and
forgetting stuff.), so I'll leave it for now.

Theo

P.S:
What I did to circumvent this (this is only a MVP, not at all optimized
for anything):

(defvar project-root-markers
  '("package.json"
    "tsconfig.json"
    "jsconfig.json"
    "elm.json"
    "*.sln")
  "Files or directories that indicate the root of a project.")

(defvar project-exclusion-list
  '("node_modules"
    "target"
    "build"
    "package-lock.json"
    "elm-stuff")
  "Things not to be included in project-find-file.")

(defun project-exclude-dirs ()
  (mapcar (lambda (dir)
          (add-to-list 'vc-directory-exclusion-list dir))
        project-exclusion-list))

(defun project-find-root (path)
  "Tail-recursive search in PATH for root markers."
  (let* ((this-dir (file-name-as-directory (file-truename path)))
         (parent-dir (expand-file-name (concat this-dir "../")))
         (system-root-dir (expand-file-name "/")))
    (cond
     ((project-root-p this-dir) (cons 'transient this-dir))
     ((equal system-root-dir this-dir) nil)
     (t (project-find-root parent-dir)))))

(defun project-root-p (path)
  "Check if current PATH has any of project root markers."
  (let ((results (mapcar (lambda (marker)
                           (file-exists-p (concat path marker)))
                         project-root-markers)))
    (eval `(or ,@ results))))






^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-19 20:42 bug#41955: 28.0.50; Monorepos and project.el Theodor Thornhill
@ 2020-06-20  7:17 ` Eli Zaretskii
  2020-06-20  7:48   ` Theodor Thornhill
  2020-06-20 23:29 ` Juri Linkov
  2020-06-21  1:48 ` Dmitry Gutov
  2 siblings, 1 reply; 10+ messages in thread
From: Eli Zaretskii @ 2020-06-20  7:17 UTC (permalink / raw)
  To: Theodor Thornhill; +Cc: 41955

> Date: Fri, 19 Jun 2020 20:42:26 +0000
> From: Theodor Thornhill <theo@thornhill.no>
> 
> gigarepo/
> ├── clients
> │   ├── client1
> │   ├── client2
> │   ├── client3
> │   └── client4
> └── services
>     ├── service1
>     ├── service2
>     ├── service3
>     └── service4
> 
> The services are made in different languages, say, some in c#, f#, java -
> as are clients, some are Typescript projects, some are JS etc etc.
> 
> Everything is rooted in a ".git" in "gigarepo/", and there are no
> submodules or any fancyness.
> 
> As project.el works now, there are several issues arising. I'll just
> note them down here, and probably split things up later, if that is ok.
> 
> * Lsp server/client
>   In most of the projects, the lsp servers are indexing from what they
>   consider root. Typically a tsconfig.json, elm.json etc. They get
>   confused, eglot a bit more than lsp-mode (it has its own root finding
>   algorithm). What happens is they look in root (gigarepo/), and it has
>   no executable for lsp-server. One solution is then to install the
>   server in root, but then it indexes the whole thing, and gets super
>   slow, and indexes a lot of unrelated stuff (I'm not even working on
>   client 2-4.)
>   
> * Buffer switching
>   Lets say several of the clients uses a module called AuthService.ts.
>   If I'm working on several of these projects you get a lot of identical
>   files, so it is a bit hit and miss.
>   
> * Grepping
>   This one was the worst for me, since grepping was very slow given the
>   size of the project, and grepping loads of unrelated files returns a
>   lot of noise. 
> 
> 
> What would be nice is to be able to get the benefits of the vc-dir
> version of project.el, but not having to "git init" inside the child
> projects. More specifically, to be able to choose "project context",
> one as the closest project-root and one as the gigarepo project-root.

Would it help to have facilities of specifying the files in a project
by starting with an empty project, and then adding the files one by
one?  Also, to be able to say that all the files in a given directory
(optionally, only files that match some shell wildcard), recursively,
should be added to a project?

This would probably need a way of making the list of files/directories
in a project persistent between sessions, because currently we rely on
the filesystem or a VCS to record that.





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-20  7:17 ` Eli Zaretskii
@ 2020-06-20  7:48   ` Theodor Thornhill
  2020-06-20  8:57     ` Eli Zaretskii
  2020-06-22 17:17     ` Felician Nemeth
  0 siblings, 2 replies; 10+ messages in thread
From: Theodor Thornhill @ 2020-06-20  7:48 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: 41955

[-- Attachment #1: Type: text/plain, Size: 735 bytes --]

> Would it help to have facilities of specifying the files in a project
> by starting with an empty project, and then adding the files one by
> one? Also, to be able to say that all the files in a given directory
> (optionally, only files that match some shell wildcard), recursively,
> should be added to a project?

Yeah! I actually suggested something like that. However, this will not fix the issue with programs looking for a specific directory for root. Such as eglot and whatever-language-server.

> This would probably need a way of making the list of files/directories
> in a project persistent between sessions, because currently we rely on
> the filesystem or a VCS to record that.

Yeah, maybe something like that too.
Theo

[-- Attachment #2: Type: text/html, Size: 965 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-20  7:48   ` Theodor Thornhill
@ 2020-06-20  8:57     ` Eli Zaretskii
  2020-06-21  1:25       ` Dmitry Gutov
  2020-06-22 17:17     ` Felician Nemeth
  1 sibling, 1 reply; 10+ messages in thread
From: Eli Zaretskii @ 2020-06-20  8:57 UTC (permalink / raw)
  To: Theodor Thornhill; +Cc: 41955

> Date: Sat, 20 Jun 2020 07:48:53 +0000
> From: Theodor Thornhill <theo@thornhill.no>
> Cc: 41955@debbugs.gnu.org
> 
>  Would it help to have facilities of specifying the files in a project
>  by starting with an empty project, and then adding the files one by
>  one? Also, to be able to say that all the files in a given directory
>  (optionally, only files that match some shell wildcard), recursively,
>  should be added to a project?
> 
> Yeah! I actually suggested something like that. However, this will not fix the issue with programs looking for a
> specific directory for root. Such as eglot and whatever-language-server. 

They will need to learn to use the facilities we provide to define a
project in terms other than just the root directory.





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-19 20:42 bug#41955: 28.0.50; Monorepos and project.el Theodor Thornhill
  2020-06-20  7:17 ` Eli Zaretskii
@ 2020-06-20 23:29 ` Juri Linkov
  2020-06-21  1:23   ` Dmitry Gutov
  2020-06-21  1:48 ` Dmitry Gutov
  2 siblings, 1 reply; 10+ messages in thread
From: Juri Linkov @ 2020-06-20 23:29 UTC (permalink / raw)
  To: Theodor Thornhill; +Cc: 41955

> What would be nice is to be able to get the benefits of the vc-dir
> version of project.el, but not having to "git init" inside the child
> projects. More specifically, to be able to choose "project context",
> one as the closest project-root and one as the gigarepo project-root.

Isn't this the same problem that we discussed recently in bug#41572 to
define a project's root using a special file .project or more generally
.dir-locals.el.





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-20 23:29 ` Juri Linkov
@ 2020-06-21  1:23   ` Dmitry Gutov
  0 siblings, 0 replies; 10+ messages in thread
From: Dmitry Gutov @ 2020-06-21  1:23 UTC (permalink / raw)
  To: Juri Linkov, Theodor Thornhill; +Cc: 41955

On 21.06.2020 02:29, Juri Linkov wrote:
> Isn't this the same problem that we discussed recently in bug#41572 to
> define a project's root using a special file .project or more generally
> .dir-locals.el.

Perhaps.

But as a monorepo with lots of files, this project (or a set of 
projects) would be particularly susceptible to the problem of slow file 
listing if the new backend doesn't reuse 'git ls-files' somehow, or 
implements some fast strategy of its own.





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-20  8:57     ` Eli Zaretskii
@ 2020-06-21  1:25       ` Dmitry Gutov
  0 siblings, 0 replies; 10+ messages in thread
From: Dmitry Gutov @ 2020-06-21  1:25 UTC (permalink / raw)
  To: Eli Zaretskii, Theodor Thornhill; +Cc: 41955

On 20.06.2020 11:57, Eli Zaretskii wrote:
> They will need to learn to use the facilities we provide to define a
> project in terms other than just the root directory.

Eglot just cares about the return value of project-root.





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-19 20:42 bug#41955: 28.0.50; Monorepos and project.el Theodor Thornhill
  2020-06-20  7:17 ` Eli Zaretskii
  2020-06-20 23:29 ` Juri Linkov
@ 2020-06-21  1:48 ` Dmitry Gutov
  2 siblings, 0 replies; 10+ messages in thread
From: Dmitry Gutov @ 2020-06-21  1:48 UTC (permalink / raw)
  To: Theodor Thornhill, 41955; +Cc: João Távora

On 19.06.2020 23:42, Theodor Thornhill wrote:

> At work I've had the following issue. Assume we have some sort of
> gigarepo, like this maybe:
> 
> gigarepo/
> ├── clients
> │   ├── client1
> │   ├── client2
> │   ├── client3
> │   └── client4
> └── services
>      ├── service1
>      ├── service2
>      ├── service3
>      └── service4
> 
> The services are made in different languages, say, some in c#, f#, java -
> as are clients, some are Typescript projects, some are JS etc etc.
> 
> Everything is rooted in a ".git" in "gigarepo/", and there are no
> submodules or any fancyness.
> 
> As project.el works now, there are several issues arising. I'll just
> note them down here, and probably split things up later, if that is ok.

Thank you for the report and the description. Before moving on to more 
complete solutions, here are some initial comments:

> * Lsp server/client
>    In most of the projects, the lsp servers are indexing from what they
>    consider root. Typically a tsconfig.json, elm.json etc. They get
>    confused, eglot a bit more than lsp-mode (it has its own root finding
>    algorithm). What happens is they look in root (gigarepo/), and it has
>    no executable for lsp-server. One solution is then to install the
>    server in root, but then it indexes the whole thing, and gets super
>    slow, and indexes a lot of unrelated stuff (I'm not even working on
>    client 2-4.)

I think Eglot is wrong in simply defaulting to project-root, without 
looking for tsconfig.json, etc. It would be pretty easy to implement, 
and would bring it closer to VS Code's behavior in this aspect (I think).

But that would require it to maintain a list of such files.

> * Buffer switching
>    Lets say several of the clients uses a module called AuthService.ts.
>    If I'm working on several of these projects you get a lot of identical
>    files, so it is a bit hit and miss.

Uniquify should help, but yeah, it sounds like a pain point indeed.

> * Grepping
>    This one was the worst for me, since grepping was very slow given the
>    size of the project, and grepping loads of unrelated files returns a
>    lot of noise.

This is just a small step, but have you tried the patch here (with rg)?

https://lists.gnu.org/archive/html/emacs-devel/2020-06/msg00547.html

> What would be nice is to be able to get the benefits of the vc-dir
> version of project.el, but not having to "git init" inside the child
> projects. More specifically, to be able to choose "project context",
> one as the closest project-root and one as the gigarepo project-root.

Here's some code based on yours.

Also rough and unofficial, but should hopefully be faster.

(defvar project-root-markers
   '("package.json"
     "tsconfig.json"
     "jsconfig.json"
     "elm.json"
     "*.sln")
   "Files or directories that indicate the root of a project.")

;; Probably better in .dir-locals.el.
(setq project-vc-ignores
       '("node_modules"
         "target"
         "build"
         "package-lock.json"
         "elm-stuff"))

(defun project-find-nomono (dir)
   (let ((root
          (locate-dominating-file
           dir
           (lambda (d)
             (let ((default-directory d))
               (seq-some #'file-expand-wildcards
                         project-root-markers))))))
     (when (and root
                (ignore-errors
                  (vc-responsible-backend root)))
       (cons 'vc root))))

(add-hook 'project-find-functions #'project-find-nomono)






^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-20  7:48   ` Theodor Thornhill
  2020-06-20  8:57     ` Eli Zaretskii
@ 2020-06-22 17:17     ` Felician Nemeth
  2020-06-22 21:06       ` Dmitry Gutov
  1 sibling, 1 reply; 10+ messages in thread
From: Felician Nemeth @ 2020-06-22 17:17 UTC (permalink / raw)
  To: 41955; +Cc: Theodor Thornhill, Dmitry Gutov

>> Would it help to have facilities of specifying the files in a project
>> by starting with an empty project, and then adding the files one by
>> one? Also, to be able to say that all the files in a given directory
>> (optionally, only files that match some shell wildcard), recursively,
>> should be added to a project?

I'm not completely sure, because I haven't really used EDE (the Emacs
Development Environment), but I think you can specify a project like
that in EDE.  And project.el supports EDE if I recall correctly.  So
maybe this complex "monorepo" use-case is already covered by existing
features of Emacs.





^ permalink raw reply	[flat|nested] 10+ messages in thread

* bug#41955: 28.0.50; Monorepos and project.el
  2020-06-22 17:17     ` Felician Nemeth
@ 2020-06-22 21:06       ` Dmitry Gutov
  0 siblings, 0 replies; 10+ messages in thread
From: Dmitry Gutov @ 2020-06-22 21:06 UTC (permalink / raw)
  To: Felician Nemeth, 41955; +Cc: Theodor Thornhill

On 22.06.2020 20:17, Felician Nemeth wrote:
> I'm not completely sure, because I haven't really used EDE (the Emacs
> Development Environment), but I think you can specify a project like
> that in EDE.  And project.el supports EDE if I recall correctly.  So
> maybe this complex "monorepo" use-case is already covered by existing
> features of Emacs.

Interesting thought.

The current EDE integration is pretty basic (only project-root, 
basically), but it can be extended fairly quickly, if an EDE project 
contains relevant data.

But that doesn't seem to be the case: it seems to me that EDE uses the 
same "one directory tree" model for the contents of the project. And the 
:include-path and :system-include-path keys we see in the definition 
examples are about code analysis (where to find external type and 
function definitions), and not about describing the main bulk of the 
files belonging to the project.





^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2020-06-22 21:06 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-06-19 20:42 bug#41955: 28.0.50; Monorepos and project.el Theodor Thornhill
2020-06-20  7:17 ` Eli Zaretskii
2020-06-20  7:48   ` Theodor Thornhill
2020-06-20  8:57     ` Eli Zaretskii
2020-06-21  1:25       ` Dmitry Gutov
2020-06-22 17:17     ` Felician Nemeth
2020-06-22 21:06       ` Dmitry Gutov
2020-06-20 23:29 ` Juri Linkov
2020-06-21  1:23   ` Dmitry Gutov
2020-06-21  1:48 ` Dmitry Gutov

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).