* Impossible to remove all offload machines @ 2024-08-17 16:40 Ian Eure 2024-09-14 14:55 ` bug#72686: " Maxim Cournoyer 0 siblings, 1 reply; 9+ messages in thread From: Ian Eure @ 2024-08-17 16:40 UTC (permalink / raw) To: bug-guix; +Cc: guix-devel Ran into this issue last week. If you: - Configure some offload build machines in your operating-system configuration. - Reconfigure your system. - Remove all offload build machines. - Reconfigure your system again. ...then various guix operations will still try to connect to offload machines, even if you reboot the affected client. This is caused by a bug in the `guix-activation' procedure: ;; ... and /etc/guix/machines.scm. #$(if (null? (guix-configuration-build-machines config)) #~#f (guix-machines-files-installation #~(list #$@(guix-configuration-build-machines config)))) If there are no build machines defined in the configuration, no operation is performed (#f is returned), which leaves the previous generation’s /etc/guix/machines.scm in place. The same issue appears to affect channels: ;; ... and /etc/guix/channels.scm... #$(and channels (install-channels-file channels)) I’d be happy to take a stab at fixing this, but I’m not certain what direction to go, or how much to refactor to get there. Should the channels/machines files be removed (ignoring errors if they don’t exist)? Should empty files be installed? Should that happen inline in `guix-activation', or in another procedure? Should the filenames be extracted to %variables to avoid duplicating between the two places they’ll be used? If someone would like to provide answered, I would contribute a patch. Thanks, — Ian ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: bug#72686: Impossible to remove all offload machines 2024-08-17 16:40 Impossible to remove all offload machines Ian Eure @ 2024-09-14 14:55 ` Maxim Cournoyer 2024-09-15 3:24 ` Ian Eure 0 siblings, 1 reply; 9+ messages in thread From: Maxim Cournoyer @ 2024-09-14 14:55 UTC (permalink / raw) To: Ian Eure; +Cc: 72686, guix-devel Hi Ian, Ian Eure <ian@retrospec.tv> writes: > Ran into this issue last week. If you: > > - Configure some offload build machines in your operating-system > configuration. > - Reconfigure your system. > - Remove all offload build machines. > - Reconfigure your system again. > > ...then various guix operations will still try to connect to offload > machines, even if you reboot the affected client. > > This is caused by a bug in the `guix-activation' procedure: > > ;; ... and /etc/guix/machines.scm. > #$(if (null? (guix-configuration-build-machines config)) > #~#f > (guix-machines-files-installation > #~(list #$@(guix-configuration-build-machines > config)))) > > If there are no build machines defined in the configuration, no > operation is performed (#f is returned), which leaves the previous > generation’s /etc/guix/machines.scm in place. > > The same issue appears to affect channels: > > ;; ... and /etc/guix/channels.scm... > #$(and channels (install-channels-file channels)) Interesting! > I’d be happy to take a stab at fixing this, but I’m not certain what > direction to go, or how much to refactor to get there. Should the > channels/machines files be removed (ignoring errors if they don’t > exist)? Should empty files be installed? Should that happen inline > in `guix-activation', or in another procedure? Should the filenames be > extracted to %variables to avoid duplicating between the two places > they’ll be used? > > If someone would like to provide answered, I would contribute a patch. I guess the simplest would be to attempt to remove the files when there are no offload machines or channels, in this already existing activation procedure. Extracting the file names to %variables sounds preferable yes, if there's a logical place to store them that is easily shared. A patch would be dandy! -- Thanks, Maxim ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: bug#72686: Impossible to remove all offload machines 2024-09-14 14:55 ` bug#72686: " Maxim Cournoyer @ 2024-09-15 3:24 ` Ian Eure 2024-09-15 3:53 ` Ian Eure 2024-09-22 2:26 ` Maxim Cournoyer 0 siblings, 2 replies; 9+ messages in thread From: Ian Eure @ 2024-09-15 3:24 UTC (permalink / raw) To: Maxim Cournoyer; +Cc: 72686, guix-devel Hi Maxim, Maxim Cournoyer <maxim.cournoyer@gmail.com> writes: > Hi Ian, > > Ian Eure <ian@retrospec.tv> writes: > >> Ran into this issue last week. If you: >> >> - Configure some offload build machines in your >> operating-system >> configuration. >> - Reconfigure your system. >> - Remove all offload build machines. >> - Reconfigure your system again. >> >> ...then various guix operations will still try to connect to >> offload >> machines, even if you reboot the affected client. >> >> This is caused by a bug in the `guix-activation' procedure: >> >> ;; ... and /etc/guix/machines.scm. >> #$(if (null? (guix-configuration-build-machines config)) >> #~#f >> (guix-machines-files-installation >> #~(list #$@(guix-configuration-build-machines >> config)))) >> >> If there are no build machines defined in the configuration, no >> operation is performed (#f is returned), which leaves the >> previous >> generation’s /etc/guix/machines.scm in place. >> >> The same issue appears to affect channels: >> >> ;; ... and /etc/guix/channels.scm... >> #$(and channels (install-channels-file channels)) > > Interesting! > >> I’d be happy to take a stab at fixing this, but I’m not certain >> what >> direction to go, or how much to refactor to get there. Should >> the >> channels/machines files be removed (ignoring errors if they >> don’t >> exist)? Should empty files be installed? Should that happen >> inline >> in `guix-activation', or in another procedure? Should the >> filenames be >> extracted to %variables to avoid duplicating between the two >> places >> they’ll be used? >> >> If someone would like to provide answered, I would contribute a >> patch. > > I guess the simplest would be to attempt to remove the files > when there > are no offload machines or channels, in this already existing > activation > procedure. Extracting the file names to %variables sounds > preferable > yes, if there's a logical place to store them that is easily > shared. > As I was putting together a patch for this, I realized there’s a problem: if a user is *manually* managing either /etc/guix/machines.scm or channels.scm, these files would be deleted, which likely isn’t what they want. The current code lets users choose to manage these files manually or declaritively, and there’s no way to know if the files on disk are the result of a previous system generation or a user’s creation. Since the channel management is a relatively new feature, I suspect there are quite a few folks with manually-managed channels that this would negatively impact. I know there was some disruption just moving to declaritive management of channels (but I can’t find the thread/s at the moment). I don’t see an elegant technical solution to this. I think the best option is probably to say that those files should *always* be managed through operating-system, and put a fat warning in the channel news to update your config if they’re still handled manually. The only other option I can see would be to keep the existing filenames for user configuration, and declaritively manage different files -- like declaritive-channels.scm. This comes with its own set of problems, like needing to update the Guix daemon to read and combine multiple files; and the inability to know whether a given `channels.scm' is declaritively- or manually-managed means a bumpy upgrade path (ex. should this preexisting channels.scm file be left as-is, or renamed to the new name?) I’m inclined to go with the fat-warning option, but am also thinking this likely needs some guix-devel discussion. What do you think? Thanks, — Ian ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: bug#72686: Impossible to remove all offload machines 2024-09-15 3:24 ` Ian Eure @ 2024-09-15 3:53 ` Ian Eure 2024-09-15 19:06 ` Tomas Volf 2024-09-22 2:26 ` Maxim Cournoyer 1 sibling, 1 reply; 9+ messages in thread From: Ian Eure @ 2024-09-15 3:53 UTC (permalink / raw) To: Maxim Cournoyer; +Cc: 72686, guix-devel Hi Maxim, Ian Eure <ian@retrospec.tv> writes: > Hi Maxim, > > Maxim Cournoyer <maxim.cournoyer@gmail.com> writes: > >> Hi Ian, >> >> Ian Eure <ian@retrospec.tv> writes: >> >>> Ran into this issue last week. If you: >>> >>> - Configure some offload build machines in your >>> operating-system >>> configuration. >>> - Reconfigure your system. >>> - Remove all offload build machines. >>> - Reconfigure your system again. >>> >>> ...then various guix operations will still try to connect to >>> offload >>> machines, even if you reboot the affected client. >>> >>> This is caused by a bug in the `guix-activation' procedure: >>> >>> ;; ... and /etc/guix/machines.scm. >>> #$(if (null? (guix-configuration-build-machines config)) >>> #~#f >>> (guix-machines-files-installation >>> #~(list #$@(guix-configuration-build-machines >>> config)))) >>> >>> If there are no build machines defined in the configuration, >>> no >>> operation is performed (#f is returned), which leaves the >>> previous >>> generation’s /etc/guix/machines.scm in place. >>> >>> The same issue appears to affect channels: >>> >>> ;; ... and /etc/guix/channels.scm... >>> #$(and channels (install-channels-file channels)) >> >> Interesting! >> >>> I’d be happy to take a stab at fixing this, but I’m not >>> certain >>> what >>> direction to go, or how much to refactor to get there. Should >>> the >>> channels/machines files be removed (ignoring errors if they >>> don’t >>> exist)? Should empty files be installed? Should that happen >>> inline >>> in `guix-activation', or in another procedure? Should the >>> filenames >>> be >>> extracted to %variables to avoid duplicating between the two >>> places >>> they’ll be used? >>> >>> If someone would like to provide answered, I would contribute >>> a >>> patch. >> >> I guess the simplest would be to attempt to remove the files >> when >> there >> are no offload machines or channels, in this already existing >> activation >> procedure. Extracting the file names to %variables sounds >> preferable >> yes, if there's a logical place to store them that is easily >> shared. >> > > As I was putting together a patch for this, I realized there’s a > problem: if a user is *manually* managing either > /etc/guix/machines.scm or channels.scm, these files would be > deleted, > which likely isn’t what they want. The current code lets users > choose > to manage these files manually or declaritively, and there’s no > way to > know if the files on disk are the result of a previous system > generation or a user’s creation. Since the channel management > is a > relatively new feature, I suspect there are quite a few folks > with > manually-managed channels that this would negatively impact. I > know > there was some disruption just moving to declaritive management > of > channels (but I can’t find the thread/s at the moment). > > I don’t see an elegant technical solution to this. I think the > best > option is probably to say that those files should *always* be > managed > through operating-system, and put a fat warning in the channel > news to > update your config if they’re still handled manually. > > The only other option I can see would be to keep the existing > filenames for user configuration, and declaritively manage > different > files -- like declaritive-channels.scm. This comes with its own > set > of problems, like needing to update the Guix daemon to read and > combine multiple files; and the inability to know whether a > given > `channels.scm' is declaritively- or manually-managed means a > bumpy > upgrade path (ex. should this preexisting channels.scm file be > left > as-is, or renamed to the new name?) > > I’m inclined to go with the fat-warning option, but am also > thinking > this likely needs some guix-devel discussion. > > What do you think? > Disregard this, I continued thinking after sending the email (as one does) and realized that any managed file will be a link into the store -- so if the system is reconfigured with no build-machines or channels *and* the corresponding file is a store link, it should be removed; otherwise, it should remain untouched. I can work with this. Thanks, — Ian ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: bug#72686: Impossible to remove all offload machines 2024-09-15 3:53 ` Ian Eure @ 2024-09-15 19:06 ` Tomas Volf 2024-09-19 0:35 ` Ian Eure 0 siblings, 1 reply; 9+ messages in thread From: Tomas Volf @ 2024-09-15 19:06 UTC (permalink / raw) To: Ian Eure; +Cc: Maxim Cournoyer, 72686, guix-devel [-- Attachment #1: Type: text/plain, Size: 702 bytes --] Hello, Ian Eure <ian@retrospec.tv> writes: > Disregard this, I continued thinking after sending the email (as one does) and > realized that any managed file will be a link into the store -- so if the system > is reconfigured with no build-machines or channels *and* the corresponding file > is a store link, it should be removed; otherwise, it should remain untouched. I > can work with this. Will this correctly handle cases where user is managing the file using for example extra-special-file? I wonder whether fat-warning approach would not be better. Tomas -- There are only two hard things in Computer Science: cache invalidation, naming things and off-by-one errors. [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 853 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#72686: Impossible to remove all offload machines 2024-09-15 19:06 ` Tomas Volf @ 2024-09-19 0:35 ` Ian Eure 0 siblings, 0 replies; 9+ messages in thread From: Ian Eure @ 2024-09-19 0:35 UTC (permalink / raw) To: Tomas Volf; +Cc: guix-devel, 72686, Maxim Cournoyer Tomas Volf <~@wolfsden.cz> writes: > [[PGP Signed Part:Undecided]] > > Hello, > > Ian Eure <ian@retrospec.tv> writes: > >> Disregard this, I continued thinking after sending the email >> (as one does) and >> realized that any managed file will be a link into the store -- >> so if the system >> is reconfigured with no build-machines or channels *and* the >> corresponding file >> is a store link, it should be removed; otherwise, it should >> remain untouched. I >> can work with this. > > Will this correctly handle cases where user is managing the file > using > for example extra-special-file? > No, it wouldn’t. > I wonder whether fat-warning approach would not be better. > I think I agree. — Ian ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: bug#72686: Impossible to remove all offload machines 2024-09-15 3:24 ` Ian Eure 2024-09-15 3:53 ` Ian Eure @ 2024-09-22 2:26 ` Maxim Cournoyer 2024-12-01 19:05 ` Ian Eure 1 sibling, 1 reply; 9+ messages in thread From: Maxim Cournoyer @ 2024-09-22 2:26 UTC (permalink / raw) To: Ian Eure; +Cc: 72686, guix-devel Hi Ian, Ian Eure <ian@retrospec.tv> writes: [...] > The only other option I can see would be to keep the existing > filenames for user configuration, and declaritively manage different > files -- like declaritive-channels.scm. This comes with its own set > of problems, like needing to update the Guix daemon to read and > combine multiple files; and the inability to know whether a given > `channels.scm' is declaritively- or manually-managed means a bumpy > upgrade path (ex. should this preexisting channels.scm file be left > as-is, or renamed to the new name?) I'd think that be a great option to pursue, although it's more work more thoughts. Perhaps it could work along these lines (brainstorming) I like the idea to leave the original, potentially manually written file in place and complement it with a declarative counterpart. The same would also have benefited /etc/guix/acl, which suffers from the same ambiguity. -- Thanks, Maxim ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: bug#72686: Impossible to remove all offload machines 2024-09-22 2:26 ` Maxim Cournoyer @ 2024-12-01 19:05 ` Ian Eure 2025-01-02 15:02 ` Maxim Cournoyer 0 siblings, 1 reply; 9+ messages in thread From: Ian Eure @ 2024-12-01 19:05 UTC (permalink / raw) To: Maxim Cournoyer; +Cc: 72686, guix-devel Hi Maxim, Maxim Cournoyer <maxim.cournoyer@gmail.com> writes: > Hi Ian, > > Ian Eure <ian@retrospec.tv> writes: > > [...] > >> The only other option I can see would be to keep the existing >> filenames for user configuration, and declaritively manage >> different >> files -- like declaritive-channels.scm. This comes with its >> own set >> of problems, like needing to update the Guix daemon to read and >> combine multiple files; and the inability to know whether a >> given >> `channels.scm' is declaritively- or manually-managed means a >> bumpy >> upgrade path (ex. should this preexisting channels.scm file be >> left >> as-is, or renamed to the new name?) > > I'd think that be a great option to pursue, although it's more > work more > thoughts. Perhaps it could work along these lines > (brainstorming) > > I like the idea to leave the original, potentially manually > written file > in place and complement it with a declarative counterpart. The > same > would also have benefited /etc/guix/acl, which suffers from the > same > ambiguity. > Apologies for the silence, life stuff has been eating most of my free time, but I have a bit of bandwidth to spend on this problem again. I took a swing at this, it wasn’t as difficult as I expected. While this approach gives a smooth upgrade path for those who’ve configured channels in a stateful way switching to declarative configuration, it’s possibly bumpy for those already using a declarative config. If a machine with declarative channels is reconfigured, the channels will be duplicated from /etc/guix/channels.scm to /etc/guix/channels-declarative.scm. Using `delete-duplicates' on the merged channels should avoid major problems, but I think it still needs a loud entry in news and manual action (deleting /etc/guix/channels.scm) to upgrade. Given that both approaches will require manual action, I’m a bit inclined to go with the simpler, and take over the existing file. That said, I think the failure mode of the simpler approach (stomping on channels a user may have configured) is undeniably worse than potentially duplicating channels or continuing to pull in old ones unexpectedly. Do either of you have a strong opinion or more information which would help guide this decision? The root issue at work behind all these problems is that activation code only sees the desired target config, rather than the current and target configs. Comparing the current and target configs would allow the code to more precisely compute the needd change to move from one state to the next. I think that could be a good change to make, though it’s obviously going to be much more involved, and IMO will require discussion outside the scope of this specific bug. I have a draft patch series I hope to send up soon, but need to get Guix System up in a VM to test first. It does separate declarative channels into their own config, but doesn’t do the same for build machines. While I think many fewer users configure build machines than channels, it’s probably a good idea to use the same approach for both channels and machines. — Ian ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#72686: Impossible to remove all offload machines 2024-12-01 19:05 ` Ian Eure @ 2025-01-02 15:02 ` Maxim Cournoyer 0 siblings, 0 replies; 9+ messages in thread From: Maxim Cournoyer @ 2025-01-02 15:02 UTC (permalink / raw) To: Ian Eure; +Cc: guix-devel, 72686 Hi Ian, Ian Eure <ian@retrospec.tv> writes: [...] > Apologies for the silence, life stuff has been eating most of my free > time, but I have a bit of bandwidth to spend on this problem again. As you can see, you are not alone :-). > I took a swing at this, it wasn’t as difficult as I expected. While > this approach gives a smooth upgrade path for those who’ve configured > channels in a stateful way switching to declarative configuration, > it’s possibly bumpy for those already using a declarative config. If > a machine with declarative channels is reconfigured, the channels will > be duplicated from /etc/guix/channels.scm to > /etc/guix/channels-declarative.scm. Using `delete-duplicates' on the > merged channels should avoid major problems, but I think it still > needs a loud entry in news and manual action (deleting > /etc/guix/channels.scm) to upgrade. Given that both approaches will > require manual action, I’m a bit inclined to go with the simpler, and > take over the existing file. That said, I think the failure mode of > the simpler approach (stomping on channels a user may have configured) > is undeniably worse than potentially duplicating channels or > continuing to pull in old ones unexpectedly. Do either of you have a > strong opinion or more information which would help guide this > decision? I still like the option to leave a handcrafted channels.scm alone; I don't think this imperative variant will be obsolete in the future; it's still useful to experiment quickly, avoiding costly reconfigure just to change some bits about a new offload machine. > The root issue at work behind all these problems is that activation > code only sees the desired target config, rather than the current and > target configs. Comparing the current and target configs would allow > the code to more precisely compute the needd change to move from one > state to the next. I think that could be a good change to make, > though it’s obviously going to be much more involved, and IMO will > require discussion outside the scope of this specific bug. The activation is just a script, so if it's useful to check the current value, it should be OK to do so. > I have a draft patch series I hope to send up soon, but need to get > Guix System up in a VM to test first. It does separate declarative > channels into their own config, but doesn’t do the same for build > machines. While I think many fewer users configure build machines > than channels, it’s probably a good idea to use the same approach for > both channels and machines. Yes, it'd be nice to have a uniform approach. The acl file could benefit from the same treatment, I believe (that, and 'guix authorize' should validate that we're not adding the same key multiple times -- warn something along "key XXXX is already authorized". -- Thanks, Maxim ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2025-01-02 15:03 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2024-08-17 16:40 Impossible to remove all offload machines Ian Eure 2024-09-14 14:55 ` bug#72686: " Maxim Cournoyer 2024-09-15 3:24 ` Ian Eure 2024-09-15 3:53 ` Ian Eure 2024-09-15 19:06 ` Tomas Volf 2024-09-19 0:35 ` Ian Eure 2024-09-22 2:26 ` Maxim Cournoyer 2024-12-01 19:05 ` Ian Eure 2025-01-02 15:02 ` Maxim Cournoyer
Code repositories for project(s) associated with this public inbox https://git.savannah.gnu.org/cgit/guix.git This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).