* bug#70877: guix-daemon fails to copy 4+GB file to store
@ 2024-05-11 10:52 Ricardo Wurmus
2024-05-12 7:12 ` Efraim Flashner
2024-05-13 10:10 ` Ludovic Courtès
0 siblings, 2 replies; 6+ messages in thread
From: Ricardo Wurmus @ 2024-05-11 10:52 UTC (permalink / raw)
To: 70877; +Cc: ludo
[-- Attachment #1: Type: text/plain, Size: 727 bytes --]
The guix-daemon's libutil/util.cc uses copy_file_range to copy a
downloaded file into the store. copy_file_range fails on files larger
than 4GB with an error like this:
guix build: error: short write in copy_file_range `15' to `16': No such file or directory
The man page for copy_file_range says that it could return EFBIG when
the range exceeds the maximum range. The daemon code does not check any
limits and will attempt to copy the whole file.
I believe our code ought to check the value of st.size and fall back to
a boring copy if it exceeds some "reasonable" value.
This is where copy_file_range is used:
https://git.savannah.gnu.org/cgit/guix.git/tree/nix/libutil/util.cc#n382
Here is a little reproducer:
[-- Attachment #2: bug.scm --]
[-- Type: text/plain, Size: 425 bytes --]
(use-modules (guix download)
(guix packages)
(guix build-system trivial))
(package
(name "chungus")
(version "1")
(source
(origin
(method url-fetch)
(uri "http://localhost:1111/chungus")
(sha256
(base32 "0nx67d4ls2nfwcfdmg81vf240z6lpwpdqypssr1wzn3hyz4szci4"))))
(build-system trivial-build-system)
(home-page "")
(synopsis "")
(description "")
(license #f))
[-- Attachment #3: Type: text/plain, Size: 468 bytes --]
--8<---------------cut here---------------start------------->8---
# generate a big file
dd bs=1M count=4096 if=/dev/zero of=/tmp/chungus
# serve it
guix shell woof -- woof -i 127.0.0.1 -p 1111 -c 1 /tmp/chungus
# build the source derivation
guix build --no-grafts -Sf bug.scm
# observe the error
# guix build: error: short write in copy_file_range `15' to `16': No such file or directory
--8<---------------cut here---------------end--------------->8---
--
Ricardo
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#70877: guix-daemon fails to copy 4+GB file to store
2024-05-11 10:52 bug#70877: guix-daemon fails to copy 4+GB file to store Ricardo Wurmus
@ 2024-05-12 7:12 ` Efraim Flashner
2024-05-13 10:10 ` Ludovic Courtès
1 sibling, 0 replies; 6+ messages in thread
From: Efraim Flashner @ 2024-05-12 7:12 UTC (permalink / raw)
To: Ricardo Wurmus; +Cc: ludo, 70877
[-- Attachment #1: Type: text/plain, Size: 2191 bytes --]
On Sat, May 11, 2024 at 12:52:53PM +0200, Ricardo Wurmus wrote:
> The guix-daemon's libutil/util.cc uses copy_file_range to copy a
> downloaded file into the store. copy_file_range fails on files larger
> than 4GB with an error like this:
>
> guix build: error: short write in copy_file_range `15' to `16': No such file or directory
>
> The man page for copy_file_range says that it could return EFBIG when
> the range exceeds the maximum range. The daemon code does not check any
> limits and will attempt to copy the whole file.
>
> I believe our code ought to check the value of st.size and fall back to
> a boring copy if it exceeds some "reasonable" value.
>
> This is where copy_file_range is used:
> https://git.savannah.gnu.org/cgit/guix.git/tree/nix/libutil/util.cc#n382
>
> Here is a little reproducer:
>
> (use-modules (guix download)
> (guix packages)
> (guix build-system trivial))
>
> (package
> (name "chungus")
> (version "1")
> (source
> (origin
> (method url-fetch)
> (uri "http://localhost:1111/chungus")
> (sha256
> (base32 "0nx67d4ls2nfwcfdmg81vf240z6lpwpdqypssr1wzn3hyz4szci4"))))
> (build-system trivial-build-system)
> (home-page "")
> (synopsis "")
> (description "")
> (license #f))
>
> --8<---------------cut here---------------start------------->8---
> # generate a big file
> dd bs=1M count=4096 if=/dev/zero of=/tmp/chungus
> # serve it
> guix shell woof -- woof -i 127.0.0.1 -p 1111 -c 1 /tmp/chungus
> # build the source derivation
> guix build --no-grafts -Sf bug.scm
> # observe the error
> # guix build: error: short write in copy_file_range `15' to `16': No such file or directory
> --8<---------------cut here---------------end--------------->8---
>
This sounds like a similar failure to bug 65714 that I ran into with
guix copy, but I wasn't able to diagnose it.
https://issues.guix.gnu.org/65714
--
Efraim Flashner <efraim@flashner.co.il> רנשלפ םירפא
GPG key = A28B F40C 3E55 1372 662D 14F7 41AA E7DC CA3D 8351
Confidentiality cannot be guaranteed on emails sent or received unencrypted
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#70877: guix-daemon fails to copy 4+GB file to store
2024-05-11 10:52 bug#70877: guix-daemon fails to copy 4+GB file to store Ricardo Wurmus
2024-05-12 7:12 ` Efraim Flashner
@ 2024-05-13 10:10 ` Ludovic Courtès
2024-05-13 12:09 ` Ricardo Wurmus
1 sibling, 1 reply; 6+ messages in thread
From: Ludovic Courtès @ 2024-05-13 10:10 UTC (permalink / raw)
To: Ricardo Wurmus; +Cc: 70877
[-- Attachment #1: Type: text/plain, Size: 1640 bytes --]
Hi,
Thanks for the bug report and nice reproducer!
Ricardo Wurmus <rekado@elephly.net> skribis:
> The guix-daemon's libutil/util.cc uses copy_file_range to copy a
> downloaded file into the store. copy_file_range fails on files larger
> than 4GB with an error like this:
>
> guix build: error: short write in copy_file_range `15' to `16': No such file or directory
>
> The man page for copy_file_range says that it could return EFBIG when
> the range exceeds the maximum range. The daemon code does not check any
> limits and will attempt to copy the whole file.
>
> I believe our code ought to check the value of st.size and fall back to
> a boring copy if it exceeds some "reasonable" value.
The goal leading to this error message looks like this:
copy_file_range(15, NULL, 16, NULL, 4294967297, 0) = 2147479552
… which is precisely 2 GiB - 4 KiB.
Reading the man page, it’s entirely fine: like ‘write’,
‘copy_file_range’ might copy less than asked for, so it’s really a
mistake of mine to assume that short writes can’t happen. Presumably
there’s an internal limit here we’re reaching that explains why it won’t
copy more than 2 GiB at once.
With the following change, we get:
newfstatat(15, "", {st_mode=S_IFREG|0644, st_size=4294967297, ...}, AT_EMPTY_PATH) = 0
copy_file_range(15, NULL, 16, NULL, 4294967297, 0) = 2147479552
copy_file_range(15, NULL, 16, NULL, 2147487745, 0) = 2147479552
copy_file_range(15, NULL, 16, NULL, 8193, 0) = 8193
fchown(16, 30001, 30000) = 0
Could you confirm that it works for you?
Thanks,
Ludo’.
[-- Attachment #2: the patch --]
[-- Type: text/x-patch, Size: 1783 bytes --]
From efd9f3383756df9959651125c0f2e2e769630851 Mon Sep 17 00:00:00 2001
Message-ID: <efd9f3383756df9959651125c0f2e2e769630851.1715594931.git.ludo@gnu.org>
From: =?UTF-8?q?Ludovic=20Court=C3=A8s?= <ludo@gnu.org>
Date: Mon, 13 May 2024 12:02:30 +0200
Subject: [PATCH] =?UTF-8?q?daemon:=20Loop=20over=20=E2=80=98copy=5Ffile=5F?=
=?UTF-8?q?range=E2=80=99=20upon=20short=20writes.?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Fixes <https://issues.guix.gnu.org/70877>.
* nix/libutil/util.cc (copyFile): Loop over ‘copy_file_range’ instead of
throwing upon short write.
Reported-by: Ricardo Wurmus <rekado@elephly.net>
Change-Id: Id7b8a65ea59006c2d91bc23732309a68665b9ca0
---
nix/libutil/util.cc | 11 ++++++++---
1 file changed, 8 insertions(+), 3 deletions(-)
diff --git a/nix/libutil/util.cc b/nix/libutil/util.cc
index 578d6572934..3206dea11b1 100644
--- a/nix/libutil/util.cc
+++ b/nix/libutil/util.cc
@@ -397,9 +397,14 @@ static void copyFile(int sourceFd, int destinationFd)
} else {
if (result < 0)
throw SysError(format("copy_file_range `%1%' to `%2%'") % sourceFd % destinationFd);
- if (result < st.st_size)
- throw SysError(format("short write in copy_file_range `%1%' to `%2%'")
- % sourceFd % destinationFd);
+
+ /* If 'copy_file_range' copied less than requested, try again. */
+ for (ssize_t copied = result; copied < st.st_size; copied += result) {
+ result = copy_file_range(sourceFd, NULL, destinationFd, NULL,
+ st.st_size - copied, 0);
+ if (result < 0)
+ throw SysError(format("copy_file_range `%1%' to `%2%'") % sourceFd % destinationFd);
+ }
}
}
base-commit: 89cd778f6a45cd9b43a4dc1f236dcd0a87af955c
--
2.41.0
^ permalink raw reply related [flat|nested] 6+ messages in thread
* bug#70877: guix-daemon fails to copy 4+GB file to store
2024-05-13 10:10 ` Ludovic Courtès
@ 2024-05-13 12:09 ` Ricardo Wurmus
2024-05-13 14:34 ` Ludovic Courtès
0 siblings, 1 reply; 6+ messages in thread
From: Ricardo Wurmus @ 2024-05-13 12:09 UTC (permalink / raw)
To: Ludovic Courtès; +Cc: 70877
Ludovic Courtès <ludo@gnu.org> writes:
> Could you confirm that it works for you?
I've applied this locally, started the new daemon, and used it to build
the 4+GB source code derivation of a big package that used to fail
before. It works now. Thank you!
--
Ricardo
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#70877: guix-daemon fails to copy 4+GB file to store
2024-05-13 12:09 ` Ricardo Wurmus
@ 2024-05-13 14:34 ` Ludovic Courtès
2024-05-13 16:24 ` Ludovic Courtès
0 siblings, 1 reply; 6+ messages in thread
From: Ludovic Courtès @ 2024-05-13 14:34 UTC (permalink / raw)
To: Ricardo Wurmus; +Cc: 70877
Ricardo Wurmus <rekado@elephly.net> skribis:
> Ludovic Courtès <ludo@gnu.org> writes:
>
>> Could you confirm that it works for you?
>
> I've applied this locally, started the new daemon, and used it to build
> the 4+GB source code derivation of a big package that used to fail
> before. It works now. Thank you!
Pushed as 7757fdd491862fa5c33f1f894503346b89898a01.
I’ll update the ‘guix’ package to make the fix available.
Thanks for testing!
Ludo’.
^ permalink raw reply [flat|nested] 6+ messages in thread
* bug#70877: guix-daemon fails to copy 4+GB file to store
2024-05-13 14:34 ` Ludovic Courtès
@ 2024-05-13 16:24 ` Ludovic Courtès
0 siblings, 0 replies; 6+ messages in thread
From: Ludovic Courtès @ 2024-05-13 16:24 UTC (permalink / raw)
To: Ricardo Wurmus; +Cc: 70877-done
Ludovic Courtès <ludo@gnu.org> skribis:
> Pushed as 7757fdd491862fa5c33f1f894503346b89898a01.
>
> I’ll update the ‘guix’ package to make the fix available.
Done in 58be9a79e2862d5fa9842d73f498ce2e5442b9ce.
Ludo'.
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2024-05-13 16:25 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-05-11 10:52 bug#70877: guix-daemon fails to copy 4+GB file to store Ricardo Wurmus
2024-05-12 7:12 ` Efraim Flashner
2024-05-13 10:10 ` Ludovic Courtès
2024-05-13 12:09 ` Ricardo Wurmus
2024-05-13 14:34 ` Ludovic Courtès
2024-05-13 16:24 ` Ludovic Courtès
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/guix.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).