From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp1 ([2001:41d0:2:bcc0::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms0.migadu.com with LMTPS id uJ8iEvptrmAVwAAAgWs5BA (envelope-from ) for ; Wed, 26 May 2021 17:49:14 +0200 Received: from aspmx1.migadu.com ([2001:41d0:2:bcc0::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp1 with LMTPS id OFrTDfptrmATJAAAbx9fmQ (envelope-from ) for ; Wed, 26 May 2021 15:49:14 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id EBEDAC568 for ; Wed, 26 May 2021 17:49:12 +0200 (CEST) Received: from localhost ([::1]:46172 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1llvmO-0008Ji-7k for larch@yhetil.org; Wed, 26 May 2021 11:49:08 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:57964) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1llvmH-0008JP-UK for bug-guix@gnu.org; Wed, 26 May 2021 11:49:01 -0400 Received: from debbugs.gnu.org ([209.51.188.43]:38211) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1llvmH-00026p-N9 for bug-guix@gnu.org; Wed, 26 May 2021 11:49:01 -0400 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1llvmH-0001sV-L4 for bug-guix@gnu.org; Wed, 26 May 2021 11:49:01 -0400 X-Loop: help-debbugs@gnu.org Subject: bug#41625: [PATCH v2] offload: Handle a possible EOF response from read-repl-response. Resent-From: Marius Bakke Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Wed, 26 May 2021 15:49:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 41625 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: Maxim Cournoyer , Ludovic =?UTF-8?Q?Court=C3=A8s?= Received: via spool by 41625-submit@debbugs.gnu.org id=B41625.16220441237188 (code B ref 41625); Wed, 26 May 2021 15:49:01 +0000 Received: (at 41625) by debbugs.gnu.org; 26 May 2021 15:48:43 +0000 Received: from localhost ([127.0.0.1]:49757 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1llvlz-0001rr-HQ for submit@debbugs.gnu.org; Wed, 26 May 2021 11:48:43 -0400 Received: from eggs.gnu.org ([209.51.188.92]:35964) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1llvly-0001rf-3w for 41625@debbugs.gnu.org; Wed, 26 May 2021 11:48:42 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:37940) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1llvls-0001qI-5s; Wed, 26 May 2021 11:48:36 -0400 Received: from host-37-191-231-185.lynet.no ([37.191.231.185]:48442 helo=localhost) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1llvlr-00075C-K5; Wed, 26 May 2021 11:48:35 -0400 From: Marius Bakke In-Reply-To: <87mtsikwsm.fsf_-_@gmail.com> References: <87mtsky9um.fsf@gmail.com> <20210525155003.27590-1-maxim.cournoyer@gmail.com> <875yz61rvt.fsf@gnu.org> <87mtsikwsm.fsf_-_@gmail.com> Date: Wed, 26 May 2021 17:48:32 +0200 Message-ID: <87h7ipa433.fsf@gnu.org> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-guix@gnu.org List-Id: Bug reports for GNU Guix List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: 41625@debbugs.gnu.org Errors-To: bug-guix-bounces+larch=yhetil.org@gnu.org Sender: "bug-Guix" X-Migadu-Flow: FLOW_IN ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1622044154; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:resent-cc:resent-from:resent-sender: resent-message-id:in-reply-to:in-reply-to:references:references: list-id:list-help:list-unsubscribe:list-subscribe:list-post; bh=mqGuAnXkgCWKopH/OUVq38owdnTPBu2CARXr1F3tTGc=; b=iTNDqcsflJOJaKaba/jsL2DeBO0fiAj2l8o6IxSTQIHpJFbEFiZD8iTqWpVxANOK30MqV6 DL/jeQqbdRnG0AHatRycVjfO0uZX/cntjAmkBYSrhxWeKRFWSy0/732VoNbhqIasKCw+FP uNdmZL8BX8v3bTT1Zyf3TXEMWPTH25x9fQ0KM4eQdcop+UpwDQMb3QREXLInBrCprPrjgv cMXxO6D9jrLM6iwETCWD3EFOjFshAixmy/OvRonVzEYfpT0xpfbxzymJISq44pkj3beWX8 SmC9kKMwmo5U2HtflHV/AMGA05T3fE9xtZCEwbFY7N2TeoPewPy/5P/5kJI6pg== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1622044154; a=rsa-sha256; cv=none; b=JFkrER320FJWBBRSTQOtj0OUIOJTFs4yo6KI5ywbeMojBHUyhTclXKmdIGMU8+nFKcns9b j7BuXyisqkWOnB+R5M7QN021njrwLWThjKi1JCMkkWVr0W7DVkB2Z6kDR4j18Imk3nsPaV eAciRS6d3IJYfYZ/QybqPpCVD3GUTRpUjgVBQFWtvUlbnJ6q+RaB0LPRYvyhBoi8IXpPPt pB7cgcMsON3N5X6VKiaWwk/NMAO7N9ebsvjf6SHqaoo3sWEiMz61b6rvjOPYsEJttd65EK sXDkgSCutcZDvxYzm9ECum+/+xLycNmsthivCN3PjKZWex3+c+xtaG98/OBeCw== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=none; dmarc=pass (policy=none) header.from=gnu.org; spf=pass (aspmx1.migadu.com: domain of bug-guix-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=bug-guix-bounces@gnu.org X-Migadu-Spam-Score: -5.03 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=pass (policy=none) header.from=gnu.org; spf=pass (aspmx1.migadu.com: domain of bug-guix-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=bug-guix-bounces@gnu.org X-Migadu-Queue-Id: EBEDAC568 X-Spam-Score: -5.03 X-Migadu-Scanner: scn1.migadu.com X-TUID: YKNpUXXSEPTp --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Maxim Cournoyer skriver: >> Is running =E2=80=98guix offload test /etc/guix/machines.scm overdrive1= =E2=80=99 on >> berlin enough to reproduce the issue? If so, we could monitor/strace >> sshd on overdrive1 to get a better understanding of what=E2=80=99s going= on. > > It's actually difficult to trigger it; it seems to happen mostly on the > first try after a long time without connecting to the machine; on the > 2nd and later tries, everything is smooth. Waiting a few minutes is not > enough to re-trigger the problem. > > I've managed to see the problem a few lucky times with: > > --8<---------------cut here---------------start------------->8--- > while true; do guix offload test /etc/guix/machines.scm overdrive1; done > --8<---------------cut here---------------end--------------->8--- I used to be able to reproduce it by inducing a high load on the target machine and just let Guix keep trying to connect. But now I did that, and set overload threshold to 0.0 for good measure, and Guix has been waiting patiently for two hours without failure. So AFAICT this bug has been fixed. Perhaps Berlin or the Overdrive simply needs to be updated? --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iIUEARYKAC0WIQRNTknu3zbaMQ2ddzTocYulkRQQdwUCYK5t0A8cbWFyaXVzQGdu dS5vcmcACgkQ6HGLpZEUEHcy7QEAlX2skErbZBDILToutLkOSP6mLf3xK2tdqBcO ttgbpX0BANb7F5k14C9HhyQBYoexMCS9Mydjfx43TLIeAuKw2gkL =41ZX -----END PGP SIGNATURE----- --=-=-=--