From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp0 ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms11 with LMTPS id kEqaO2L+kl8MFAAA0tVLHw (envelope-from ) for ; Fri, 23 Oct 2020 16:01:38 +0000 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp0 with LMTPS id uMz4NmL+kl+jMgAA1q6Kng (envelope-from ) for ; Fri, 23 Oct 2020 16:01:38 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 1F0CF9402A8 for ; Fri, 23 Oct 2020 16:01:38 +0000 (UTC) Received: from localhost ([::1]:35094 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kVzVX-0007jf-K4 for larch@yhetil.org; Fri, 23 Oct 2020 12:01:35 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:59954) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kVyy6-00036A-PQ for guix-patches@gnu.org; Fri, 23 Oct 2020 11:27:02 -0400 Received: from debbugs.gnu.org ([209.51.188.43]:46483) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kVyy6-0001SA-Cg for guix-patches@gnu.org; Fri, 23 Oct 2020 11:27:02 -0400 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1kVyy6-0005XD-7l for guix-patches@gnu.org; Fri, 23 Oct 2020 11:27:02 -0400 X-Loop: help-debbugs@gnu.org Subject: [bug#39588] gnu: Add mpich, scalapack-mpich, mumps-mpich, pt-scotch-mpich, python-mpi4py-mpich Resent-From: Ludovic =?UTF-8?Q?Court=C3=A8s?= Original-Sender: "Debbugs-submit" Resent-CC: guix-patches@gnu.org Resent-Date: Fri, 23 Oct 2020 15:27:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 39588 X-GNU-PR-Package: guix-patches X-GNU-PR-Keywords: To: Maurice =?UTF-8?Q?Br=C3=A9mond?= Cc: 39588@debbugs.gnu.org, zimoun Received: via spool by 39588-submit@debbugs.gnu.org id=B39588.160346680721251 (code B ref 39588); Fri, 23 Oct 2020 15:27:02 +0000 Received: (at 39588) by debbugs.gnu.org; 23 Oct 2020 15:26:47 +0000 Received: from localhost ([127.0.0.1]:58029 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kVyxq-0005Wh-M6 for submit@debbugs.gnu.org; Fri, 23 Oct 2020 11:26:46 -0400 Received: from mail2-relais-roc.national.inria.fr ([192.134.164.83]:37750) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kVyxo-0005WO-E7 for 39588@debbugs.gnu.org; Fri, 23 Oct 2020 11:26:45 -0400 X-IronPort-AV: E=Sophos;i="5.77,408,1596492000"; d="scan'208";a="474076465" Received: from 91-160-117-201.subs.proxad.net (HELO ribbon) ([91.160.117.201]) by mail2-relais-roc.national.inria.fr with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 23 Oct 2020 17:26:37 +0200 From: Ludovic =?UTF-8?Q?Court=C3=A8s?= References: <87blq2rclk.fsf@inria.fr> <87o8tx3z2q.fsf@gnu.org> <87eeupd3t1.fsf@gnu.org> <861rhz1d7b.fsf@gmail.com> <87o8l28qjh.fsf@gnu.org> <87lfg2pbv7.fsf@inria.fr> <87v9f4bowq.fsf@gnu.org> <87v9f12so9.fsf@inria.fr> X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 2 Brumaire an 229 de la =?UTF-8?Q?R=C3=A9volution?= X-PGP-Key-ID: 0x090B11993D9AEBB5 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4 0CFB 090B 1199 3D9A EBB5 X-OS: x86_64-pc-linux-gnu Date: Fri, 23 Oct 2020 17:26:36 +0200 In-Reply-To: <87v9f12so9.fsf@inria.fr> ("Maurice =?UTF-8?Q?Br=C3=A9mond?="'s message of "Fri, 23 Oct 2020 11:33:10 +0200") Message-ID: <873625m09f.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="=-=-=" X-Spam-Score: -5.0 (-----) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-Spam-Score: -6.0 (------) X-BeenThere: guix-patches@gnu.org List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-patches-bounces+larch=yhetil.org@gnu.org Sender: "Guix-patches" X-Scanner: scn0 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of guix-patches-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=guix-patches-bounces@gnu.org X-Spam-Score: 0.49 X-TUID: TQUTBrHKY0BD --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Hi Maurice, Maurice Br=C3=A9mond skribis: > Apparently at the mpich configuration level, using the experimental > device ch4 instead of ch3 solves the problem : just remove comment on > "--with-device=3Dch4:ucx". Reversely, with mpich 3.4a2 (for which ch4 is > de default) setting --with-device=3Dch3 leads to the same failure as with > 3.3.2. Nice, we have a way forward. With the patch below, I have successfully built: guix build mumps-openmpi --with-input=3Dopenmpi=3Dmpich and I confirm that despite the name it depends exclusively on MPICH. :-) If that=E2=80=99s fine with you I=E2=80=99ll go ahead and commit it; let me= know! > I also checked sock channel for ch3 : with-device=3Dch3:sock, but then on > my laptop, scotch tests hang at > > mpirun -n 3 ./test_scotch_dgraph_check data/bump.grf > > For the moment, there isn't a stable 3.4 version yet for mpich. I had a > try with the latest 3.4b1 but a test failed... We=E2=80=99ll see, but having a solution that works with 3.3 and is likely = to work with 3.4 is good. I guess we should also check whether we=E2=80=99re obtaining the expected performance. This builds fine too: guix build intel-mpi-benchmarks --with-input=3Dopenmpi=3Dmpich Thank you! Ludo=E2=80=99. --=-=-= Content-Type: text/x-patch Content-Disposition: inline diff --git a/gnu/packages/mpi.scm b/gnu/packages/mpi.scm index 06a82cce95..9035147441 100644 --- a/gnu/packages/mpi.scm +++ b/gnu/packages/mpi.scm @@ -436,7 +436,12 @@ arrays) that expose a buffer interface.") `(#:configure-flags (list "--disable-silent-rules" ;let's see what's happening "--enable-debuginfo" - ;; "--with-device=ch4:ucx" ; --with-device=ch4:ofi segfaults in tests + + ;; Default to "ch4", as will be the case in 3.4. It also works + ;; around issues when running test suites of packages that use + ;; MPICH: . + "--with-device=ch4:ucx" ; --with-device=ch4:ofi segfaults in tests + (string-append "--with-hwloc-prefix=" (assoc-ref %build-inputs "hwloc")) --=-=-=--