From: ludo@gnu.org (Ludovic Courtès)
Subject: Re: OpenBLAS and performance
Date: Thu, 21 Dec 2017 15:43:16 +0100
Message-ID: <87tvwk5egr.fsf@gnu.org>
References: <20171219104956.GB806@thebird.nl> <87tvwl7h4w.fsf@albion.it.manchester.ac.uk> <87h8sl78vp.fsf@albion.it.manchester.ac.uk> <20171220172215.GA7926@thebird.nl> <87d139xo3v.fsf@elephly.net> <20171220192802.GA8426@thebird.nl>
In-Reply-To: <20171220192802.GA8426@thebird.nl> (Pjotr Prins's message of "Wed, 20 Dec 2017 20:28:02 +0100")
List-Id: "Development of GNU Guix and the GNU System distribution."
To: Pjotr Prins
Cc: guix-devel@gnu.org, Dave Love

Pjotr Prins wrote:

> On Wed, Dec 20, 2017 at 07:15:16PM +0100, Ricardo Wurmus wrote:

[...]

>> The FAQ document[1] says this:
>>
>>   The environment variable which control the kernel selection is
>>   OPENBLAS_CORETYPE (see driver/others/dynamic.c) e.g. export
>>   OPENBLAS_CORETYPE=Haswell. And the function char*
>>   openblas_get_corename() returns the used target.
>>
>> [1]: https://github.com/xianyi/OpenBLAS/wiki/Faq
>>
>> Have you tried this and compared the performance?
>
> About 10x difference on 24+ cores for matrix multiplication (my
> version vs what comes with Guix).

Even when you use OPENBLAS_CORETYPE=haswell (lower-case?)?
That would be surprising: it’s the same code after all.  The only
difference should be what happens at load time.

Ludo’.
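
One way to check which kernel was actually picked is to ask the library
itself through openblas_get_corename(), the function named in the FAQ
excerpt quoted above.  The following is only a minimal sketch: it assumes
a shared OpenBLAS that the loader can find under the name "openblas", and
the helper name corename() is made up for illustration.  Since the
selection is expected to happen at load time, OPENBLAS_CORETYPE has to be
set in the environment before the library is loaded.

```python
import ctypes
import ctypes.util


def corename():
    """Return the core name reported by OpenBLAS, or None if unavailable.

    Hypothetical helper: it only wraps openblas_get_corename(), the
    function mentioned in the OpenBLAS FAQ.
    """
    # Assumption: the shared library is discoverable as "openblas".
    path = ctypes.util.find_library("openblas")
    if path is None:
        return None
    try:
        lib = ctypes.CDLL(path)
        func = lib.openblas_get_corename
    except (OSError, AttributeError):
        # The library failed to load, or it does not export the symbol.
        return None
    # The C function takes no arguments and returns a char*.
    func.argtypes = []
    func.restype = ctypes.c_char_p
    raw = func()
    return raw.decode() if raw is not None else None


if __name__ == "__main__":
    print(corename())
```

Saved as, say, corename.py, it could be run once as "python3 corename.py"
and once as "OPENBLAS_CORETYPE=Haswell python3 corename.py" against each of
the two builds, to see whether they end up reporting the same core before
comparing timings.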