From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp12.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms5.migadu.com with LMTPS id aDxlGRiQZmLASwAAbAwnHQ (envelope-from ) for ; Mon, 25 Apr 2022 14:12:08 +0200 Received: from aspmx1.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp12.migadu.com with LMTPS id II0bGRiQZmIXZQEAauVa8A (envelope-from ) for ; Mon, 25 Apr 2022 14:12:08 +0200 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 19D8E1A420 for ; Mon, 25 Apr 2022 14:12:08 +0200 (CEST) Received: from localhost ([::1]:47062 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1nixZW-0003Rc-TK for larch@yhetil.org; Mon, 25 Apr 2022 08:12:06 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:39602) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nisFm-0004oQ-CB for guix-science@gnu.org; Mon, 25 Apr 2022 02:31:24 -0400 Received: from mout-p-101.mailbox.org ([2001:67c:2050:0:465::101]:46866) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_CHACHA20_POLY1305:256) (Exim 4.90_1) (envelope-from ) id 1nisFj-0001H2-Tq for guix-science@gnu.org; Mon, 25 Apr 2022 02:31:21 -0400 Received: from smtp2.mailbox.org (smtp2.mailbox.org [IPv6:2001:67c:2050:105:465:1:2:0]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-384) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by mout-p-101.mailbox.org (Postfix) with ESMTPS id 4KmwBq5Rg6z9sZN; Mon, 25 Apr 2022 08:31:15 +0200 (CEST) Date: Mon, 25 Apr 2022 08:31:09 +0200 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=6xq.net; s=MBO0001; t=1650868273; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=7kX6pkfLAf7aqAhPlF7fqEvrjfqmKQXWXHGl/Pb2m6I=; b=LZR+y63xjELTSJa+j+lkNfscdLCwfSS8KHJyn5OT7jvMW08sr9rICBxdgvObtwhcKY70lM ZampKa1FqGm0wzmaLCx7wFMB2Nop58TooQAkA7L/6fBt9wSfKz/zt6+VMBltIwZDXyMPbr qaMnPPuWTixYa8i6kHBBHV5U6/cvCLd+mOs1Brpl1EgFmrqqzHVBgf6VfaMorBZbnlj8OA p01aNzFFFmpdKcn6RNf9h514bTR3uaQ2l2rJZjg1GbJ8Ym7XIrGX9M3EiJRubTc28sV4J1 BRJy3XYXIWTsGMZGFceN6xyBuBHr/dY2nu7LQIecIfYOoAbmXS29mTki/PxMoQ== From: Lars-Dominik Braun To: Zacchaeus Scheffer Cc: guix-science@gnu.org Subject: Re: Freeing Machine Learning with ROCm Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: Received-SPF: pass client-ip=2001:67c:2050:0:465::101; envelope-from=lars@6xq.net; helo=mout-p-101.mailbox.org X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-Mailman-Approved-At: Mon, 25 Apr 2022 08:11:42 -0400 X-BeenThere: guix-science@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-science-bounces+larch=yhetil.org@gnu.org Sender: "Guix-Science" X-Migadu-Flow: FLOW_IN X-Migadu-To: larch@yhetil.org X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1650888728; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:list-id:list-help: list-unsubscribe:list-subscribe:list-post:dkim-signature; bh=7kX6pkfLAf7aqAhPlF7fqEvrjfqmKQXWXHGl/Pb2m6I=; b=QTb6C/KoQSl0500otrzvQdjLO8JnL6BZkozdpiCRmI0H+o19GAVwFk9VP3iHSRswF0dwMS FjFpYZ2hnB49wensYFgt40/e/Gxslrw9zgM1Gfkdn6fl5SN3WAIzX9S5cXdPRoeIFHFtyq MpzCt9/D76pvQFDxd/VuejO2OoYaWEoz5+AujOPJdypaMTohthlt/1p/PB5QRe8wQXDJng AoY8rqTkB6of060LixYLQhIIpjXpznYE0wICbIIQc+ulgnnrgA0nOvji+k7RAG5rs0qwk2 0ExqmheTVKnZd/jkvixPifV56kTGL+T9mPWyqORKEyj83FO3BBz9aHD1a6FDdg== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1650888728; a=rsa-sha256; cv=none; b=c84LiPkUyK/tg0xirLQ+dgsapVm+gvRH9ClYFMEPLATaRJQXiC5Efx9REQvODHL+qPrjYM 1M5psd+dOS8gCrivdAWDizSOXSd3Pdhhz5c2Dk9XkHIzZjy8lHAv8yxca5TA0I2pW+0IEL Lg5sedGeHOWQcI9u6PqX4KoGC9Nh2DtF/yrt65AGNvcrCpMrJgu6o+xxKj/Uc+RsZrcA9A x5GN0u2hUsD8KYL4yv+/QxEdndsugxO8d8Va6WMEWwwjpl7i1e5yJxk1qPjh/b8ovWUohp bra7nMpVZlXRDu/NzeUwlOcHMOQtsatE6AVv8eSL5V0eekHTxeZY/VJzj4QSLQ== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=pass header.d=6xq.net header.s=MBO0001 header.b=LZR+y63x; dmarc=pass (policy=none) header.from=6xq.net; spf=pass (aspmx1.migadu.com: domain of "guix-science-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-science-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: 2.89 Authentication-Results: aspmx1.migadu.com; dkim=pass header.d=6xq.net header.s=MBO0001 header.b=LZR+y63x; dmarc=pass (policy=none) header.from=6xq.net; spf=pass (aspmx1.migadu.com: domain of "guix-science-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-science-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: 19D8E1A420 X-Spam-Score: 2.89 X-Migadu-Scanner: scn1.migadu.com X-TUID: EUgOth8fEfvr Hi Zacchaeus, I packaged ROCm for Guix. > Based on the fact that many ROCm packages exist in guix, and > that I don't see people complain, it seems it must have worked in the > past. Indeed, I am using Guix’ darktable and rocm-opencl-runtime packages for OpenCL-accelerated photo editing. But I’m also doing this on a foreign distribution with a custom kernel (5.15) – not Guix System. > > ROCk module is loaded > > Unable to open /dev/kfd read-write: No such file or directory > > is member of video group Which GPU are you using? Can you see it with `lspci` and does it have the `amdgpu` driver attached? Is the firmware loaded (`dmesg | grep amdgpu`, I’m guessing no, since you use linux-libre)? > In retrospect, could it maybe be that I can use the card without probing it > with rocminfo? It would certainly be nice to be able to check the > temperature (especially so I don't have to leave the fan on full blast) > among other things, but maybe that isn't strictly necessary for doing > machine learning on it? rocminfo does not show the card’s temperature. You need this[1] (unpackaged) tool. Cheers, Lars [1] https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/master/python_smi_tools