* [bug#70175] [PATCH] gnu: llama-cpp: support OpenBLAS for faster prompt processing
@ 2024-04-04 3:46 John Fremlin via Guix-patches via
2024-04-05 11:35 ` bug#70175: " Christopher Baines
0 siblings, 1 reply; 2+ messages in thread
From: John Fremlin via Guix-patches via @ 2024-04-04 3:46 UTC (permalink / raw)
To: 70175; +Cc: John Fremlin
OpenBLAS is recommended by https://github.com/ggerganov/llama.cpp
Change-Id: Iaf6f22252da13e2d6f503992878b35b0da7de0aa
---
gnu/packages/machine-learning.scm | 5 ++++-
1 file changed, 4 insertions(+), 1 deletion(-)
diff --git a/gnu/packages/machine-learning.scm b/gnu/packages/machine-learning.scm
index 225bff0ca2..ea3674ce3e 100644
--- a/gnu/packages/machine-learning.scm
+++ b/gnu/packages/machine-learning.scm
@@ -542,6 +542,8 @@ (define-public llama-cpp
(build-system cmake-build-system)
(arguments
(list
+ #:configure-flags
+ '(list "-DLLAMA_BLAS=ON" "-DLLAMA_BLAS_VENDOR=OpenBLAS")
#:modules '((ice-9 textual-ports)
(guix build utils)
((guix build python-build-system) #:prefix python:)
@@ -576,8 +578,9 @@ (define-public llama-cpp
(lambda _
(copy-file "bin/main" (string-append #$output "/bin/llama")))))))
(inputs (list python))
+ (native-inputs (list pkg-config))
(propagated-inputs
- (list python-numpy python-pytorch python-sentencepiece))
+ (list python-numpy python-pytorch python-sentencepiece openblas))
(home-page "https://github.com/ggerganov/llama.cpp")
(synopsis "Port of Facebook's LLaMA model in C/C++")
(description "This package provides a port to Facebook's LLaMA collection
base-commit: 1441a205b1ebb610ecfae945b5770734cbe8478c
--
2.41.0
^ permalink raw reply related [flat|nested] 2+ messages in thread
* bug#70175: [PATCH] gnu: llama-cpp: support OpenBLAS for faster prompt processing
2024-04-04 3:46 [bug#70175] [PATCH] gnu: llama-cpp: support OpenBLAS for faster prompt processing John Fremlin via Guix-patches via
@ 2024-04-05 11:35 ` Christopher Baines
0 siblings, 0 replies; 2+ messages in thread
From: Christopher Baines @ 2024-04-05 11:35 UTC (permalink / raw)
To: John Fremlin via Guix-patches via; +Cc: John Fremlin, 70175-done
[-- Attachment #1: Type: text/plain, Size: 426 bytes --]
John Fremlin via Guix-patches via <guix-patches@gnu.org> writes:
> OpenBLAS is recommended by https://github.com/ggerganov/llama.cpp
>
> Change-Id: Iaf6f22252da13e2d6f503992878b35b0da7de0aa
> ---
> gnu/packages/machine-learning.scm | 5 ++++-
> 1 file changed, 4 insertions(+), 1 deletion(-)
Looks good to me, I tweaked the commit message a bit and pushed this to
master as d8a63bbcee616f224c10462dbfb117ec009c50d8.
Chris
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 987 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2024-04-05 11:36 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-04-04 3:46 [bug#70175] [PATCH] gnu: llama-cpp: support OpenBLAS for faster prompt processing John Fremlin via Guix-patches via
2024-04-05 11:35 ` bug#70175: " Christopher Baines
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/guix.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).