
python312Packages.llama-cpp-python: init at 0.3.1 #349657

Open

kirillrdy wants to merge 3 commits into master

Conversation

@kirillrdy (Member) commented Oct 18, 2024

Unlike the previous attempt #268712, this uses the bundled version of llama-cpp; CUDA support has also been tested.

Things done

  • Built on platform(s)
    • x86_64-linux
    • aarch64-linux
    • x86_64-darwin
    • aarch64-darwin
  • For non-Linux: Is sandboxing enabled in nix.conf? (See Nix manual)
    • sandbox = relaxed
    • sandbox = true
  • Tested, as applicable:
  • Tested compilation of all packages that depend on this change using nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD". Note: all changes have to be committed, also see nixpkgs-review usage
  • Tested basic functionality of all binary files (usually in ./result/bin/)
  • 24.11 Release Notes (or backporting 23.11 and 24.05 Release notes)
    • (Package updates) Added a release notes entry if the change is major or breaking
    • (Module updates) Added a release notes entry if the change is significant
    • (Module addition) Added a release notes entry if adding a new NixOS module
  • Fits CONTRIBUTING.md.

Add a 👍 reaction to pull requests you find important.

@hoh commented Oct 23, 2024

Thanks for the work!

Can you explain the motivation behind switching to the bundled version of llama-cpp?

The package looks much simpler than the previous version, which used patches on llama-cpp, but it lacks support for other backends (OpenCL, ROCm).

@alexvorobiev (Contributor) commented Oct 23, 2024

Trying to build with CUDA results in a version error:

llama-cpp-python>   /nix/store/slx40i35cmd7kb3wvdqzckfww8smcy6s-cuda_nvcc-12.2.140/include/crt/host_config.h:143:2:
llama-cpp-python>   error: #error -- unsupported GNU version! gcc versions later than 12 are
llama-cpp-python>   not supported! The nvcc flag '-allow-unsupported-compiler' can be used to
llama-cpp-python>   override this version check; however, using an unsupported host compiler
llama-cpp-python>   may cause compilation failure or incorrect run time execution.  Use at your
llama-cpp-python>   own risk.

The usual workaround

....overridePythonAttrs(attrs: { stdenv = super.gcc12Stdenv; });

works, so it should be possible to add the override to the package.
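As a sketch of where that override could live, assuming a classic self/super overlay and the python312 package set from this PR (the attribute path is an assumption, not the final packaging):

  # Overlay sketch: force the GCC 12 stdenv for llama-cpp-python only,
  # so nvcc's host-compiler version check passes under CUDA 12.2.
  self: super: {
    python312 = super.python312.override {
      packageOverrides = pySelf: pySuper: {
        llama-cpp-python = pySuper.llama-cpp-python.overridePythonAttrs (attrs: {
          stdenv = super.gcc12Stdenv;
        });
      };
    };
  }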

@alexvorobiev (Contributor) commented Oct 23, 2024

> Thanks for the work!
>
> Can you explain the motivation behind switching to the bundled version of llama-cpp?
>
> The package looks much simpler than the previous version, which used patches on llama-cpp, but it lacks support for other backends (OpenCL, ROCm).

I am not the author, but the code refers to llama.cpp subdirectories which are not included in nixpkgs' llama-cpp. For instance https://github.com/abetlen/llama-cpp-python/blob/7403e002b8e033c0a34e93fba2b311e2118487fe/CMakeLists.txt#L110.

@kirillrdy (Member, Author)

@hoh

> Can you explain the motivation behind switching to the bundled version of llama-cpp?

llama-cpp and llama-cpp-python often get out of sync, so using llama-cpp from nixpkgs requires either breaking llama-cpp-python or holding back llama-cpp updates.

@alexvorobiev CUDA support builds without any overrides. Do you change the default CUDA in your overlays?
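
For reference, "changing the default CUDA in an overlay" would typically look something like this hypothetical pin, which affects every CUDA-consuming package in the tree:

  # Hypothetical overlay pinning the default CUDA package set to 12.2.
  self: super: {
    cudaPackages = super.cudaPackages_12_2;
  }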

@alexvorobiev (Contributor) commented Oct 25, 2024

> @alexvorobiev CUDA support builds without any overrides. Do you change the default CUDA in your overlays?

I have to use CUDA 12.2 for now; could that be the issue?

@kirillrdy (Member, Author)

> @alexvorobiev CUDA support builds without any overrides. Do you change the default CUDA in your overlays?
>
> I have to use CUDA 12.2 for now; could that be the issue?

Yes, it seems to only build with 12.4 (which is the default in nixpkgs).

@kirillrdy (Member, Author)

@alexvorobiev I've fixed the CUDA build; tested with 12_3, 12_2, 12_1, 12_0, and 11_8 (stopped there).
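
One plausible way to reproduce such a per-version test (a sketch; the cudaPackages override argument assumes the derivation follows the usual nixpkgs convention of taking cudaPackages as an input):

  # Build against CUDA 12.2 instead of the default; CUDA requires
  # allowUnfree, and cudaSupport enables the CUDA code paths.
  nix-build -E 'with import ./. { config = { allowUnfree = true; cudaSupport = true; }; };
    python312Packages.llama-cpp-python.override { cudaPackages = cudaPackages_12_2; }'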

@alexvorobiev (Contributor)

> @alexvorobiev I've fixed the CUDA build; tested with 12_3, 12_2, 12_1, 12_0, and 11_8 (stopped there).

Thank you!
