High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
-
Updated
Sep 6, 2024 - C++
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Add a description, image, and links to the bamboo-7b topic page so that developers can more easily learn about it.
To associate your repository with the bamboo-7b topic, visit your repo's landing page and select "manage topics."