
llama.cpp / GGUF support #7

Open
sammcj opened this issue Sep 4, 2024 · 7 comments

sammcj commented Sep 4, 2024

It would be great to see llama.cpp/GGUF support for OLMoE (OlmoeForCausalLM).

Really neat project!

@AmitKKhanchandani

+1, need to try this with ollama :)

Bobetele commented Sep 9, 2024

yes

@Muennighoff
Collaborator

I won't have bandwidth to do this, but if anyone is interested, that'd be amazing!

@MrDowntempo

Yeah, this is hard to work with if it isn't in GGUF format to run locally, or available from Ollama directly. I'm looking into how to serve it from Safetensors, but not many servers support that format.
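
For anyone else trying this in the meantime, here's a minimal sketch of serving the Safetensors checkpoint directly with Hugging Face transformers. It assumes a transformers release that ships OlmoeForCausalLM and assumes allenai/OLMoE-1B-7B-0924 as the checkpoint id; adjust both to whatever you actually have:

```python
# Minimal sketch: load the Safetensors checkpoint with transformers and
# generate locally. The model id and the availability of OlmoeForCausalLM
# in your transformers version are assumptions, not confirmed details.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMoE-1B-7B-0924"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "Mixture-of-experts models are"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```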

@Muennighoff
Collaborator

also cc @2015aroras

@2015aroras

See ggerganov/llama.cpp#9462

@Meshwa428

> See ggerganov/llama.cpp#9462

It still isn't merged 😞
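
Once it does land, the conversion should presumably follow llama.cpp's usual flow. A sketch driven from Python, with the script name taken from llama.cpp's existing convert_hf_to_gguf.py convention and all paths as placeholders:

```python
# Hypothetical sketch of converting the HF checkpoint to GGUF once
# llama.cpp PR #9462 is merged. The script name follows llama.cpp's
# current convert_hf_to_gguf.py convention; paths are placeholders.
import subprocess

subprocess.run(
    [
        "python", "convert_hf_to_gguf.py",  # llama.cpp conversion script
        "models/OLMoE-1B-7B-0924",          # local HF checkpoint directory
        "--outfile", "olmoe-1b-7b.gguf",
        "--outtype", "f16",                 # keep fp16; quantize separately
    ],
    check=True,
)
```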
