Minimal GPU configuration #19
Hello,
Replies: 1 comment
Hello @CyprienRicque, thank you for showing interest in the LLMLingua project.

In fact, you can use any language model from Hugging Face as the small language model in the LLMLingua pipeline. By default, LLMLingua uses llama-2-7b, which requires approximately 17-20GB of GPU memory for inference. However, by using a quantized version of the model, you can significantly reduce GPU memory usage. For example, TheBloke/Llama-2-7b-Chat-GPTQ requires less than 8GB of GPU memory. You can even use smaller models, such as models the size of GPT-2-small.

You can refer to the following code to use TheBloke/Llama-2-7b-Chat-GPTQ. But before that, make sure to update LLMLingua and install optimum auto-…

from llmlingua import PromptCompressor

llm_lingua = PromptCompressor("TheBloke/Llama-2-7b-Chat-GPTQ", model_config={"revision": "main"})
compressed_prompt = llm_lingua.compress_prompt(prompt, instruction="", question="", target_token=200)
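If even the GPTQ checkpoint is too large, a minimal sketch along the lines of the GPT-2-small suggestion above could look like the following. The model name "gpt2" and the result key used in the print statement are illustrative assumptions, not something stated in this thread; any Hugging Face causal language model should plug in the same way.

from llmlingua import PromptCompressor

# Sketch: use plain GPT-2 (~124M parameters) as the small language model.
# It fits in a few GB of GPU memory, at the cost of somewhat lower
# compression quality than llama-2-7b.
llm_lingua = PromptCompressor("gpt2")

prompt = "Your long prompt goes here ..."
compressed = llm_lingua.compress_prompt(
    prompt, instruction="", question="", target_token=200
)

# compress_prompt returns a dict; the compressed text is under "compressed_prompt".
print(compressed["compressed_prompt"])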