Use NNCF in transformers + optimum-intel ? #2308

tigerinus · 2023-12-08T02:59:35Z

tigerinus
Dec 8, 2023

I read that to use NNCF in transformers, patch is needed as described in https://github.com/openvinotoolkit/nncf/blob/develop/third_party_integration/huggingface_transformers/README.md

But it also seems that NNCF is already being used in optimum-intel as described in https://huggingface.co/docs/optimum/intel/optimization_ov

Does it mean to use NNCF I need to do both patching and making sure it uses optimum-intel library?

Thanks.

MaximProshin · 2023-12-08T05:47:32Z

MaximProshin
Dec 8, 2023
Maintainer

@tigerinus , no, it's either of two ways. I would say Intel-optimum is recommended way to do it for HF models while the patch is more an example to demonstrate how it could be done directly.

2 replies

tigerinus Dec 8, 2023
Author

I see.

What are the types of text-generation models supported by the post-training compression described at https://huggingface.co/docs/optimum/intel/optimization_ov#post-training-optimization ?

alexsu52 Jan 22, 2024
Maintainer

What type of models are you interested in? NNCF is general solution for model quantization and don't have known limitation by supporting text-generarion models. If you will face any problems, please, file an issue on github.

FYI: The transformer patch was removed in #2381

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use NNCF in transformers + optimum-intel ? #2308

{{title}}

Replies: 1 comment 2 replies

{{title}}

{{title}}

{{title}}

Select a reply

Use NNCF in transformers + optimum-intel ? #2308

tigerinus Dec 8, 2023

Replies: 1 comment · 2 replies

MaximProshin Dec 8, 2023 Maintainer

tigerinus Dec 8, 2023 Author

alexsu52 Jan 22, 2024 Maintainer

tigerinus
Dec 8, 2023

Replies: 1 comment 2 replies

MaximProshin
Dec 8, 2023
Maintainer

tigerinus Dec 8, 2023
Author

alexsu52 Jan 22, 2024
Maintainer