@tigerinus, no, you only need one of the two approaches, not both. I would say optimum-intel is the recommended way to do it for HF models, while the patch is more of an example to demonstrate how it could be done directly.
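For illustration, here is a minimal sketch of what the optimum-intel route can look like, with no patching of transformers involved. It assumes `optimum[openvino]` is installed and that your installed version supports the `load_in_8bit` argument for NNCF 8-bit weight compression; the helper name `load_int8_ov_model` is made up for this example, so please check the optimum-intel docs for your version before relying on it.

```python
def load_int8_ov_model(model_id: str):
    """Export a Hugging Face causal LM to OpenVINO with 8-bit weight compression.

    Hypothetical helper for illustration only. The import is done lazily so
    that defining the function does not require optimum-intel to be installed.
    """
    # Assumption: installed via `pip install "optimum[openvino]"`.
    from optimum.intel import OVModelForCausalLM

    # export=True converts the PyTorch checkpoint to OpenVINO IR on the fly;
    # load_in_8bit=True asks optimum-intel to compress the weights to 8 bits,
    # which it does through NNCF under the hood.
    return OVModelForCausalLM.from_pretrained(
        model_id,
        export=True,
        load_in_8bit=True,
    )
```

You would call it with any causal LM checkpoint from the Hub, e.g. `model = load_int8_ov_model("gpt2")`, and then use `model` with the usual `transformers` tokenizer and `generate` workflow.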
I read that to use NNCF in transformers, a patch is needed, as described in https://github.com/openvinotoolkit/nncf/blob/develop/third_party_integration/huggingface_transformers/README.md
But it also seems that NNCF is already used in optimum-intel, as described in https://huggingface.co/docs/optimum/intel/optimization_ov
Does this mean that to use NNCF I need to both apply the patch and make sure the optimum-intel library is used?
Thanks.