How to keep model in high alert state for faster inference? #243

Open

SuperMaximus1984 opened this issue Aug 25, 2024 · 1 comment

@SuperMaximus1984

I'm running insanely-fast-whisper in an environment where low latency is crucial. As soon as a .wav file is created, it needs to be transcribed immediately. Every time I run:

```
D:\InsanelyFastWhisper>insanely-fast-whisper --file-name myoutputfile_17245952653c006.wav --device-id 0
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
🤗 Transcribing... ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0:00:00
Voila!✨ Your file has been transcribed go check it out over here 👉 output.json
```

I see a noticeable pause at this step: `Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.`

After that, the transcription itself runs pretty fast. I assume the model is being loaded first, which is what takes the time.
How do I keep the model loaded in a "high alert" state so that transcription starts faster when a new .wav file is fed in?
Thanks!

@ArmykOliva

Don't use the CLI; use the transformers Python API instead. The CLI reloads the model on every invocation, while in Python you can load the pipeline once and keep it resident in memory.
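
A minimal sketch of that approach, assuming a CUDA GPU and the `openai/whisper-large-v3` checkpoint (the model insanely-fast-whisper uses by default); the `transcribe` helper and the file path are illustrative, not part of any library:

```python
# Sketch: load the Whisper pipeline once, reuse it for every new .wav file.
import torch
from transformers import pipeline

# This is the slow step you see as the pause in the CLI output.
# It runs only once per process.
pipe = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-large-v3",
    torch_dtype=torch.float16,
    device="cuda:0",
)

def transcribe(path: str) -> str:
    # Later calls reuse the already-loaded weights, so only inference time remains.
    result = pipe(
        path,
        chunk_length_s=30,   # split long audio into 30 s chunks
        batch_size=24,       # batch chunks together on the GPU
        return_timestamps=True,
    )
    return result["text"]

if __name__ == "__main__":
    print(transcribe("myoutputfile_17245952653c006.wav"))
```

You can wrap this in a long-running process (a directory watcher, a small HTTP server, etc.) that calls `transcribe()` whenever a new file appears, so the model load cost is paid only at startup.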
