Speaker diarization #230

wanderGuy · 2024-06-02T20:07:46Z

Hi!
I'm using the following command:
insanely-fast-whisper --file-name intro.mp3 --language en

And I get an output.json which looks like this:
{
  "speakers": [],
  "chunks": [
    { "timestamp": [0.0, 4.7], "text": "bla bla bla" },
    { "timestamp": [4.7, 7.7], "text": "bla bla bla" }
  ],
  "text": "bla bla bla bla bla bla"
}

Is there any way to show the speaker_0, speaker_1 thing?
Thx!


flaviodelgrosso · 2024-06-06T18:59:28Z

> Hi!
> I'm using the following command:
> insanely-fast-whisper --file-name intro.mp3 --language en
>
> And I get an output.json which looks like this:
> { "speakers": [], "chunks": [{ "timestamp": [0.0, 4.7], "text": "bla bla bla" }, { "timestamp": [4.7, 7.7], "text": "bla bla bla" } ], "text": "bla bla bla bla bla bla" }
>
> Is there any way to show the speaker_0, speaker_1 thing?
> Thx!

You need to provide a Hugging Face authentication token so that pyannote.audio can diarise the audio clips. Pass it to the command with the --hf-token argument.
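
For anyone scripting this, here is a minimal sketch of the same call with the token added. It only reuses the flags already shown in this thread (--file-name, --language, --hf-token). The helper names `build_command` and `transcribe` and the `HF_TOKEN` environment variable are my own assumptions for illustration, not part of the tool.

```python
import os
import subprocess


def build_command(file_name: str, language: str, hf_token: str) -> list[str]:
    """Build the argv list for the CLI call from this thread, plus --hf-token.

    Hypothetical helper: it only assembles arguments and runs nothing.
    """
    return [
        "insanely-fast-whisper",
        "--file-name", file_name,
        "--language", language,
        "--hf-token", hf_token,  # the flag suggested in the reply above
    ]


def transcribe(file_name: str, language: str = "en") -> None:
    """Run the CLI, reading the token from the environment.

    HF_TOKEN is an assumed variable name chosen for this sketch; it keeps
    the token out of the script itself.
    """
    token = os.environ["HF_TOKEN"]
    subprocess.run(build_command(file_name, language, token), check=True)
```

With `HF_TOKEN` set, calling `transcribe("intro.mp3")` should run the original command with diarization enabled, assuming the CLI is installed and on your PATH.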
