Replies: 1 comment
-
The alignment approach in by stable-ts is not designed to align |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have long audio files (of the reading of the Bible), and their exact transcript (the Bible, of course), and I want to split them into small files using VAD, and get the aligned transcript of each file. The files are intended to be published in Dataset for training new models.
When I put in the full transcript, and ran the alignment on the full audio file, I got good timestamps.
But when I inserted the full transcript (text of an entire Bible passage), and ran the alignment on a short audio file of a few seconds, I got the result of all the text that was in the full transcript file, and not just the words spoken in the short file.
How can you insert a short file and a long text into alignment, and receive only the text that was said in the file, and not all the text that was in the text for alignment?
Beta Was this translation helpful? Give feedback.
All reactions