Fine-tune SpeechT5 for non-English text-to-speech tasks, implemented in PyTorch.
Updated May 28, 2024 - Python
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.