Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
-
Updated
Oct 19, 2023
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Python implementation of pre-processing for End-to-End speech recognition
Build speech enhancement dataset.
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Extract mfcc vectors and phones from TIMIT dataset
Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.
Keyword spotting using RNNs + Edit distance
End-to-end ASR system on TIMIT
Speaker verification using Gaussian Mixture Model (GMM)
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
Main objective of this model is to develop Automatic Speech Recognition using Deep Neural Network.
The initial CNN experiments of my bachelor thesis
My bachelor thesis on Phoneme recognition and alignment on the TIMIT dataset
Add a description, image, and links to the timit-dataset topic page so that developers can more easily learn about it.
To associate your repository with the timit-dataset topic, visit your repo's landing page and select "manage topics."