This project builds a system to split spoken sentences into words (speaker-independent) and calculates the speaker's average pitch, comparing different methods to identify word boundaries.
-
Updated
Oct 19, 2024 - Python
This project builds a system to split spoken sentences into words (speaker-independent) and calculates the speaker's average pitch, comparing different methods to identify word boundaries.
Pytorch implementation for DeepSpeech 2.0
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
Gender Classification of the speaker from LibriSpeech Dataset
Voice Activity Detection based on Deep Learning & TensorFlow
GSoC'2021 | TensorFlow implementation of Wav2Vec2
In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.
In this notebook, I will build a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline!
Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.
Project on Automatic Speech Recogition for course Deep Learning Speech Processing
An implementation of a DNN speech recognizer as part of the Udacity NLP NanoDegree program
Add a description, image, and links to the librispeech-dataset topic page so that developers can more easily learn about it.
To associate your repository with the librispeech-dataset topic, visit your repo's landing page and select "manage topics."