Slurred-Speech-Recognition-DeepLearning

For Speech-to-Text problems, our training data consists of:

Problem Statement

The purpose of this project is to fine tune the the automatic speech recognition model or apply the technique of transfer learning so that it can convert atypical speech (voice of people with speech impairments) into text.

High level Solution Overview

We will start with the state of the art end to end speech Recognition model with high accuracy. This high quality ASR model will be trained on hundreds of hours of typical or standard speech with no impairements. After we achieve high accuracy for the end to end model, then we will start fine-tuning parts of the model to an individual with speech impairement.
So our main aproach is training a base model on a large dataset of normal speech and then training a personalised model using a much smaller slurred speech dataset. We can use tranfer learning for fine tuning parts of our base model.

Model Architecture

Base Model Performance

The base ASR model was trained on 100 hours of Librispeech Dataset.

Final Epoch Average Loss: 0.46
Final Epoch Average CER: 0.10
Final Epoch Average WER: 0.11

Dataset preparation

After we train our ASR model on hundreds of hours of typical speech, we are good to go for fine-tuning our model on impaired speech. We need to collect impaired speech dataset. We build web app using django framework to do the same.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.vscode		.vscode
Best-Fit-TrainedModel		Best-Fit-TrainedModel
Dataset		Dataset
Notebooks Code		Notebooks Code
Papers		Papers
Project_Update_Reports		Project_Update_Reports
README.md		README.md
btp_report.pdf		btp_report.pdf
prediction.txt		prediction.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Slurred-Speech-Recognition-DeepLearning

Problem Statement

High level Solution Overview

Model Architecture

Base Model Performance

Dataset preparation

Link to web APP:

About

Releases

Packages

Languages

kumar-shivam-ranjan/Slurred-Speech-Recognition-DeepLearning

Folders and files

Latest commit

History

Repository files navigation

Slurred-Speech-Recognition-DeepLearning

Problem Statement

High level Solution Overview

Model Architecture

Base Model Performance

Dataset preparation

Link to web APP:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages