Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.
-
Updated
Sep 10, 2024 - Jupyter Notebook
Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.
Research on speech processing, speaker identification and audio diarization
Chat-Bot made using whisper live, speechbrain and open AI API
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.
A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
Speech transcription and speech diarization
Speaker verification of virtual assistants using ECAPA-TDNN model from SpeechBrain toolkit and transfer learning approach emphasizing on inter and intra comparision (text independent and dependent).
Speech Emotion Recognition SE&R 2022
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
Processing EEG data using Speechbrain-MOABB and model tuning to get best results
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
Incremental learning for automatic speech recognition (ASR)
Target speaker automatic speech recognition (TS-ASR)
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
Extensions to YAML syntax for better python interaction
Add a description, image, and links to the speechbrain topic page so that developers can more easily learn about it.
To associate your repository with the speechbrain topic, visit your repo's landing page and select "manage topics."