Voice Cloning

Note: The Voice_cloning_Training_with_Tacotron2_and_WaveGlow.ipynb notebook is to be run in Google Colab. Once in Colab, you need to import the data_cleaned.zip dataset into the current folder /content/. Replace the files in the folder /content/TTS-TT2/filelists/ with my files that have the same name after installing Tacotron2. LThe rest of the code will take care of unzipping it and putting it in the new folder /content/TTS-TT2/wavs/. The program will then ask you to load your transcription file. You will give it the list.txt file.

The files in the input folder are needed to give input to the speech synthesis model. They are also found at the root of the project. The wav files correspond to the zip file: data_cleaned.zip and the list.txt, ljs_audio_text_val_filelists.txt, ljs_audio_text_val_filelists.txt and ljs_audio_text_val_filelists.txtfiles are also found at the root of the project. The files in the output folder are the results of the model, during and after training.

TREE:

input

filelists
- list.txt
- ljs_audio_text_test_filelists.txt
- ljs_audio_text_train_filelists.txt
- ljs_audio_text_val_filelists.txt
wavs
- 1.npy
- 1.wav
- ...
- 60.npy
- 60.wav

output

audio
- model_BS_6_0.00003_350epoch_0_original_audio.wav
- model_BS_6_0.00003_350epoch_0_predicted_audio.wav
- ...
- model_BS_6_0.00003_350epoch_20_original_audio.wav
- model_BS_6_0.00003_350epoch_20_predicted_audio.wav
- model_BS_6_0.00003_350signals_epoch_0.png
- ...
- model_BS_6_0.00003_350signals_epoch_20.png
images
- model_BS_6_0.00003_350_Alignment_Epoch_0_Iteration_9_Validation_Loss_1.7767614126205444.png
- ...
- model_BS_6_0.00003_350_Alignment_Epoch_20_Iteration_189_Validation_Loss_1.0240533351898193.png
logs
- events.out.tfevents.1703405636.c8a2ca7defbc.1806.11
loss
- model_BS_6_0.00003_350loss_curve_epoch_0.png
- ...
- model_BS_6_0.00003_350loss_curve_epoch_22.png
spectrogram
- model_BS_6_0.00003_350spectrograms_epoch_0.png
- ...
- model_BS_6_0.00003_350spectrograms_epoch_20.png

Voice_cloning_Training_with_Tacotron2_and_WaveGlow.ipynb
MLSP Presentation_Clonage_de_la_voix.pdf
MLSP_Rapport_Clonage_de_la_voix.pdf
README.md
data_cleaned.zip
list.txt
ljs_audio_text_test_filelist.txt
ljs_audio_text_train_filelist.txt
ljs_audio_text_val_filelist.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Cloning

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
input		input
output		output
MLSP Presentation_Clonage_de_la_voix.pdf		MLSP Presentation_Clonage_de_la_voix.pdf
MLSP_Rapport_Clonage_de_la_voix.pdf		MLSP_Rapport_Clonage_de_la_voix.pdf
README.md		README.md
Voice_cloning_Training_with_Tacotron2_and_WaveGlow.ipynb		Voice_cloning_Training_with_Tacotron2_and_WaveGlow.ipynb
data_cleaned.zip		data_cleaned.zip
list.txt		list.txt
ljs_audio_text_test_filelist.txt		ljs_audio_text_test_filelist.txt
ljs_audio_text_train_filelist.txt		ljs_audio_text_train_filelist.txt
ljs_audio_text_val_filelist.txt		ljs_audio_text_val_filelist.txt
test_tts.wav		test_tts.wav

isabelleysseric/voice-cloning

Folders and files

Latest commit

History

Repository files navigation

Voice Cloning

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages