Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
- Set `data_path` in `hparams.py` to the LJSpeech folder
- Set `teacher_dir` in `hparams.py` to the data directory where the alignments and mel-spectrogram targets are saved
- Put the checkpoint of the pre-trained Transformer-TTS in place (the weights of its embedding/encoder layers are reused); see the sketches below this list
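The following is a minimal sketch of what the relevant entries in `hparams.py` might look like. The paths are placeholders, and `teacher_checkpoint` is a hypothetical name for the Transformer-TTS checkpoint setting, not necessarily the one used in this repository.

```python
# Sketch of the relevant hparams.py entries (paths are placeholders; the real
# file contains many more hyperparameters).
data_path = '/path/to/LJSpeech-1.1'        # LJSpeech folder (wavs/ + metadata.csv)
teacher_dir = '/path/to/teacher_outputs'   # precomputed alignments and mel-spectrogram targets
# Hypothetical setting: location of the pre-trained Transformer-TTS checkpoint;
# its embedding/encoder weights are reused when training FastSpeech.
teacher_checkpoint = '/path/to/transformer_tts.pt'
```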
Then train the model with `python train.py`.
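As a rough illustration of the third setup step, the snippet below sketches how the embedding/encoder weights of the pre-trained Transformer-TTS checkpoint could be copied into the FastSpeech model before training. It assumes PyTorch; the function name, checkpoint layout, and parameter-name prefixes are assumptions, so adapt them to the actual model definitions in this repository.

```python
import torch

def load_teacher_encoder(fastspeech_model, checkpoint_path):
    """Copy matching embedding/encoder weights from a Transformer-TTS checkpoint.

    Sketch only: the key prefixes and checkpoint layout are assumptions.
    """
    ckpt = torch.load(checkpoint_path, map_location='cpu')
    teacher_state = ckpt.get('model', ckpt)   # some checkpoints nest weights under 'model'
    own_state = fastspeech_model.state_dict()
    copied = []
    for name, tensor in teacher_state.items():
        # Reuse only embedding/encoder parameters whose names and shapes match.
        if (name.startswith('embedding') or name.startswith('encoder')) \
                and name in own_state and own_state[name].shape == tensor.shape:
            own_state[name].copy_(tensor)
            copied.append(name)
    return copied
```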
The size of the training dataset is different because a Transformer-TTS trained with phonemes shows more diagonal attention, which changes which utterances yield usable alignments.
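One plausible way such a selection could work is to score how diagonal each utterance's teacher attention is and keep only sufficiently diagonal ones. The scoring rule and threshold below are illustrative assumptions, not values taken from this repository.

```python
import numpy as np

def diagonality_score(attn):
    """attn: (decoder_steps, encoder_steps) attention weights for one utterance."""
    T, N = attn.shape
    expected = np.linspace(0, N - 1, num=T)   # encoder position of a perfect diagonal
    actual = attn.argmax(axis=1)              # encoder position actually attended to
    # 1.0 means perfectly diagonal; lower means the alignment wanders off the diagonal.
    return 1.0 - np.mean(np.abs(actual - expected)) / N

def keep_sample(attn, threshold=0.9):
    # Illustrative threshold; keep utterances whose teacher attention is close to diagonal.
    return diagonality_score(attn) >= threshold
```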
You can listen to the audio samples here