Work in progress
Note that my implementation isn't stable yet.
A PyTorch implementation of WaveVAE (mel spectrogram --> waveform), part of "Parallel Neural Text-to-Speech".
PyTorch 0.4.1, Python 3.6, and librosa
- LJSpeech : https://keithito.com/LJ-Speech-Dataset/
python preprocessing.py --in_dir ljspeech --out_dir DATASETS/ljspeech
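For orientation, this kind of feature extraction usually amounts to computing log-mel spectrograms with librosa. The sketch below is only an illustration; the sampling rate, FFT size, hop length, and number of mel bins are assumed values, not necessarily the parameters preprocessing.py actually uses.

import librosa
import numpy as np

def wav_to_logmel(path, sr=22050, n_fft=1024, hop_length=256, n_mels=80):
    # Load and resample the waveform to the target sampling rate.
    wav, _ = librosa.load(path, sr=sr)
    # Compute a mel spectrogram and log-compress it for numerical stability.
    mel = librosa.feature.melspectrogram(y=wav, sr=sr, n_fft=n_fft,
                                         hop_length=hop_length, n_mels=n_mels)
    logmel = np.log(np.clip(mel, 1e-5, None))
    return wav.astype(np.float32), logmel.astype(np.float32)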
python train.py --model_name wavevae_1 --batch_size 4 --num_gpu 2
--load_step CHECKPOINT : the number of the model's global training step (also indicated in the trained weight file)
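As a rough illustration of how resuming by global step can work, the sketch below saves and restores a checkpoint whose filename encodes the step. The path pattern, file naming, and dictionary keys are assumptions and may differ from what train.py actually does.

import os
import torch

def save_checkpoint(model, optimizer, step, log_dir='checkpoints', name='wavevae_1'):
    # Hypothetical path pattern; the repo's actual naming may differ.
    path = os.path.join(log_dir, name, 'checkpoint_step{:09d}.pth'.format(step))
    os.makedirs(os.path.dirname(path), exist_ok=True)
    torch.save({'model': model.state_dict(),
                'optimizer': optimizer.state_dict(),
                'global_step': step}, path)

def load_checkpoint(model, optimizer, step, log_dir='checkpoints', name='wavevae_1'):
    # Restore model and optimizer state for the given global step.
    path = os.path.join(log_dir, name, 'checkpoint_step{:09d}.pth'.format(step))
    checkpoint = torch.load(path, map_location='cpu')
    model.load_state_dict(checkpoint['model'])
    optimizer.load_state_dict(checkpoint['optimizer'])
    return checkpoint['global_step']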
python synthesize.py --model_name wavevae_1 --load_step 10000 --num_samples 5
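Conceptually, synthesis runs the trained decoder over a mel spectrogram and writes the resulting waveform to disk. The sketch below is a minimal illustration only; the generate() method, tensor shapes, and the soundfile dependency are assumptions, not the actual API of synthesize.py.

import torch
import soundfile as sf  # assumed extra dependency for writing WAV files

def synthesize_one(model, logmel, out_path, sr=22050):
    # logmel: numpy array of shape (n_mels, T), as produced by preprocessing.
    model.eval()
    with torch.no_grad():
        mel = torch.as_tensor(logmel).unsqueeze(0)  # add a batch dimension
        wav = model.generate(mel)                   # assumed decoder call returning (1, samples)
    sf.write(out_path, wav.squeeze().cpu().numpy(), sr)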
- WaveNet vocoder : https://github.com/r9y9/wavenet_vocoder
- Parallel Neural Text-to-Speech : https://arxiv.org/abs/1905.08459