This directory contains an implementation of Vector Quantized Variational Auto-Encoders (VQ-VAE), introduced in van den Oord et al. (2017). I used the Sonnet implementation as well as a notebook by Amélie Royer as references. The directory contains the following scripts:
- `train.py`: trains a VQ-VAE on either the MNIST or CIFAR-10 dataset;
- `visualize_codes.py`: samples a batch of images and displays them alongside their quantized codes.
Usage information is available via the `--help` flag.
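At the heart of both scripts is the vector-quantization layer from the paper: the encoder output is snapped to its nearest codebook vector, and gradients flow back to the encoder through a straight-through estimator. Below is a minimal PyTorch sketch of such a layer, not the exact module used in this directory; the framework choice and the hyperparameter names (`num_codes`, `code_dim`, `beta`) are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    """Nearest-neighbour codebook lookup with a straight-through gradient."""

    def __init__(self, num_codes=512, code_dim=64, beta=0.25):
        super().__init__()
        self.beta = beta  # commitment cost from van den Oord et al. (2017)
        self.codebook = nn.Embedding(num_codes, code_dim)
        self.codebook.weight.data.uniform_(-1 / num_codes, 1 / num_codes)

    def forward(self, z_e):
        # z_e: encoder output of shape (B, C, H, W) with C == code_dim.
        B, C, H, W = z_e.shape
        flat = z_e.permute(0, 2, 3, 1).reshape(-1, C)  # (B*H*W, C)

        # Squared Euclidean distance to every codebook vector.
        dists = (flat.pow(2).sum(1, keepdim=True)
                 - 2 * flat @ self.codebook.weight.t()
                 + self.codebook.weight.pow(2).sum(1))
        indices = dists.argmin(dim=1)  # discrete code per spatial position
        z_q = self.codebook(indices).view(B, H, W, C).permute(0, 3, 1, 2)

        # Codebook + commitment losses; straight-through estimator for the encoder.
        codebook_loss = F.mse_loss(z_q, z_e.detach())
        commitment_loss = self.beta * F.mse_loss(z_e, z_q.detach())
        z_q = z_e + (z_q - z_e).detach()
        return z_q, codebook_loss + commitment_loss, indices.view(B, H, W)
```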
Additionally, the `plots/` and `logs/` directories contain plots and TensorBoard logs for the training runs, respectively.
I tested this implementation by training the model on the MNIST and CIFAR-10 datasets. The results are shown below.
For each experiment, the corresponding plots and logs are available in the `plots/` and `logs/` directories, respectively.
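Training minimizes the standard VQ-VAE objective: reconstruction error plus the codebook and commitment terms returned by the quantizer. A hypothetical training step could look like the sketch below; `encoder`, `decoder`, `quantizer`, and `optimizer` are placeholders, not the actual objects defined in `train.py`.

```python
import torch.nn.functional as F

def train_step(encoder, decoder, quantizer, optimizer, batch):
    images, _ = batch
    z_e = encoder(images)             # continuous latents
    z_q, vq_loss, _ = quantizer(z_e)  # quantized latents + codebook/commitment losses
    recon = decoder(z_q)

    # Total VQ-VAE objective: reconstruction error plus the vector-quantization terms.
    loss = F.mse_loss(recon, images) + vq_loss

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```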
I trained the VQ-VAE model on the MNIST dataset for 20 epochs. The run name is `vqvae-mnist` for the plots and TensorBoard logs.
The image below depicts a batch of sampled images from the test dataset.
The following animation displays the reconstructed images of the original sampled batch at the end of each epoch.
Next, we visualize the t-SNE embeddings of the learned codebook.
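One way to produce such a plot is to run scikit-learn's t-SNE directly on the codebook weights. The snippet below is a sketch using the placeholder `quantizer` from the earlier example, not the code that generated the figure.

```python
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

# `quantizer.codebook` is assumed to be an nn.Embedding holding the learned code vectors.
codes = quantizer.codebook.weight.detach().cpu().numpy()  # (num_codes, code_dim)
embedded = TSNE(n_components=2, perplexity=30).fit_transform(codes)

plt.scatter(embedded[:, 0], embedded[:, 1], s=10)
plt.title("t-SNE of the learned codebook")
plt.show()
```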
Finally, we sample images from the test set and visualize the discrete representations learned by the model.
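As a rough illustration of what `visualize_codes.py` produces, the sketch below encodes a test batch, takes the nearest-codebook indices, and shows each image next to its grid of discrete codes. It reuses the placeholder names `encoder` and `quantizer` from above, plus an assumed `test_loader`.

```python
import matplotlib.pyplot as plt
import torch

images, _ = next(iter(test_loader))             # `test_loader` is assumed
with torch.no_grad():
    _, _, indices = quantizer(encoder(images))  # (B, H', W') integer code map

fig, axes = plt.subplots(2, 8, figsize=(16, 4))
for i in range(8):
    axes[0, i].imshow(images[i].permute(1, 2, 0).squeeze(), cmap="gray")
    axes[1, i].imshow(indices[i].cpu(), cmap="viridis")  # discrete codes
    axes[0, i].axis("off")
    axes[1, i].axis("off")
plt.show()
```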
Next, I trained the VQ-VAE model on the CIFAR-10 dataset for 50 epochs. The run name is `vqvae-cifar10` for the plots and TensorBoard logs.
The image below depicts a batch of sampled images from the test dataset.
The following animation displays the reconstructed images of the original sampled batch at the end of each epoch.
Next, we visualize the t-SNE embeddings of the learned codebook.
Finally, we sample images from the test set and visualize the discrete representations learned by the model.