GitHub - Howon/CRN-GAN

Contributors

Barbara Zhan bz2310
Pooja Kathail pk2485
Howon Byun hb2458

Instruction

First run

python download_models.py

to download VGG weights used for loss calculation in Cascaded Refinement Network. Goal of this project is to remove this dependency and replace it with the discriminator from Pix2Pix.

Then, two datasets from Cityscapes need to be downloaded.

gtFine_trainvaltest.zip, which holds semantic labels that are used as the training inputs.
leftImg8bit_trainvaltest.zip used as training targets.

Once these are downloaded and unzipped in this directory, run resize.m by doing

cat resize.m| matlab -nodesktop -nosplash

This Matlab script will resize images in training and validations directories from both gtFine and leftImg8bit folders into data/cityscapes/semantics and data/cityscapes/images, respectively.

Once this is done, simply perform

python crn.py

Citations

Photographic Image Synthesis with Cascaded Refinement Networks

Image-to-Image Translation with Conditional Adversarial Networks/Pix2Pix

Pix2Pix Tensorflow Implementation

Milestone 1

For milestone 1, we focused mostly on getting both Cascading Refinement Networks(CRN) and Pix2Pix to work. Pix2Pix seemed to behave fine out of the box but CRN had a lot of issues, including but not limited to:

Lack of documentation.
Incorrect naming schemes/output structure.
Inconsistent/incorrect code among the nets for each of the target resolutions.
General non-Pythonic code style.
Undocumented interlop with Matlab code used for image processing.

Once we managed to clean up the code, we attached TensorBoard and started training the model. There was an issue where the GPU we are using (1080Ti) could not fit the larger resolution networks in memory so for the sake of the milestone submission we trained the lowest resolution (256 x 512) network. Here are some of the images so far. Notice that since the original CRN used diversity loss to compute the minimum variation across variadic output channels, each image consists of 9 panels.

Semantics

Epoch 20

Epoch 50

Epoch 76 (3.5 hours)

Tensorboard Outputs

Scalar Plots

Images. Note that there are 9 "fake" images generated for diversity loss calculation

Computation Graph.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
doc		doc
README.md		README.md
cityscapes.json		cityscapes.json
crn.py		crn.py
crn_gan.py		crn_gan.py
download_models.py		download_models.py
helper.py		helper.py
resize_265.m		resize_265.m
stitch.py		stitch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contributors

Instruction

Citations

Milestone 1

About

Releases

Packages

Contributors 3

Languages

Howon/CRN-GAN

Folders and files

Latest commit

History

Repository files navigation

Contributors

Instruction

Citations

Milestone 1

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages