Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
audio video pytorch transformer gan multi-modal evaluation-metrics video-understanding vas video-features vqvae bmvc melgan audio-generation vggsound
-
Updated
Jul 12, 2024 - Jupyter Notebook