ℹ️ Paper accepted at CLIC workshop @ CVPR 2022 !
Repo under construction!
- We introduce a novel frame interpolation algorithm that utilizes both flow and occlusion maps between four input frames to estimate an automatically adaptable pixel-wise non-linear motion model to interpolate the frames.
- We propose a parameter and runtime-efficient 3D CNN named
GridNet-3D
to aggregate multi-scale features efficiently. OurGridNet-3D
has only 2.44 M parameters which performs better thanUNet-3D
with 42.06 M parameters.
- torch==1.1.0 (CUDA 10.1)
- torchvision==0.3.0
- opencv-python==3.4.2
- scikit-image==0.17.2
Please setup IRR repository and update installation directory in model.py
.
The quintuplets used for evaluation are stored in datasets
folder as .csv
files. Please modify the absolute path accordingly.
python eval.py --dataset <dataset name> --data_root <dataset location>
Our code is built upon the following existing papers and repositories.
@InProceedings{Dutta_2022_CVPR,
author = {Dutta, Saikat and Subramaniam, Arulkumar and Mittal, Anurag},
title = {Non-Linear Motion Estimation for Video Frame Interpolation Using Space-Time Convolutions},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2022},
pages = {1726-1731}
}
<github username>
779@gmail.com