Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
-
Updated
Oct 4, 2024 - Python
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Language Repository for Long Video Understanding
Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
a multi-modal video caption dataset with richer annotation
Add a description, image, and links to the long-video-understanding topic page so that developers can more easily learn about it.
To associate your repository with the long-video-understanding topic, visit your repo's landing page and select "manage topics."