A benchmark for evaluating the capabilities of large vision-language models (LVLMs)
benchmark multimodal pre-training reformulation embodied-ai instruction-following gpt4 in-context-learning large-language-models llm instruction-tuning large-vision-language-models visual-chain-of-thought multimodal-chain-of-thought
Updated Nov 17, 2023 - Python