Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
-
Updated
Sep 7, 2023 - Python
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Official Code for 'TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes' (CVPR 2024)
Official PyTorch Implementation of: "MDMP: Multi-modal Diffusion for supervised Motion Predictions"
Add a description, image, and links to the 3d-vision-and-language topic page so that developers can more easily learn about it.
To associate your repository with the 3d-vision-and-language topic, visit your repo's landing page and select "manage topics."