🎗️
Figuring out LLMs for Vision
CS Ph.D. Student at Georgia Tech | GRA @SHI-Labs | IIT Roorkee CSE 2023
-
Georgia Tech
-
11:49
(UTC -05:00) - https://praeclarumjj3.github.io/
- @praeclarumjj
Highlights
- Pro
Pinned Loading
-
SHI-Labs/OneFormer
SHI-Labs/OneFormer Public[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
-
SHI-Labs/OLA-VLM
SHI-Labs/OLA-VLM PublicOLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024
-
SHI-Labs/VCoder
SHI-Labs/VCoder Public[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
-
Picsart-AI-Research/SeMask-Segmentation
Picsart-AI-Research/SeMask-Segmentation Public[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
-
SHI-Labs/FcF-Inpainting
SHI-Labs/FcF-Inpainting Public[WACV 2023] Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.