    Repositories list

    • MoME

      Public
      [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
      Python
      MIT License
      04200 Updated Dec 9, 2024
    • Optimus-1

      Public
      [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
      Java
      55000 Updated Dec 6, 2024
    • Official repository of "FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers"
      Apache License 2.0
      0000 Updated Nov 27, 2024
    • The official repository of "Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding"
      0610 Updated Jul 22, 2024
    • [CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
      Jupyter Notebook
      MIT License
      612830 Updated Jul 18, 2024