Skip to content
@FoundationVision

FoundationVision

Hi there 👋

This is FoundationVision official website repo

Popular repositories Loading

  1. VAR VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

    Jupyter Notebook 6.4k 428

  2. LlamaGen LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.4k 57

  3. GLEE GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.1k 86

  4. Groma Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 586 61

  5. Infinity Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Python 454 5

  6. OmniTokenizer OmniTokenizer Public

    [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python 276 7

Repositories

Showing 10 of 12 repositories
  • Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    FoundationVision/Infinity’s past year of commit activity
    Python 454 MIT 5 5 0 Updated Dec 26, 2024
  • FoundationVision/infinity.project’s past year of commit activity
    HTML 0 0 0 0 Updated Dec 24, 2024
  • VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    FoundationVision/VAR’s past year of commit activity
    Jupyter Notebook 6,431 MIT 428 34 0 Updated Dec 22, 2024
  • Liquid Public

    Liquid: Language Models are Scalable Multi-modal Generators

    FoundationVision/Liquid’s past year of commit activity
    42 MIT 0 1 0 Updated Dec 13, 2024
  • GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    FoundationVision/GLEE’s past year of commit activity
    Python 1,114 MIT 86 39 2 Updated Oct 21, 2024
  • LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    FoundationVision/LlamaGen’s past year of commit activity
    Python 1,417 MIT 57 50 0 Updated Aug 16, 2024
  • OmniTokenizer Public

    [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

    FoundationVision/OmniTokenizer’s past year of commit activity
    Python 276 MIT 7 8 0 Updated Jul 10, 2024
  • vaex Public

    🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

    FoundationVision/vaex’s past year of commit activity
    Python 65 MIT 4 2 0 Updated Jun 23, 2024
  • Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    FoundationVision/Groma’s past year of commit activity
    Python 586 Apache-2.0 61 8 1 Updated Jun 7, 2024
  • GenerateU Public

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    FoundationVision/GenerateU’s past year of commit activity
    Python 151 6 15 0 Updated Mar 25, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.