Skip to content
@swiss-ai

swiss-ai

Popular repositories Loading

  1. mmore mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets …

    Python 20 4

  2. MoE MoE Public

    some mixture of experts architecture implementations

    Python 12 2

  3. nanotron nanotron Public

    Forked from huggingface/nanotron

    Minimalistic large language model 3D-parallelism training

    Python 7 6

  4. video2dataset video2dataset Public

    Forked from iejMac/video2dataset

    Easily create large video dataset from video urls

    Python 1

  5. Megatron-LLM Megatron-LLM Public

    Forked from epfLLM/Megatron-LLM

    distributed trainer for LLMs

    Python

  6. data-PDF-pipeline data-PDF-pipeline Public

    PDF pipeline for creating training corpora (mainly for llm, multimodal and alignment horizontals)

    Python

Repositories

Showing 10 of 16 repositories
  • mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!

    swiss-ai/mmore’s past year of commit activity
    Python 20 Apache-2.0 4 20 (1 issue needs help) 2 Updated Dec 25, 2024
  • llm-proxy Public Forked from xiaozheyao/llm-proxy

    LLM Serving and User Control

    swiss-ai/llm-proxy’s past year of commit activity
    Python 0 2 0 0 Updated Dec 21, 2024
  • lighteval Public Forked from huggingface/lighteval

    Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

    swiss-ai/lighteval’s past year of commit activity
    Python 0 MIT 110 0 0 Updated Dec 19, 2024
  • swiss-ai/ethel-tutor-eval’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 0 0 0 Updated Dec 19, 2024
  • nanotron Public Forked from huggingface/nanotron

    Minimalistic large language model 3D-parallelism training

    swiss-ai/nanotron’s past year of commit activity
    Python 7 Apache-2.0 135 1 9 Updated Dec 2, 2024
  • containers Public

    Containers for multimodal initiative (and maybe more across Swiss AI?)

    swiss-ai/containers’s past year of commit activity
    Dockerfile 0 0 0 0 Updated Nov 29, 2024
  • ml-4m Public Forked from apple/ml-4m

    4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)

    swiss-ai/ml-4m’s past year of commit activity
    Python 0 Apache-2.0 100 13 4 Updated Nov 29, 2024
  • data-tooling Public Forked from huggingface/datatrove

    Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)

    swiss-ai/data-tooling’s past year of commit activity
    Python 0 Apache-2.0 158 0 1 Updated Nov 21, 2024
  • swiss-ai/llm-pretrain-data-toxicity-removal’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Nov 6, 2024
  • nanotron-multilingual Public Forked from swiss-ai/nanotron

    A copy of nanotron for multilingual training

    swiss-ai/nanotron-multilingual’s past year of commit activity
    Python 0 Apache-2.0 135 0 2 Updated Oct 23, 2024

Top languages

Loading…

Most used topics

Loading…