    Repositories list

    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      122715 Updated Nov 10, 2024
    • Python
      Apache License 2.0
      0000 Updated Nov 6, 2024
    • A copy of nanotron for multilingual training
      Python
      Apache License 2.0
      122002 Updated Oct 23, 2024
    • LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
      Python
      MIT License
      95200 Updated Oct 18, 2024
    • Easily create large video datasets from video URLs
      Python
      MIT License
      65101 Updated Oct 14, 2024
    • llm-proxy

      Public
      LLM Serving and User Control
      Python
      1000 Updated Aug 30, 2024
    • Containers for multimodal initiative (and maybe more across Swiss AI?)
      Dockerfile
      0000 Updated Aug 7, 2024
    • ml-4m-v2

      Public
      0000 Updated Aug 5, 2024
    • ml-4m

      Public
      4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)
      Python
      Apache License 2.0
      950134 Updated Aug 5, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.6k000 Updated Jul 31, 2024
    • Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)
      Python
      Apache License 2.0
      147000 Updated Jul 15, 2024
    • PDF pipeline for creating training corpora (mainly for the LLM, multimodal, and alignment horizontals)
      Python
      Apache License 2.0
      0000 Updated May 8, 2024
    • MoE

      Public
      some mixture of experts architecture implementations
      Python
      Apache License 2.0
      1700 Updated Mar 22, 2024
    • distributed trainer for LLMs
      Python
      Other
      77000 Updated Feb 8, 2024