    Repositories list

    • 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
      Python
      Apache License 2.0
      Updated Nov 15, 2024
    • 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
      Python
      Apache License 2.0
      Updated Nov 14, 2024
    • fms-fsdp
      🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
      Python
      Apache License 2.0
      Updated Nov 13, 2024
    • fms-dgt
      Synthetic Data Generation for Foundation Models
      Python
      Apache License 2.0
      Updated Nov 12, 2024
    • 🚀 Guardrails orchestration server for application of various detections on text generation input and output.
      Rust
      Apache License 2.0
      Updated Nov 11, 2024
    • 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
      Python
      Apache License 2.0
      Updated Nov 7, 2024
    • Estimate resources needed to train LLMs
      Python
      Apache License 2.0
      Updated Nov 7, 2024
    • Scan resources consumed during LLM training
      Python
      Apache License 2.0
      Updated Nov 4, 2024
    • High-performance safetensors model loader
      Python
      Apache License 2.0
      Updated Nov 1, 2024
    • Demonstration of MoE distributed training using various techniques
      Python
      Updated Oct 31, 2024
    • Dockerfile
      Apache License 2.0
      Updated Oct 28, 2024
    • pod-vllm
      Source code to launch a number of pods, performing synthetic data generation
      Python
      Apache License 2.0
      Updated Oct 22, 2024
    • Go
      Apache License 2.0
      Updated Oct 10, 2024
    • Python
      Apache License 2.0
      Updated Sep 9, 2024
    • Go
      Apache License 2.0
      Updated Aug 29, 2024
    • avengers
      Shell
      Apache License 2.0
      Updated Jul 20, 2024
    • trl
      Train transformer language models with reinforcement learning.
      Python
      Apache License 2.0
      Updated Mar 5, 2024
    • Training job management tool for foundation model service
      Python
      Apache License 2.0
      Updated Feb 28, 2024
    • Operator that enables EFA and/or GDRCOPY in an OpenShift cluster
      Go
      Apache License 2.0
      Updated Nov 22, 2023
    • Training operators on Kubernetes.
      Python
      Apache License 2.0
      Updated Nov 16, 2022