Pinned
- DeepSpeed-Chat-Extension (Public): This repo contains some extensions of DeepSpeed-Chat for fine-tuning LLMs (SFT + RLHF).
- NiuTrans/Vision-LLM-Alignment (Public): This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.