Actions: microsoft/DeepSpeed

nv-lightning-v100

4,894 workflow runs

support autoTP with weight only quantization in DS inference path
nv-lightning-v100 #14022: Pull request #4750 synchronize by ftian1
January 16, 2025 08:10 9m 54s ftian1:master
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
nv-lightning-v100 #14020: Pull request #6931 synchronize by loadams
January 16, 2025 00:40 3h 1m 55s loadams/fix-torch-issues
generalize deepspeed linear and implement it for non cuda systems
nv-lightning-v100 #14019: Pull request #6932 synchronize by loadams
January 16, 2025 00:23 1h 24m 24s oelayan7:linear
nv-lightning-v100
nv-lightning-v100 #14018: Scheduled
January 16, 2025 00:20 56m 19s master
Pin numpy version
nv-lightning-v100 #14017: Pull request #6953 opened by BLOrange-AMD
January 15, 2025 23:43 1h 27m 54s ROCm:pin_numpy
nv-lightning-v100
nv-lightning-v100 #14016: Merge group checks requested
January 15, 2025 22:09 14m 56s
Update sharded_moe.py to support top2 gate with Tutel
nv-lightning-v100 #14015: Pull request #6948 synchronize by loadams
January 15, 2025 21:15 1h 4m 27s xenshinu:patch-1
Unpin tests that previously used a pinned version of transformers
nv-lightning-v100 #14014: Pull request #6387 synchronize by loadams
January 15, 2025 19:46 1h 16m 15s loadams/transformers-fixes
Update sharded_moe.py to support top2 gate with Tutel
nv-lightning-v100 #14013: Pull request #6948 synchronize by xenshinu
January 15, 2025 19:40 Action required xenshinu:patch-1
nv-lightning-v100
nv-lightning-v100 #14012: Merge group checks requested
January 15, 2025 19:25 9m 56s
warn to warning
nv-lightning-v100 #14011: Pull request #6952 opened by qgallouedec
January 15, 2025 18:32 3m 56s qgallouedec:warn_to_warning
Addressing ipg Buffer Data Race Condition in Zero Stage2
nv-lightning-v100 #14010: Pull request #3727 synchronize by loadams
January 15, 2025 17:09 Action required xxr3376:master
[inf] Add config var to enable keeping module on host
nv-lightning-v100 #14009: Pull request #6846 synchronize by loadams
January 15, 2025 17:06 3m 57s oelayan7:keep_module_on_host
generalize deepspeed linear and implement it for non cuda systems
nv-lightning-v100 #14008: Pull request #6932 synchronize by loadams
January 15, 2025 16:24 3m 59s oelayan7:linear
nv-lightning-v100
nv-lightning-v100 #14007: Scheduled
January 15, 2025 00:20 57m 24s master
Set dataloader shuffle=true
nv-lightning-v100 #14006: Pull request #6950 opened by loadams
January 14, 2025 23:52 4m 2s loadams/shuffle-true-dataloader
Update sharded_moe.py to support top2 gate with Tutel
nv-lightning-v100 #14005: Pull request #6948 synchronize by xenshinu
January 14, 2025 20:11 4m 5s xenshinu:patch-1
Update sharded_moe.py to support top2 gate with Tutel
nv-lightning-v100 #14004: Pull request #6948 opened by xenshinu
January 14, 2025 20:11 Action required xenshinu:patch-1
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
nv-lightning-v100 #14003: Pull request #6931 synchronize by loadams
January 14, 2025 19:14 4m 2s loadams/fix-torch-issues
Bing/optimizer naming
nv-lightning-v100 #14002: Pull request #3354 synchronize by loadams
January 14, 2025 17:03 4m 2s bing/optimizer-naming
nv-lightning-v100
nv-lightning-v100 #14001: Scheduled
January 14, 2025 00:20 26m 55s master
Fix assert on Lamb optimizers with BF16
nv-lightning-v100 #14000: Pull request #4451 synchronize by loadams
January 13, 2025 23:22 14m 28s loadams/lamb-bf16
Use ds-specific module id to avoid conflicts
nv-lightning-v100 #13999: Pull request #6847 synchronize by loadams
January 13, 2025 22:41 5m 37s olruwase/pr_6772
Update MII tests to support transformers latest
nv-lightning-v100 #13998: Pull request #6686 synchronize by loadams
January 13, 2025 22:14 19m 8s loadams/update-mii-transformers