Skip to content

Actions: microsoft/DeepSpeed

nv-lightning-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,885 workflow runs
4,885 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-lightning-v100 #14063: Pull request #6553 synchronize by gyou2021
January 21, 2025 08:59 Action required gyou2021:configurable_autoTP
January 21, 2025 08:59 Action required
Autotp training
nv-lightning-v100 #14062: Pull request #6922 synchronize by inkcherry
January 21, 2025 08:36 Action required inkcherry:autotp_training
January 21, 2025 08:36 Action required
Fix: forbid repeated deepspeed.initialize on training objects
nv-lightning-v100 #14059: Pull request #6874 synchronize by tjruwase
January 21, 2025 00:41 Action required traincheck-team:fix-6848-forbid-repeated-init
January 21, 2025 00:41 Action required
Tecorigin sdaa accelerator
nv-lightning-v100 #14058: Pull request #6903 synchronize by tjruwase
January 21, 2025 00:27 Action required siqi654321:Tecorigin-SDAA-accelerator
January 21, 2025 00:27 Action required
nv-lightning-v100
nv-lightning-v100 #14057: Scheduled
January 21, 2025 00:20 3m 55s master
January 21, 2025 00:20 3m 55s
Precisely track nvme optimizer offload
nv-lightning-v100 #14056: Pull request #6963 opened by tjruwase
January 20, 2025 17:00 3m 58s olruwase/ds_4998
January 20, 2025 17:00 3m 58s
Using explicit GPU upcast for ZeRO-Offload
nv-lightning-v100 #14055: Pull request #6962 opened by xylian86
January 20, 2025 13:25 3m 56s xylian86:explicit_upcast
January 20, 2025 13:25 3m 56s
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-lightning-v100 #14053: Pull request #6553 synchronize by gyou2021
January 20, 2025 10:03 Action required gyou2021:configurable_autoTP
January 20, 2025 10:03 Action required
Autotp training
nv-lightning-v100 #14052: Pull request #6922 synchronize by inkcherry
January 20, 2025 09:24 4m 2s inkcherry:autotp_training
January 20, 2025 09:24 4m 2s
Autotp training
nv-lightning-v100 #14051: Pull request #6922 synchronize by inkcherry
January 20, 2025 07:50 3m 58s inkcherry:autotp_training
January 20, 2025 07:50 3m 58s
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-lightning-v100 #14050: Pull request #6553 synchronize by gyou2021
January 20, 2025 06:23 Action required gyou2021:configurable_autoTP
January 20, 2025 06:23 Action required
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-lightning-v100 #14049: Pull request #6553 synchronize by gyou2021
January 20, 2025 05:25 Action required gyou2021:configurable_autoTP
January 20, 2025 05:25 Action required
nv-lightning-v100
nv-lightning-v100 #14048: Scheduled
January 20, 2025 00:21 3m 53s master
January 20, 2025 00:21 3m 53s
nv-lightning-v100
nv-lightning-v100 #14047: Scheduled
January 19, 2025 00:22 3m 52s master
January 19, 2025 00:22 3m 52s
nv-lightning-v100
nv-lightning-v100 #14046: Merge group checks requested
January 18, 2025 01:21 13m 4s
January 18, 2025 01:21 13m 4s
nv-lightning-v100
nv-lightning-v100 #14045: Scheduled
January 18, 2025 00:19 56m 3s master
January 18, 2025 00:19 56m 3s
Add the missing view operations from sequence parallel(async).
nv-lightning-v100 #14044: Pull request #6750 synchronize by tohtana
January 17, 2025 23:12 4m 2s inkcherry:ds_overlap_fix
January 17, 2025 23:12 4m 2s
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
nv-lightning-v100 #14043: Pull request #6931 synchronize by loadams
January 17, 2025 22:20 15m 13s loadams/fix-torch-issues
January 17, 2025 22:20 15m 13s
Explicitly use the linalg.vector_norm call in comm/
nv-lightning-v100 #14042: Pull request #6960 synchronize by loadams
January 17, 2025 22:16 4m 5s loadams/fix-torch-linalg-norm
January 17, 2025 22:16 4m 5s
[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang
nv-lightning-v100 #14041: Pull request #6942 synchronize by loadams
January 17, 2025 19:18 16m 1s loadams/cpu-runner-debug
January 17, 2025 19:18 16m 1s