Actions: microsoft/DeepSpeed

nv-accelerate-v100

4,788 workflow runs

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-accelerate-v100 #12801: Pull request #6553 synchronize by gyou2021
January 21, 2025 08:59 Action required gyou2021:configurable_autoTP
Autotp training
nv-accelerate-v100 #12800: Pull request #6922 synchronize by inkcherry
January 21, 2025 08:36 Action required inkcherry:autotp_training
Fix: forbid repeated deepspeed.initialize on training objects
nv-accelerate-v100 #12797: Pull request #6874 synchronize by tjruwase
January 21, 2025 00:41 Action required traincheck-team:fix-6848-forbid-repeated-init
Tecorigin sdaa accelerator
nv-accelerate-v100 #12796: Pull request #6903 synchronize by tjruwase
January 21, 2025 00:27 Action required siqi654321:Tecorigin-SDAA-accelerator
nv-accelerate-v100
nv-accelerate-v100 #12795: Scheduled
January 21, 2025 00:07 7m 41s master
Precisely track nvme optimizer offload
nv-accelerate-v100 #12794: Pull request #6963 opened by tjruwase
January 20, 2025 17:00 11m 30s olruwase/ds_4998
Using explicit GPU upcast for ZeRO-Offload
nv-accelerate-v100 #12793: Pull request #6962 opened by xylian86
January 20, 2025 13:25 7m 30s xylian86:explicit_upcast
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-accelerate-v100 #12791: Pull request #6553 synchronize by gyou2021
January 20, 2025 10:03 Action required gyou2021:configurable_autoTP
Autotp training
nv-accelerate-v100 #12790: Pull request #6922 synchronize by inkcherry
January 20, 2025 09:24 9m 55s inkcherry:autotp_training
Autotp training
nv-accelerate-v100 #12789: Pull request #6922 synchronize by inkcherry
January 20, 2025 07:50 10m 2s inkcherry:autotp_training
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-accelerate-v100 #12788: Pull request #6553 synchronize by gyou2021
January 20, 2025 06:23 Action required gyou2021:configurable_autoTP
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-accelerate-v100 #12787: Pull request #6553 synchronize by gyou2021
January 20, 2025 05:25 Action required gyou2021:configurable_autoTP
nv-accelerate-v100
nv-accelerate-v100 #12786: Scheduled
January 20, 2025 00:07 7m 32s master
nv-accelerate-v100
nv-accelerate-v100 #12785: Scheduled
January 19, 2025 00:07 7m 35s master
nv-accelerate-v100
nv-accelerate-v100 #12784: Merge group checks requested
January 18, 2025 01:21 26m 49s
nv-accelerate-v100
nv-accelerate-v100 #12783: Scheduled
January 18, 2025 00:06 8m 7s master
Add the missing view operations from sequence parallel(async).
nv-accelerate-v100 #12782: Pull request #6750 synchronize by tohtana
January 17, 2025 23:12 11m 42s inkcherry:ds_overlap_fix
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
nv-accelerate-v100 #12781: Pull request #6931 synchronize by loadams
January 17, 2025 22:20 11m 16s loadams/fix-torch-issues
Explicitly use the linalg.vector_norm call in comm/
nv-accelerate-v100 #12780: Pull request #6960 synchronize by loadams
January 17, 2025 22:16 7m 39s loadams/fix-torch-linalg-norm
[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang
nv-accelerate-v100 #12779: Pull request #6942 synchronize by loadams
January 17, 2025 19:18 11m 58s loadams/cpu-runner-debug