Actions: microsoft/DeepSpeed

nv-torch-latest-v100

4,911 workflow runs

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-torch-latest-v100 #12952: Pull request #6553 synchronize by gyou2021
January 21, 2025 08:59 Action required gyou2021:configurable_autoTP
Autotp training
nv-torch-latest-v100 #12951: Pull request #6922 synchronize by inkcherry
January 21, 2025 08:36 Action required inkcherry:autotp_training
Fix: forbid repeated deepspeed.initialize on training objects
nv-torch-latest-v100 #12949: Pull request #6874 synchronize by traincheck-team
January 21, 2025 00:57 Action required traincheck-team:fix-6848-forbid-repeated-init
Fix: forbid repeated deepspeed.initialize on training objects
nv-torch-latest-v100 #12948: Pull request #6874 synchronize by tjruwase
January 21, 2025 00:41 Action required traincheck-team:fix-6848-forbid-repeated-init
Tecorigin sdaa accelerator
nv-torch-latest-v100 #12947: Pull request #6903 synchronize by tjruwase
January 21, 2025 00:27 Action required siqi654321:Tecorigin-SDAA-accelerator
nv-torch-latest-v100
nv-torch-latest-v100 #12946: Scheduled
January 21, 2025 00:20 1h 32m 23s master
Precisely track nvme optimizer offload
nv-torch-latest-v100 #12945: Pull request #6963 opened by tjruwase
January 20, 2025 17:00 1h 33m 51s olruwase/ds_4998
Using explicit GPU upcast for ZeRO-Offload
nv-torch-latest-v100 #12944: Pull request #6962 opened by xylian86
January 20, 2025 13:25 1h 32m 1s xylian86:explicit_upcast
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-torch-latest-v100 #12943: Pull request #6553 synchronize by gyou2021
January 20, 2025 10:20 1h 31m 27s gyou2021:configurable_autoTP
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-torch-latest-v100 #12942: Pull request #6553 synchronize by gyou2021
January 20, 2025 10:03 Action required gyou2021:configurable_autoTP
Autotp training
nv-torch-latest-v100 #12941: Pull request #6922 synchronize by inkcherry
January 20, 2025 09:24 1h 39m 45s inkcherry:autotp_training
Autotp training
nv-torch-latest-v100 #12940: Pull request #6922 synchronize by inkcherry
January 20, 2025 07:50 1h 3m 25s inkcherry:autotp_training
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-torch-latest-v100 #12939: Pull request #6553 synchronize by gyou2021
January 20, 2025 06:23 Action required gyou2021:configurable_autoTP
nv-torch-latest-v100
nv-torch-latest-v100 #12937: Scheduled
January 20, 2025 00:21 6h 0m 21s master
nv-torch-latest-v100
nv-torch-latest-v100 #12936: Scheduled
January 19, 2025 00:22 1h 30m 43s master
nv-torch-latest-v100
nv-torch-latest-v100 #12935: Merge group checks requested
January 18, 2025 01:21 1h 59m 46s
nv-torch-latest-v100
nv-torch-latest-v100 #12934: Scheduled
January 18, 2025 00:19 2h 20m 19s master
Add the missing view operations from sequence parallel(async).
nv-torch-latest-v100 #12933: Pull request #6750 synchronize by tohtana
January 17, 2025 23:12 1h 50m 53s inkcherry:ds_overlap_fix
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
nv-torch-latest-v100 #12932: Pull request #6931 synchronize by loadams
January 17, 2025 22:20 1h 31m 36s loadams/fix-torch-issues
Explicitly use the linalg.vector_norm call in comm/
nv-torch-latest-v100 #12931: Pull request #6960 synchronize by loadams
January 17, 2025 22:16 1h 36m 4s loadams/fix-torch-linalg-norm
[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang
nv-torch-latest-v100 #12930: Pull request #6942 synchronize by loadams
January 17, 2025 19:18 1h 48m 19s loadams/cpu-runner-debug