Pull requests: microsoft/DeepSpeed

Pull requests list

Update Gaudi2 jobs to latest 1.19 build
#6905 opened Dec 23, 2024 by raza-sikander
Tecorigin sdaa accelerator
#6903 opened Dec 23, 2024 by siqi654321
Fix checkpointable_layers Logic
#6881 opened Dec 17, 2024 by Quentin-Anthony
Fix error caused by all_reduce call in domino
#6880 opened Dec 16, 2024 by hwchen2017
Use ds-specific module id to avoid conflicts
#6847 opened Dec 10, 2024 by tjruwase
Cleanup ops/transformer/inference tests
#6830 opened Dec 6, 2024 by loadams
Support pure meta model lm_head tp
#6812 opened Dec 2, 2024 by Yejing-Lai
Stage3: Use new torch grad accumulation hooks API
#6773 opened Nov 21, 2024 by deepcharm
Check transformers version in BLOOM for inference v1
#6766 opened Nov 19, 2024 by lekurile
BLOOM fixes for DS Legacy Inference
#6765 opened Nov 19, 2024 by lekurile Draft
Fix building on Windows with presence of Triton
#6749 opened Nov 14, 2024 by woct0rdho
Update flake8 version
#6740 opened Nov 11, 2024 by loadams
Support latest transformers with DSChat
#6711 opened Nov 4, 2024 by loadams
Update MII tests to support transformers latest
#6686 opened Oct 29, 2024 by loadams
modify_load_save_model
#6626 opened Oct 15, 2024 by ssklzx
Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft