Pull requests: microsoft/DeepSpeed

Pull requests list

Disable nvtx decorator to avoid graph break
#5697 opened Jun 25, 2024 by tohtana
sequence parallel with communication overlap
#5691 opened Jun 21, 2024 by inkcherry
ENV var added for recaching in INF Unit tests
#5688 opened Jun 20, 2024 by raza-sikander
Add and Remove ZeRO 3 Hooks
#5658 opened Jun 13, 2024 by jomayeri
Unpin transformers version
#5650 opened Jun 12, 2024 by loadams
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
fix: quantization with DeepSpeed HE
#5624 opened Jun 6, 2024 by Atry
Add support for Phi-3 small to FastGen
#5614 opened Jun 4, 2024 by adk9 Draft
[INF] Enable torch compile for inference
#5612 opened Jun 4, 2024 by oelayan7
Upgrade HPU image to v1.16.2.
#5610 opened Jun 4, 2024 by vshekhawat-hlab
Update profiler.py
#5584 opened May 29, 2024 by gameofdimension
reduce cpu host overhead when using moe
#5578 opened May 29, 2024 by ranzhejiang
Reuse KV cache of prefixes
#5572 opened May 27, 2024 by tohtana Draft
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9
Add chatglm2 & chatglm3 autotp
#5540 opened May 16, 2024 by Yejing-Lai