Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Hongbinl/1f1b overlap mirror 0813
#1743 opened Aug 13, 2025 by lhb8125 Loading…
ci: Add build-test-publish wheel workflow
#1742 opened Aug 13, 2025 by ko3n1g Loading…
Megatron Bridge Adaption
#1741 opened Aug 12, 2025 by yaoyu-33 Loading…
Add world_size dict getter method for simple integration with W&B enhancement New feature or request
#1735 opened Aug 9, 2025 by WoosungMyung Loading…
export _move_new_state_to_right_device for offload/load enhancement New feature or request
#1734 opened Aug 8, 2025 by techkang Loading…
fix router input jitter dtype bug Something isn't working
#1726 opened Aug 1, 2025 by chaitanyadwivedi96 Loading…
Add FP8 training scripts enhancement New feature or request module: transformer engine
#1723 opened Jul 31, 2025 by SDcodehub Draft
Typo correction module: documentation
#1717 opened Jul 29, 2025 by Aditya-Shandilya1182 Loading…
Update pretrain_mamba.py bug Something isn't working module: documentation
#1682 opened Jul 11, 2025 by vignesh1507 Loading…
Support 1f1b a2a overlap module: distributed
#1671 opened Jul 7, 2025 by lhb8125 Loading…
moe: remove unused variable scale_up module: moe
#1670 opened Jul 6, 2025 by WineChord Loading…
Update README.md module: documentation
#1660 opened Jul 2, 2025 by 21jun Loading…
ProTip! Follow long discussions with comments:>50.