-
Notifications
You must be signed in to change notification settings - Fork 3k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix several typos in megatron/core/transformer/multi_token_prediction.py
#1744
opened Aug 13, 2025 by
andy-yangz
Loading…
Add world_size dict getter method for simple integration with W&B
enhancement
New feature or request
#1735
opened Aug 9, 2025 by
WoosungMyung
Loading…
export _move_new_state_to_right_device for offload/load
enhancement
New feature or request
#1734
opened Aug 8, 2025 by
techkang
Loading…
Megatron-LM changes to make Hyena/Evo 2 inference usable, especially for 40B models
enhancement
New feature or request
#1727
opened Aug 1, 2025 by
antonvnv
Loading…
fix router input jitter dtype
bug
Something isn't working
#1726
opened Aug 1, 2025 by
chaitanyadwivedi96
Loading…
Fix a typo on README git checkout
module: documentation
#1705
opened Jul 24, 2025 by
GindaChen
Loading…
BugFix: FP8 Communication Mismatch with --first-last-layers-bf16 in tp-comm-overlap
bug
Something isn't working
module: transformer engine
#1703
opened Jul 24, 2025 by
xiaomin-D
Loading…
Align import to existing module
module: data pipeline
#1692
opened Jul 15, 2025 by
AlexanderLavelle
Loading…
fix(mtp logging): Correctly accumulate MTP loss for logging when log_interval > 1
module: moe
#1684
opened Jul 11, 2025 by
Luowaterbi
Loading…
Update pretrain_mamba.py
bug
Something isn't working
module: documentation
#1682
opened Jul 11, 2025 by
vignesh1507
Loading…
[feat, moe] Add support for global aux loss
module: moe
#1681
opened Jul 11, 2025 by
Victarry
Loading…
Issue 1672 fix: initializing the current pointed with int64 to avoid …
bug
Something isn't working
community-request
#1673
opened Jul 7, 2025 by
sharanmayank
Loading…
Speed up model parallel initialization
module: distributed
#1662
opened Jul 2, 2025 by
alexqdh
Loading…
bug fixed: wandb artifact requires the tracker file
module: debugging
#1654
opened Jun 27, 2025 by
yezhengmao1
Loading…
Apply roll operation to position_ids in MTP
module: moe
#1651
opened Jun 26, 2025 by
iansheng
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.