Skip to content

Actions: NVIDIA/Megatron-LM

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
506 workflow runs
506 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #11: Scheduled
August 5, 2025 02:12 54s main
August 5, 2025 02:12 54s
Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #10: Scheduled
August 4, 2025 02:13 45s main
August 4, 2025 02:13 45s
Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #9: Scheduled
August 3, 2025 02:12 52s main
August 3, 2025 02:12 52s
Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #8: Scheduled
August 2, 2025 02:12 43s main
August 2, 2025 02:12 43s
Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #7: Scheduled
August 1, 2025 02:12 47s main
August 1, 2025 02:12 47s
[QUESTION]NCCL timeout error when the second iteration
Community Bot #103: Issue #1141 reopened by sbhavani
July 31, 2025 06:02 11s
July 31, 2025 06:02 11s
[BUG] Learning rate not overrided when set --override-opt_param-scheduler
Community Bot #101: Issue #1138 reopened by sbhavani
July 31, 2025 04:35 12s
July 31, 2025 04:35 12s
[QUESTION] Epochs Larger Than 1 When Specified with Trained Samples
Community Bot #100: Issue #1127 reopened by sbhavani
July 31, 2025 04:35 13s
July 31, 2025 04:35 13s
Benchmarking DeepEP Guide
Community Bot #99: Issue #1721 opened by sbhavani
July 31, 2025 03:59 11s
July 31, 2025 03:59 11s
Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #6: Scheduled
July 31, 2025 02:12 50s main
July 31, 2025 02:12 50s
How to split the num_layers unevenly when using pipeline parallelism?
Community Bot #97: Issue comment #381 (comment) created by edgeinfinity1
July 30, 2025 14:54 16s
July 30, 2025 14:54 16s
[QUESTION] save_checkpoint with expert_tensor_parallel_size
Community Bot #96: Issue #1719 edited by jeromeku
July 30, 2025 14:15 10s
July 30, 2025 14:15 10s
[QUESTION] save_checkpoint with expert_tensor_parallel_size
Community Bot #95: Issue #1719 opened by jeromeku
July 30, 2025 14:14 12s
July 30, 2025 14:14 12s
v0.12.2 pretrain failed, 0.14.0rc2 model conversion failed
Community Bot #94: Issue #1718 edited by justalittlenoob
July 30, 2025 09:32 12s
July 30, 2025 09:32 12s
v0.12.2 pretrain failed, 0.14.0rc2 model conversion failed
Community Bot #93: Issue #1718 opened by justalittlenoob
July 30, 2025 08:10 10s
July 30, 2025 08:10 10s
[QUESTION]How to calculate MFU based on the flops?
Community Bot #92: Issue #1565 closed by Lynnzake
July 30, 2025 06:44 13s
July 30, 2025 06:44 13s
July 30, 2025 06:22 11s
July 30, 2025 02:26 9s
July 30, 2025 02:26 15s
Stale-Close-Inactive-Issues-PRs
Stale-Close-Inactive-Issues-PRs #5: Scheduled
July 30, 2025 02:12 53s main
July 30, 2025 02:12 53s
Issue 1672 fix: initializing the current pointed with int64 to avoid …
Community Bot #88: Issue comment #1673 (comment) created by sharanmayank
July 29, 2025 23:33 11s
July 29, 2025 23:33 11s