Actions: NVIDIA/Megatron-LM
Actions
Showing runs from all workflows
506 workflow runs
506 workflow runs
--override-opt_param-scheduler
Community Bot
#101:
Issue #1138
reopened
by
sbhavani
save_checkpoint
with expert_tensor_parallel_size
Community Bot
#96:
Issue #1719
edited
by
jeromeku
save_checkpoint
with expert_tensor_parallel_size
Community Bot
#95:
Issue #1719
opened
by
jeromeku
total_num_tokens += num_tokens.item()
in megatron/core/pipeline_parallel/schedules.py
Community Bot
#87:
Issue #1403
closed
by
sbhavani