Releases: NVIDIA/Megatron-LM

NVIDIA Megatron Core 0.13.1

12 Aug 18:33
Merge branch 'cherry-pick-f36e1705' into 'core_r0.13.0'

Cherry-pick 'Use ruff linter (3627)' into 'core_r0.13.0'

See merge request ADLR/megatron-lm!3793

NVIDIA Megatron Core 0.14.0rc5

11 Aug 04:12

Prerelease: NVIDIA Megatron Core 0.14.0rc5 (2025-08-11)

NVIDIA Megatron Core 0.12.3

12 Aug 18:12
Merge branch 'chtruong/cherry-pick-3627' into 'core_r0.12.0'

Cherry-pick 'use yaml safe load (3627)' into 'core_r0.12.0'

See merge request ADLR/megatron-lm!3795

NVIDIA Megatron Core 0.14.0rc4

04 Aug 04:12

Prerelease: NVIDIA Megatron Core 0.14.0rc4 (2025-08-04)

NVIDIA Megatron Core 0.14.0rc3

28 Jul 04:13

Prerelease: NVIDIA Megatron Core 0.14.0rc3 (2025-07-28)

NVIDIA Megatron Core 0.13.0

25 Jul 18:04
  • Support bf16 dtype for optimizer states to use the precision-aware optimizer in TransformerEngine
  • MoE
    • Features:
      • Flexible Asymmetric Virtual Pipeline Parallelism with Custom Pipeline Layout (--pipeline-model-parallel-layout)
      • Add support for passing custom parallelism groups to MoE modules.
      • Add Hybrid Shard Data-Parallel support for MoE models (--num-distributed-optimizer-instances)
      • Support EP + custom FSDP training for DeepSeek-V3
      • FP8 support for Multi-Token-Prediction
    • Memory Optimization
      • Fine-grained recomputation to reduce activation memory. (--recompute-modules with --recompute-granularity selective)
      • Memory-efficient token permutation by moving the probs multiplication from unpermutation into the activation function of GroupedMLP.
    • Performance Optimization
      • MLA RoPE fusion kernel and YARN embedding cache.
      • FP8 padding optimization of MoE models by padding the routing map.
    • Bug fixes:
      • Fix the aux loss calculation when expert_bias or group-limited routing is used. This changes load_balancing_loss values compared to the previous version.
      • Fix packed sequence support for MLA
    • Known Issues:
      • MTP is not compatible with the flexible pipeline layout; this will be fixed in !3594.
      • MTP has a convergence issue with TP2; this will be fixed in !3594.
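
A hypothetical launch fragment combining the new 0.13.0 flags listed above. The flag names come from these release notes, but the layout string, module names, and values below are illustrative assumptions, not recommendations from the release:

```shell
# Hypothetical example: only the flag names are from the 0.13.0 notes;
# all values are placeholders and would need tuning for a real run.
PP_LAYOUT="..."   # placeholder: custom (possibly asymmetric) pipeline stage layout string

torchrun --nproc_per_node=8 pretrain_gpt.py \
    --pipeline-model-parallel-layout "${PP_LAYOUT}" \
    --recompute-granularity selective \
    --recompute-modules moe_act layernorm \
    --num-distributed-optimizer-instances 2
```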

NVIDIA Megatron Core 0.14.0rc2

21 Jul 04:12

Prerelease: NVIDIA Megatron Core 0.14.0rc2 (2025-07-21)

NVIDIA Megatron Core 0.13.0rc4

22 Jul 08:03

Prerelease: NVIDIA Megatron Core 0.13.0rc4 (2025-07-22)

NVIDIA Megatron Core 0.13.0rc3

17 Jul 15:04
9b9ea83

Prerelease: NVIDIA Megatron Core 0.13.0rc3 (2025-07-17)

NVIDIA Megatron Core 0.14.0rc1

14 Jul 04:12

Prerelease: NVIDIA Megatron Core 0.14.0rc1 (2025-07-14)