[QUESTION] To convert a Llama 3.1 70B checkpoint in torch dcp format to the HuggingFace format, #1245

kaiyama12345679 · 2024-10-19T05:16:45Z

I've pre-trained Llama-3.1 70B using Megatron-LM (tensor parallel size 8, pipeline parallel size 2, with the distributed optimizer), and I have confirmed that the checkpoint is saved in the torch dcp format.
However, I am not sure how to convert this checkpoint into a format that can be uploaded to HuggingFace. If anyone knows how to do this, I would greatly appreciate your help.

zhangyilalala · 2024-11-19T09:21:17Z

@kaiyama12345679 Hi~ Did you find a solution later? I'm encountering the same issue.

leondada · 2025-03-27T15:25:04Z

@kaiyama12345679 @zhangyilalala Hi, I'm encountering the same issue. Have you solved it?

gaojingwei · 2025-05-19T06:59:07Z

Try using NeMo.
