[QUESTION] How to convert a Llama 3.1 70B checkpoint from torch dcp format to the HuggingFace format #1245
Unanswered
kaiyama12345679 asked this question in Q&A
Replies: 3 comments
-
@kaiyama12345679 Hi, did you find a solution in the end? I'm running into the same issue.
-
@kaiyama12345679 @zhangyilalala Hi, I'm encountering the same issue. Have you solved it?
-
Try using NeMo.
-
I pre-trained Llama-3.1 70B with Megatron-LM (tensor parallel size 8, pipeline parallel size 2, with the distributed optimizer), and I have confirmed that the checkpoint is saved in the torch dcp format.
However, I am not sure how to convert this checkpoint into a format that can be uploaded to HuggingFace. If anyone knows how to do this, I would greatly appreciate the help.
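
For reference, one possible first step is to merge the sharded torch dcp directory into a single `torch.save` file with PyTorch's `dcp_to_torch_save` utility from `torch.distributed.checkpoint.format_utils`. The sketch below is only that step, under assumptions: `consolidate_dcp_checkpoint` is a hypothetical helper name, it assumes a recent PyTorch release that ships `format_utils`, and it has not been verified against everything Megatron-LM stores in its checkpoints. It loads all shards in one process, so a 70B checkpoint (especially with optimizer state) would need a lot of host memory. It does not rename parameters or reshape any weights into the layout HuggingFace expects; that mapping would still be a separate step.

```python
from pathlib import Path


def consolidate_dcp_checkpoint(dcp_dir: str, out_path: str) -> Path:
    """Merge a torch DCP checkpoint directory into one torch.save file.

    Hypothetical helper for illustration; not part of Megatron-LM or PyTorch.
    """
    src = Path(dcp_dir)
    # DCP's filesystem writer stores a `.metadata` file next to the shard files,
    # so its absence is a cheap sign that this is not a DCP checkpoint directory.
    if not (src / ".metadata").is_file():
        raise FileNotFoundError(f"{src} does not look like a torch DCP checkpoint")

    # Imported lazily so the path check above works even without PyTorch installed.
    from torch.distributed.checkpoint.format_utils import dcp_to_torch_save

    dst = Path(out_path)
    dst.parent.mkdir(parents=True, exist_ok=True)
    # Reads every shard into host memory in a single process, then writes one file.
    dcp_to_torch_save(str(src), str(dst))
    return dst
```

A call might look like `consolidate_dcp_checkpoint("checkpoints/iter_0001000", "consolidated/model.pt")`, where both paths are placeholders. The resulting file can be opened with `torch.load` to inspect the parameter names before writing any Megatron-to-HuggingFace key mapping.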