[QUESTION] When will model have _extra_state
?
#1104
Unanswered
1049451037
asked this question in
Q&A
Replies: 2 comments
-
I had faced this error too, recently, and had to write a downstream script to remove this from every layer -- looks like the Mcore format is out of sync with what MegatronLM consumes? |
Beta Was this translation helpful? Give feedback.
0 replies
-
any updates ? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
After updating to the main branch of Megatron-LM recently, I met this error when loading model:
The checkpoint is transformed by the
tools/checkpoint/convert.py
, and loaded bypretrain_gpt.py
.Beta Was this translation helpful? Give feedback.
All reactions