[QUESTION] save_checkpoint with expert_tensor_parallel_size #1719

@jeromeku

Description

Your question
How do I save a sharded checkpoint when using MoE parallel folding -- specifically, when the expert tensor parallel size (ETP) differs from the tensor parallel size (TP)? save_checkpoint appears to support only PP, TP, and EP (as in expert_model_parallel, not expert_tensor_parallel).
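To illustrate why this matters, here is a minimal sketch (hypothetical helper functions, not the Megatron-Core API) of how parallel folding splits the rank space: dense (attention/MLP) weights are sharded over the TP dimension, while expert weights fold the same ranks into (EP, ETP) coordinates. When ETP != TP, a checkpoint layout keyed only on (PP, TP, EP) cannot describe where the expert shards live.

```python
def dense_shard(rank: int, tp: int) -> int:
    """Dense (attention/MLP) weights: sharded over the TP dimension."""
    return rank % tp

def expert_shard(rank: int, etp: int, ep: int) -> tuple:
    """Expert weights: the rank folds into (expert-parallel, expert-TP) coords."""
    etp_rank = rank % etp
    ep_rank = (rank // etp) % ep
    return ep_rank, etp_rank

# Assumed sizes for illustration; ETP != TP is the case in question.
world_size = 8
tp, etp, ep = 4, 2, 2

for rank in range(world_size):
    d = dense_shard(rank, tp)
    e = expert_shard(rank, etp, ep)
    print(f"rank {rank}: dense TP shard {d}, expert (EP, ETP) shard {e}")
```

Note that ranks 0 and 1 hold different dense TP shards but, under folding, rank 2 already repeats expert ETP shard 0 in a different EP group; the two shardings do not line up rank-for-rank, which is exactly what a (PP, TP, EP)-only checkpoint key cannot express.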
