[QUESTION] Performance Impact of Using item() in total_num_tokens += num_tokens.item()
in megatron/core/pipeline_parallel/schedules.py
#87
Job | Run time |
---|---|
5s | |
5s |