README.md multi-GPU training instructions add bold emphasis to how numGPUs influences batch size #268


Open
wants to merge 1 commit into develop

Conversation

mkrupczak3

I found this description to be incredibly subtle; I ran a lot of training runs with the wrong batch size before I saw it.

Description

Simple change to README.md to add bold emphasis under Multi-GPU training instructions:

**Note that your effective batch size is multiplied by the number of GPUs**, so you may need to adjust your batch_size and grad_accum_steps to maintain the same overall batch size!

No dependencies, as it is just a documentation update.
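
For context, here is a minimal sketch of the arithmetic behind the bolded note, assuming a standard data-parallel setup where each GPU processes its own micro-batch; the `effective_batch_size` helper is hypothetical, though batch_size and grad_accum_steps mirror the README's parameter names:

```python
# Minimal sketch of the effective-batch-size relationship (hypothetical
# helper; batch_size and grad_accum_steps mirror the README's parameters).

def effective_batch_size(batch_size: int, grad_accum_steps: int, num_gpus: int) -> int:
    """Samples contributing to each optimizer step: every GPU processes
    batch_size samples per forward pass, gradients are accumulated over
    grad_accum_steps passes, then averaged across all GPUs."""
    return batch_size * grad_accum_steps * num_gpus

# Single GPU, batch_size=32, grad_accum_steps=4 -> effective batch size 128.
assert effective_batch_size(32, 4, 1) == 128

# The same config on 4 GPUs quadruples the effective batch size to 512...
assert effective_batch_size(32, 4, 4) == 512

# ...so dividing grad_accum_steps by the GPU count restores 128 overall.
assert effective_batch_size(32, 1, 4) == 128
```

This is exactly the trap described above: moving an unchanged config from one GPU to many silently scales the effective batch size by the GPU count.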

Type of change


  • This change requires a documentation update

How has this change been tested? Please provide a test case or example of how you tested the change.

With my eyes reading the README.md

Any specific deployment considerations

It may be worth emphasizing this elsewhere in the documentation so that others don't make the same mistake I did.

Docs

  • Docs updated? What were the changes:
    Just a simple change to README.md

@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.
