Updates evals to run with ddp=8 for small models #428

edbeeching · 2025-02-25T14:51:29Z

Currently, the logic for calculating num_gpus assumes evals run in the tensor-parallel (TP) setting; for the Qwen 7B models this returns 4. Smaller models, however, can be evaluated with DDP, so for those we fix num_gpus at 8.
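For reviewers, the branching described above can be sketched roughly as follows. This is a minimal illustration, not the code changed in this PR: `EvalPlan`, `plan_eval`, `is_small_model`, and `tp_num_gpus` are hypothetical names, and how a model is classified as "small" is left to the caller because the description does not specify it.

```python
from dataclasses import dataclass

# Number of GPUs used for data-parallel evals, per "ddp=8" in the PR title.
DDP_NUM_GPUS = 8


@dataclass(frozen=True)
class EvalPlan:
    """Hypothetical container for the chosen parallelism mode and GPU count."""
    mode: str      # "ddp" or "tp"
    num_gpus: int


def plan_eval(is_small_model: bool, tp_num_gpus: int) -> EvalPlan:
    """Pick the eval parallelism.

    is_small_model: caller-supplied flag (assumption: the real criterion
        for "small" is not stated in the PR description).
    tp_num_gpus: GPU count produced by the existing TP calculation,
        e.g. 4 for the Qwen 7B models.
    """
    if is_small_model:
        # Small models fit on one GPU, so replicate across all 8 with DDP.
        return EvalPlan(mode="ddp", num_gpus=DDP_NUM_GPUS)
    # Larger models keep the tensor-parallel GPU count.
    return EvalPlan(mode="tp", num_gpus=tp_num_gpus)
```

With this sketch, `plan_eval(True, 4)` selects DDP on 8 GPUs, while `plan_eval(False, 4)` keeps the TP count of 4.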

Updates evals to run with ddp=8 for small models

8a50454

edbeeching requested a review from plaguss February 25, 2025 14:51

lewtun approved these changes Feb 25, 2025

plaguss approved these changes Feb 25, 2025

edbeeching merged commit 11beb9a into main Feb 25, 2025
1 check passed

edbeeching deleted the eval-ddp-8 branch February 25, 2025 15:00