
about the training speed #5

Open · winnechan opened this issue Jul 18, 2022 · 1 comment

@winnechan commented Jul 18, 2022

Hi, thanks for your project.

The EMA model is updated on the CPU by iterating over all the parameters of the online model, which keeps GPU utilization low. Does this mean there is no way to speed up the training?
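(For context, a minimal sketch of the kind of per-parameter EMA loop being described; the function name and models are hypothetical, not this project's actual code. The per-step device copy inside the Python loop is what stalls the GPU:)

```python
import torch

@torch.no_grad()
def ema_update(ema_model, online_model, decay=0.999):
    # If the EMA copy lives on the CPU while the online model trains on the
    # GPU, every update forces a GPU -> CPU transfer per parameter.
    for ema_p, online_p in zip(ema_model.parameters(), online_model.parameters()):
        # lerp_: ema_p <- ema_p + (1 - decay) * (online_p - ema_p)
        ema_p.lerp_(online_p.to(ema_p.device), 1.0 - decay)
```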

Thanks

@HiDolen (Contributor) commented Aug 5, 2024

I noticed that when instantiating KarrasEMA or EMA, you can pass the parameter allow_different_devices (which defaults to False). When allow_different_devices is set to True, the parameters of the EMA model will be moved to the same device as the parameters of the trained model; otherwise, they are kept on the CPU. Although it might be too late, I hope it helps.
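(A minimal sketch of what this looks like in practice, based on the parameter named above; the import path, model, and training step are assumptions for illustration:)

```python
import torch
from torch import nn
from ema_pytorch import EMA  # import path assumed

model = nn.Linear(512, 512).cuda()  # stand-in for the online model

# Per the comment above: with allow_different_devices=True the EMA copy is
# kept on the same device as the trained model instead of the CPU.
ema = EMA(model, beta=0.999, allow_different_devices=True)

# one hypothetical training step
x = torch.randn(8, 512, device="cuda")
loss = model(x).pow(2).mean()
loss.backward()
ema.update()  # EMA update now runs on the GPU, no per-step CPU transfer
```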
