How does enable_cpu_offload work? #10703

CaledoniaProject · 2025-02-02T11:03:33Z

Does anyone know how enable_cpu_offload works? I mean, what's the strategy on memory usage if it's enabled?

asomoza · 2025-02-03T14:10:35Z

Maintainer

Hi, it just offloads the models that aren't in use to RAM. It's a very basic memory optimization that keeps only the model or models used at the current inference step in VRAM.

There isn't a memory usage strategy: this happens regardless of how much VRAM you have, whether that's 16GB or 80GB.
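
To make the behavior described above concrete, here is a toy sketch in plain Python. It is not the library's implementation: the class `ToyOffloadPipeline`, its methods, and the component names (`text_encoder`, `unet`, `vae`) are hypothetical and chosen only for illustration, and the "devices" are just strings. The point it tries to show is that the swap is unconditional: nothing in it looks at how much VRAM is available.

```python
class ToyOffloadPipeline:
    """Toy model of 'keep only the currently used component in VRAM'.

    Hypothetical illustration only; devices are plain strings and no
    real tensors are moved.
    """

    def __init__(self, component_names):
        # Every component starts in CPU RAM.
        self.device = {name: "cpu" for name in component_names}
        self.log = []

    def _activate(self, name):
        # Send any other component that is on the GPU back to RAM.
        # Note there is no VRAM-size check: this always happens.
        for other, dev in self.device.items():
            if other != name and dev == "cuda":
                self.device[other] = "cpu"
                self.log.append(f"{other} -> cpu")
        # Bring the requested component onto the GPU if it is not there yet.
        if self.device[name] != "cuda":
            self.device[name] = "cuda"
            self.log.append(f"{name} -> cuda")

    def run(self, steps):
        for name in steps:
            self._activate(name)
            # A real pipeline would run this component's forward pass here.
        return self.on_gpu()

    def on_gpu(self):
        return [n for n, d in self.device.items() if d == "cuda"]


pipe = ToyOffloadPipeline(["text_encoder", "unet", "vae"])
pipe.run(["text_encoder", "unet", "unet", "vae"])
print(pipe.on_gpu())  # ['vae']
print(pipe.log)
```

In this sketch only one component is ever on the GPU at a time, and repeated use of the same component (the two consecutive `unet` steps) triggers no extra transfer.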
