AssertionError: only support equal chunk. Got size of DataProto 2 and chunk 4. #47

OKC13 · 2025-02-07T08:44:46Z

/verl/protocol.py", line 491, in chunk

请问是该错误是为何

yangDDDD · 2025-02-11T08:13:40Z

same question

OKC13 · 2025-02-11T08:15:50Z

same question

batch_size需要是n_gpus_per_node的整数倍

juneodie · 2025-02-11T08:40:57Z

same question

batch_size需要是n_gpus_per_node的整数倍

I tried to run this with llama 8b model in 8*H100s
With batch size 8 it failed with OOM
I want batch_size smaller than n_gpus_per_node because of memory
Is there other way?

DolbyUUU · 2025-02-19T02:43:37Z

same question

batch_size需要是n_gpus_per_node的整数倍

I tried to run this with llama 8b model in 8*H100s With batch size 8 it failed with OOM I want batch_size smaller than n_gpus_per_node because of memory Is there other way?

Co-ask.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AssertionError: only support equal chunk. Got size of DataProto 2 and chunk 4. #47

AssertionError: only support equal chunk. Got size of DataProto 2 and chunk 4. #47

OKC13 commented Feb 7, 2025

yangDDDD commented Feb 11, 2025

OKC13 commented Feb 11, 2025

juneodie commented Feb 11, 2025

DolbyUUU commented Feb 19, 2025

AssertionError: only support equal chunk. Got size of DataProto 2 and chunk 4. #47

AssertionError: only support equal chunk. Got size of DataProto 2 and chunk 4. #47

Comments

OKC13 commented Feb 7, 2025

yangDDDD commented Feb 11, 2025

OKC13 commented Feb 11, 2025

juneodie commented Feb 11, 2025

DolbyUUU commented Feb 19, 2025