vllm grpo trainer inputs_ids bug #134

Open
tcy6 opened this issue Feb 24, 2025 · 0 comments

tcy6 commented Feb 24, 2025

Setting: GPUs 0-5 are used for training, and GPU 6 hosts the vLLM deployment.
When I try to print the size of input_ids in the _prepare_inputs function:

prompt_inputs = self.processing_class(
    # prompts_text, return_tensors="pt", padding=True, padding_side="left", add_special_tokens=False
    text=prompts_text,
    images=images,
    return_tensors="pt",
    padding=True,
    padding_side="left",
    add_special_tokens=False,
)
prompt_ids, prompt_mask = (
    prompt_inputs["input_ids"].to(device),
    prompt_inputs["attention_mask"].to(device),
)
print(f"{prompt_inputs['input_ids'].size()}")  # debug print added to inspect the per-rank prompt length
if self.max_prompt_length is not None:
    prompt_ids = prompt_ids[:, -self.max_prompt_length :]
    prompt_mask = prompt_mask[:, -self.max_prompt_length :]

I find that the printed results always look like this:

torch.Size([1, 1598])
torch.Size([1, 1598])
torch.Size([1, 1598])
torch.Size([1, 1598])
torch.Size([1, 1598])
torch.Size([1, 1614])

In other words, there is always one GPU whose input_ids size differs from the other GPUs. I am still looking for the cause, but this looks like a bug.
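
For reference, one way to make the mismatch visible in a single place is the sketch below. It is hypothetical, not part of the trainer: it assumes the training ranks share an initialized torch.distributed process group (which Accelerate normally sets up) and that it is placed right after prompt_ids is built in _prepare_inputs, so that prompt_ids refers to the tensor from the snippet above.

import torch.distributed as dist

# Sketch: gather each training rank's prompt length and print them from rank 0.
local_len = prompt_ids.shape[1]
if dist.is_available() and dist.is_initialized():
    lens = [None] * dist.get_world_size()
    dist.all_gather_object(lens, local_len)  # collective call: every rank must reach this line
    if dist.get_rank() == 0:
        print(f"per-rank prompt lengths: {lens}")
else:
    print(f"single-process prompt length: {local_len}")

Printing all lengths together like this would also distinguish a genuine cross-rank size mismatch from prints that merely come from different steps on the same rank.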
