sample params difference between vLLM and Huggingface transformers #539
Unanswered
zhaoyang-star
asked this question in Q&A
Replies: 2 comments
-
Did you figure it out?
-
Here are some merged pull requests focusing on the alignment of sampling parameters and results: #753, #1424, #1577, and #1885.
-
Evaluation is very important in a production environment. I tested the code-completion ability of starcoder-15b on HumanEval, using vLLM and HF transformers as two different backends.
I found it hard to get the same pass@1 from the two backends. How can I get the same pass@1 with vLLM as I get with HF transformers?
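For reference, pass@1 on HumanEval is usually computed with the unbiased pass@k estimator from the HumanEval paper (generate n samples per task, count the c that pass the unit tests). A minimal sketch, with a function name of my own choosing:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator from the HumanEval paper.

    n: total samples generated for a task
    c: number of those samples that pass the unit tests
    Formula: pass@k = 1 - C(n - c, k) / C(n, k)
    """
    if n - c < k:
        # Fewer than k failing samples: every size-k draw contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# With a single greedy sample per task (n = 1), pass@1 reduces to the
# plain fraction of tasks whose one sample passes.
score = pass_at_k(10, 3, 1)  # roughly 0.3 for one task with 3/10 passing
```

Small decoding differences between backends shift which samples pass, which is why pass@1 is sensitive to exact sampling-parameter alignment.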
There may be two reasons for the discrepancy, and the first is probably the main one.
For the first, I compared the main parameters as follows, using the same sampling params in both vLLM and HF.
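One common source of mismatch is that the two libraries name and default their sampling knobs differently (e.g. HF's `max_new_tokens` and `do_sample` vs vLLM's `max_tokens`, and `top_k=0` disables top-k in HF while vLLM uses `top_k=-1`). A hypothetical helper sketching the mapping; the defaults shown are my assumptions and should be verified against the library versions you actually run:

```python
# Sketch: translate HF transformers generate() kwargs into kwargs suitable
# for vllm.SamplingParams. Defaults below are assumptions; check
# transformers' GenerationConfig and vLLM's SamplingParams docs.

def hf_to_vllm_sampling(hf_kwargs: dict) -> dict:
    out = {
        "temperature": hf_kwargs.get("temperature", 1.0),
        "top_p": hf_kwargs.get("top_p", 1.0),
        # HF commonly defaults top_k to 50 and uses 0 to disable it;
        # vLLM disables top-k with -1.
        "top_k": hf_kwargs.get("top_k", 50) or -1,
        "max_tokens": hf_kwargs.get("max_new_tokens", 16),
    }
    # HF greedy decoding (do_sample=False) corresponds to temperature=0
    # in vLLM, which selects the argmax token at every step.
    if not hf_kwargs.get("do_sample", False):
        out["temperature"] = 0.0
    return out

params = hf_to_vllm_sampling(
    {"do_sample": True, "temperature": 0.2, "top_p": 0.95,
     "max_new_tokens": 512}
)
```

Comparing the two backends with greedy decoding first (HF `do_sample=False` vs vLLM `temperature=0`) removes sampling randomness and isolates any remaining numerical differences.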