
Is vllm==0.8.3 causing some incompatibility problems? #602


Open
roaminwind opened this issue Apr 15, 2025 · 12 comments

@roaminwind

First I had vllm==0.8.3 and lighteval==0.8.1dev,
but ran into an AttributeError.

Then I followed some suggestions, such as checking out the git repository at a certain version, and that error did disappear.
However, a new error appeared: `VLLMModelConfig.__init__() got an unexpected keyword argument 'max_num_batched_tokens'`

Then I remembered that when I first cloned the project, the vllm version was 0.7.2.

I pip installed vllm==0.7.2, but more problems arose.

:<

@qianfantianyuzhouzhou

vllm==0.7.1 works for me.
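
If anyone wants to try that, a minimal downgrade sketch (this assumes the rest of your environment already works with the older lighteval):

```bash
# Sketch: pin vllm to 0.7.1, which reportedly works with the older lighteval.
pip install vllm==0.7.1
```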

@roaminwind
Author

roaminwind commented Apr 15, 2025 via email

@StarLooo

Same question here: `VLLMModelConfig.__init__() got an unexpected keyword argument 'max_num_batched_tokens'`

@lewtun
Member

lewtun commented Apr 16, 2025

Hi everyone, yes you need to use the pinned version of lighteval to work with vllm=0.8.3 because this PR has breaking changes.

There's a separate issue with DP>1 that is being tracked here: huggingface/lighteval#670
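
For anyone unsure what "the pinned version" means in practice, a minimal sketch of installing lighteval from source; the exact commit/branch to pin is an assumption here, so check the open-r1 dependency pins for the authoritative one:

```bash
# Sketch only: install lighteval from source so it matches vllm==0.8.3.
# The ref to pin is an assumption; consult open-r1's dependency pins for the real one.
pip install vllm==0.8.3
pip install "lighteval @ git+https://github.com/huggingface/lighteval.git"
```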

@StarLooo

> Hi everyone, yes you need to use the pinned version of lighteval to work with vllm=0.8.3 because this PR has breaking changes.
>
> There's a separate issue with DP>1 that is being tracked here: huggingface/lighteval#670

Thanks!
So, which version of lighteval should we use to solve this problem?

@StarLooo

StarLooo commented Apr 17, 2025

> Hi everyone, yes you need to use the pinned version of lighteval to work with vllm=0.8.3 because this PR has breaking changes.
> There's a separate issue with DP>1 that is being tracked here: huggingface/lighteval#670
>
> Thanks! So, which version of lighteval should we use to solve this problem?

Well, I tried cloning the latest version of the lighteval repository and installing it from source.
After modifying the MODEL_ARGS from:
`MODEL_ARGS="pretrained=$MODEL,dtype=bfloat16,max_model_length=32768,max_num_batched_tokens=32768,gpu_memory_utilization=0.8,generation_parameters={max_new_tokens:32768,temperature:0.6,top_p:0.95}"`
to:
`MODEL_ARGS="model_name=$MODEL,dtype=bfloat16,data_parallel_size=$NUM_GPUS,max_model_length=32768,max_num_batched_tokens=32768,gpu_memory_utilization=0.8,generation_parameters={max_new_tokens:16384,temperature:0.6,top_p:0.95}"`
I could roughly reproduce the results of deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B and deepseek-ai/DeepSeek-R1-Distill-Qwen-7B using lighteval.

| Model | MATH-500 (extractive_match) | AIME24 (extractive_match) | AIME24 (math_pass@1:32_samples) |
| --- | --- | --- | --- |
| DeepSeek-R1-Distill-Qwen-1.5B | 0.848 | 0.300 | 0.300 |
| DeepSeek-R1-Distill-Qwen-7B | 0.934 | 0.633 | 0.496 |

DeepSeek-R1-Distill-Qwen-7B's extractive_match score on AIME24 seems too high? If anyone gets similar or different results, we can share them and discuss further.
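
For context, here is roughly how such MODEL_ARGS get passed to lighteval's vllm backend (a sketch: the model, GPU count, task spec, and output directory below are placeholders, not the exact command I used):

```bash
# Sketch of a lighteval vllm invocation with the updated MODEL_ARGS.
# MODEL, NUM_GPUS, the task spec, and the output dir are placeholders/assumptions.
MODEL=deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
NUM_GPUS=8
MODEL_ARGS="model_name=$MODEL,dtype=bfloat16,data_parallel_size=$NUM_GPUS,max_model_length=32768,max_num_batched_tokens=32768,gpu_memory_utilization=0.8,generation_parameters={max_new_tokens:16384,temperature:0.6,top_p:0.95}"
lighteval vllm "$MODEL_ARGS" "lighteval|aime24|0|0" \
    --use-chat-template \
    --output-dir ./evals
```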

@JoeyXuquant11

@lewtun please tell me which version of lighteval is compatible, I am also struggling with that.

@StarLooo

> @lewtun please tell me which version of lighteval is compatible, I am also struggling with that.

Hi, you can follow what I mentioned before (a sketch follows this list):

  1. git clone the latest version of the lighteval repository and install it from source.
  2. Modify the MODEL_ARGS:
     2.1 use 'model_name' instead of 'pretrained'
     2.2 reduce max_new_tokens, since the original 32k is too large
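
A minimal sketch of those two steps (the paths are assumptions, and the MODEL_ARGS values just mirror my earlier snippet):

```bash
# Step 1: install lighteval from source.
git clone https://github.com/huggingface/lighteval.git
cd lighteval
pip install -e .

# Step 2: use model_name instead of pretrained, and a smaller max_new_tokens.
MODEL_ARGS="model_name=$MODEL,dtype=bfloat16,data_parallel_size=$NUM_GPUS,max_model_length=32768,max_num_batched_tokens=32768,gpu_memory_utilization=0.8,generation_parameters={max_new_tokens:16384,temperature:0.6,top_p:0.95}"
```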

@StarLooo

StarLooo commented Apr 21, 2025

> Hi everyone, yes you need to use the pinned version of lighteval to work with vllm=0.8.3 because this PR has breaking changes.
> There's a separate issue with DP>1 that is being tracked here: huggingface/lighteval#670
>
> Thanks! So, which version of lighteval should we use to solve this problem?
>
> Well, I tried cloning the latest version of the lighteval repository and installing it from source. [...]

> DeepSeek-R1-Distill-Qwen-7B's extractive_match score on AIME24 seems too high? If anyone gets similar or different results, we can share them and discuss further.

After increasing max_new_tokens from 16k to 28k, I get higher performance on the 7B model but slightly lower performance on the 1.5B model.
Besides, I think extractive_match on AIME24 is very sensitive, since AIME24 only contains 30 samples. math_pass@1:32_samples may be a better indicator of model performance on this dataset.

@StarLooo

Do you encounter the same problem mentioned in #463 when using max_new_tokens:32768?

@Nativu5

Nativu5 commented May 19, 2025

> Do you encounter the same problem mentioned in #463 when using max_new_tokens:32768?

Hi there, we are encountering the truncation problem with max_new_tokens:32768. Could you please share any ideas on setting this param? Or can we just ignore the truncation warning?

@lewtun
Member

lewtun commented May 20, 2025

Hi @Nativu5, I think you can mostly ignore the truncation warning, or alternatively set max_new_tokens to a larger value if the model's context supports it.
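
A sketch of the second option, using the MODEL_ARGS format from earlier in this thread; whether 32768 fits depends on the model's context window, so treat the value as an assumption:

```bash
# Sketch: raise max_new_tokens if the model's context window supports it.
# Values mirror the MODEL_ARGS used earlier in this thread; adjust for your model.
MODEL_ARGS="model_name=$MODEL,dtype=bfloat16,data_parallel_size=$NUM_GPUS,max_model_length=32768,max_num_batched_tokens=32768,gpu_memory_utilization=0.8,generation_parameters={max_new_tokens:32768,temperature:0.6,top_p:0.95}"
```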
