Allow extra request inputs #552

dyastremsky · 2024-03-27T18:42:37Z

This will allow users to pass in additional inputs beyond those in the model. This is useful for adding support to different backends/endpoints that have different arguments they support without hardcoding each one into GenAi-Perf.

This has been tested for Triton+TRT-LLM, Triton+vLLM, vLLM+OpenAI (both endpoints). If we wanted to drop support for --streaming, we could so since that would now be supported via --extra-inputs (e.g. --extra-inputs stream:True).

When a non-existent input name is given, no error is raised and GenAi-Perf works as if it was not provided. I created a follow-up ticket (TMA-1799) to investigate if PA is not raising an error when input data for non-existent inputs is provided in an input data JSON. That ticket aims to make sure an error is raised in PA and GenAi-Perf for those cases.

Unit testing passes:

src/c++/perf_analyzer/genai-perf/genai_perf/wrapper.py

src/c++/perf_analyzer/genai-perf/tests/test_llm_inputs.py

src/c++/perf_analyzer/genai-perf/genai_perf/parser.py

src/c++/perf_analyzer/genai-perf/tests/test_cli.py

src/c++/perf_analyzer/genai-perf/tests/test_llm_inputs.py

nv-hwoo

Looks good overall! Thanks for working on this. I just left a few questions and suggestions.

src/c++/perf_analyzer/genai-perf/genai_perf/parser.py

src/c++/perf_analyzer/genai-perf/tests/test_cli.py

src/c++/perf_analyzer/genai-perf/README.md

src/c++/perf_analyzer/genai-perf/tests/test_llm_inputs.py

Co-authored-by: Hyunjae Woo <[email protected]>

nv-braf

LLM input changes look good

nv-hwoo

LGTM 🚀

dyastremsky added 3 commits March 25, 2024 16:06

Add extra_args

cbe52ee

Test extra_args

774070a

Revert commenting current LLM input tests

73191f1

dyastremsky requested review from nv-braf and debermudez March 27, 2024 18:42

dyastremsky self-assigned this Mar 27, 2024

dyastremsky requested a review from tgerdesnv March 27, 2024 18:42

dyastremsky marked this pull request as ready for review March 27, 2024 18:43

github-advanced-security bot found potential problems Mar 27, 2024

View reviewed changes

src/c++/perf_analyzer/genai-perf/genai_perf/wrapper.py Fixed Show fixed Hide fixed

dyastremsky added 2 commits March 27, 2024 11:48

Add comma between args

0776d6b

Change arg name, add parser test

8cea6ef

debermudez reviewed Mar 27, 2024

View reviewed changes

nv-braf reviewed Mar 27, 2024

View reviewed changes

src/c++/perf_analyzer/genai-perf/tests/test_llm_inputs.py Outdated Show resolved Hide resolved

dyastremsky and others added 7 commits March 27, 2024 15:09

Add extra error checking and tests

7aad03e

Make caught exception more specific.

b414590

Fix inputs, change warning to error

4f07612

Fix tests to use lists and have required args

cf9befa

Merge branch 'main' into dyas-generic-inputs

bc3ee02

Update docs, use input_name instead of key

0677748

Change wording

6217783

dyastremsky requested review from nv-braf and debermudez April 3, 2024 01:46

nv-hwoo reviewed Apr 3, 2024

View reviewed changes

Change Boolean example to lower case.

f02419f

Co-authored-by: Hyunjae Woo <[email protected]>

nv-braf reviewed Apr 3, 2024

View reviewed changes

dyastremsky added 2 commits April 3, 2024 09:35

Update tests, get rid of try-catch

4386064

Remove extra comments, remove default value

f684dbf

dyastremsky requested a review from nv-hwoo April 3, 2024 17:01

nv-hwoo approved these changes Apr 3, 2024

View reviewed changes

dyastremsky merged commit 6259cda into main Apr 3, 2024

dyastremsky deleted the dyas-generic-inputs branch April 3, 2024 17:16

debermudez pushed a commit that referenced this pull request Apr 4, 2024

Accept extra request inputs in GenAi-Perf (#552)

47dea71

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow extra request inputs #552

Allow extra request inputs #552

Uh oh!

dyastremsky commented Mar 27, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nv-hwoo left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nv-braf left a comment

Uh oh!

nv-hwoo left a comment

Uh oh!

Uh oh!

Allow extra request inputs #552

Allow extra request inputs #552

Uh oh!

Conversation

dyastremsky commented Mar 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nv-hwoo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nv-braf left a comment

Choose a reason for hiding this comment

Uh oh!

nv-hwoo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dyastremsky commented Mar 27, 2024 •

edited

Loading