
Add support for SageMaker Inference Components in sagemaker chat #10603


Open
wants to merge 31 commits into main

Conversation

bobbywlindsey (Contributor)

Title

Add support for SageMaker Inference Components in sagemaker chat

Relevant issues

Fixes #9909

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement; see details)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible; it only solves 1 specific problem

Type

🆕 New Feature
🐛 Bug Fix

Changes

If model_id is passed as a parameter to completion(model="sagemaker_chat/*", ...) calls:

  • Include an additional request header that enables Inference Components for SageMaker endpoints using the Messages API, using model_id as the Inference Component name (see the sketch below)
  • Remove the model key and its value from the request body
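
For illustration, here is a minimal sketch of the transformation those two bullets describe. This is not the PR's actual code: the helper name and signature are hypothetical, and the header name X-Amzn-SageMaker-Inference-Component is assumed based on the SageMaker InvokeEndpoint API's header for Inference Component routing.

```python
from typing import Optional, Tuple


def route_to_inference_component(
    headers: dict,
    request_body: dict,
    model_id: Optional[str] = None,
) -> Tuple[dict, dict]:
    """Hypothetical helper mirroring the change described above.

    When model_id is provided, route the request to a SageMaker
    Inference Component via a request header, and drop the 'model'
    key from the body so the component selection is not contradicted
    by the payload.
    """
    if model_id is not None:
        # Assumed header name for Inference Component routing.
        headers["X-Amzn-SageMaker-Inference-Component"] = model_id
        # Per the change above, remove the 'model' key from the body.
        request_body.pop("model", None)
    return headers, request_body
```

From the caller's side, the feature would be exercised roughly like this (the endpoint and component names are placeholders):

```python
import litellm

response = litellm.completion(
    model="sagemaker_chat/my-endpoint",   # placeholder endpoint name
    model_id="my-inference-component",    # placeholder Inference Component
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```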

vercel bot commented May 6, 2025

The latest updates on your projects:

litellm: ✅ Ready (updated Jun 11, 2025, 8:42pm UTC)

@CLAassistant commented May 6, 2025

CLA assistant check: all committers have signed the CLA.

@krrishdholakia (Contributor)

@bobbywlindsey sagemaker_chat no longer uses the openai_like/ flow; it's been migrated to our common base_llm_http_handler.py.

Can you please update your PR to reflect the change?

You should just need to make your mods here:

Called if Sagemaker endpoint supports HF Messages API.

@bobbywlindsey (Contributor, Author)

@krrishdholakia Nice refactor! I put my changes where you suggested and it seems to work nicely 👍🏻 Could you review again? Thanks!

@krrishdholakia (Contributor)

Hey @bobbywlindsey, could you please add some unit tests inside tests/litellm?

@bobbywlindsey (Contributor, Author)

Hey @ishaan-jaff, any chance you could take a look at this PR? Thanks!

@dgallitelli

@krrishdholakia I see you're the reviewer for this PR. All the checks have passed and there are no conflicts with the base branch. Can we merge? Thank you!

Development

Successfully merging this pull request may close these issues:

[Bug]: sagemaker_chat provider does not correctly pass the model_id for the SageMaker Inference Component (#9909)