Skip to content

Commit dc6f323

Browse files
authored
Update README.md
1 parent f9d0fcb commit dc6f323

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

README.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -18,9 +18,9 @@ Deploy OpenAI-Compatible Blazing-Fast LLM Endpoints powered by the [vLLM](https:
1818
### 1. UI for Deploying vLLM Worker on RunPod console:
1919
![Demo of Deploying vLLM Worker on RunPod console with new UI](media/ui_demo.gif)
2020

21-
### 2. Worker vLLM `v2.0.0` with vLLM `0.7.2` now available under `stable` tags
21+
### 2. Worker vLLM `v2.1.0` with vLLM `0.7.3` now available under `stable` tags
2222

23-
Update v2.0.0 is now available, use the image tag `runpod/worker-v1-vllm:v2.0.0stable-cuda12.1.0`.
23+
Update v2.0.0 is now available, use the image tag `runpod/worker-v1-vllm:v2.1.0stable-cuda12.1.0`.
2424

2525
### 3. OpenAI-Compatible [Embedding Worker](https://github.com/runpod-workers/worker-infinity-embedding) Released
2626
Deploy your own OpenAI-compatible Serverless Endpoint on RunPod with multiple embedding models and fast inference for RAG and more!
@@ -82,7 +82,7 @@ Below is a summary of the available RunPod Worker images, categorized by image s
8282

8383
| CUDA Version | Stable Image Tag | Development Image Tag | Note |
8484
|--------------|-----------------------------------|-----------------------------------|----------------------------------------------------------------------|
85-
| 12.1.0 | `runpod/worker-v1-vllm:v2.0.0stable-cuda12.1.0` | `runpod/worker-v1-vllm:v2.0.0dev-cuda12.1.0` | When creating an Endpoint, select CUDA Version 12.3, 12.2 and 12.1 in the filter. |
85+
| 12.1.0 | `runpod/worker-v1-vllm:v2.1.0stable-cuda12.1.0` | `runpod/worker-v1-vllm:v2.1.0dev-cuda12.1.0` | When creating an Endpoint, select CUDA Version 12.3, 12.2 and 12.1 in the filter. |
8686

8787

8888

0 commit comments

Comments
 (0)