Skip to content

Commit 92055de

Browse files
authored
Bump TGI version to 1.4.2 to support gemma models (#288)
1 parent ffd5a19 commit 92055de

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

benchmarks/inference-server/text-generation-inference/manifest-templates/text-generation-inference.tftpl

+1-1
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ spec:
5050
- name: text-generation-inference
5151
ports:
5252
- containerPort: 80
53-
image: "ghcr.io/huggingface/text-generation-inference:1.1.1"
53+
image: "ghcr.io/huggingface/text-generation-inference:1.4.2"
5454
args: ["--model-id", "${model_id}", "--num-shard", "${gpu_count}"] # , "{token}" tensor parallelism, should correspond to number of gpus below
5555
%{ for hugging_face_token_secret in hugging_face_token_secret_list ~}
5656
env:

0 commit comments

Comments
 (0)