Commit 1db5104

chore(model gallery): add smolvlm-500m-instruct (#5413)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent ec21b58 commit 1db5104

1 file changed: +18 -0 lines changed
gallery/index.yaml

Lines changed: 18 additions & 0 deletions
@@ -29,6 +29,24 @@
     - filename: SmolVLM-256M-Instruct-Q8_0.gguf
       sha256: 2a31195d3769c0b0fd0a4906201666108834848db768af11de1d2cef7cd35e65
       uri: huggingface://ggml-org/SmolVLM-256M-Instruct-GGUF/SmolVLM-256M-Instruct-Q8_0.gguf
+- !!merge <<: *smolvlm
+  name: "smolvlm-500m-instruct"
+  urls:
+    - https://huggingface.co/HuggingFaceTB/SmolVLM-500M-Instruct
+    - https://huggingface.co/ggml-org/SmolVLM-500M-Instruct-GGUF
+  description: |
+    SmolVLM-500M is a tiny multimodal model, member of the SmolVLM family. It accepts arbitrary sequences of image and text inputs to produce text outputs. It's designed for efficiency. SmolVLM can answer questions about images, describe visual content, or transcribe text. Its lightweight architecture makes it suitable for on-device applications while maintaining strong performance on multimodal tasks. It can run inference on one image with 1.23GB of GPU RAM.
+  overrides:
+    parameters:
+      model: SmolVLM-500M-Instruct-Q8_0.gguf
+    mmproj: mmproj-SmolVLM-500M-Instruct-Q8_0.gguf
+  files:
+    - filename: mmproj-SmolVLM-500M-Instruct-Q8_0.gguf
+      sha256: d1eb8b6b23979205fdf63703ed10f788131a3f812c7b1f72e0119d5d81295150
+      uri: huggingface://ggml-org/SmolVLM-500M-Instruct-GGUF/mmproj-SmolVLM-500M-Instruct-Q8_0.gguf
+    - filename: SmolVLM-500M-Instruct-Q8_0.gguf
+      sha256: 9d4612de6a42214499e301494a3ecc2be0abdd9de44e663bda63f1152fad1bf4
+      uri: huggingface://ggml-org/SmolVLM-500M-Instruct-GGUF/SmolVLM-500M-Instruct-Q8_0.gguf
 - &qwen3
   url: "github:mudler/LocalAI/gallery/qwen3.yaml@master"
   name: "qwen3-30b-a3b"
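Each entry under `files` in the new gallery item pairs a filename with a `sha256` digest. As a rough illustration of how those digests could be checked by hand after downloading, here is a minimal Python sketch. It is not how LocalAI itself verifies downloads; the helper names `sha256_of` and `verify_models` and the `models_dir` argument are hypothetical, and only the filenames and digests are taken from the diff.

```python
import hashlib
from pathlib import Path

# Expected digests copied from the `files` entries added in this commit.
EXPECTED_SHA256 = {
    "mmproj-SmolVLM-500M-Instruct-Q8_0.gguf":
        "d1eb8b6b23979205fdf63703ed10f788131a3f812c7b1f72e0119d5d81295150",
    "SmolVLM-500M-Instruct-Q8_0.gguf":
        "9d4612de6a42214499e301494a3ecc2be0abdd9de44e663bda63f1152fad1bf4",
}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large GGUF files are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_models(models_dir):
    """Hypothetical helper: map each filename to True (digest matches),
    False (digest differs), or None (file not present in models_dir)."""
    results = {}
    for name, expected in EXPECTED_SHA256.items():
        path = Path(models_dir) / name
        results[name] = (sha256_of(path) == expected) if path.is_file() else None
    return results
```

Calling `verify_models("models")` on a directory assumed to hold the downloaded files would return a dictionary keyed by filename; a `False` entry suggests a corrupted or mismatched download.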
