Skip to content

Commit 5204ff5

Browse files
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)
Signed-off-by: DarkLight1337 <[email protected]>
1 parent 0cc6b38 commit 5204ff5

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

vllm/model_executor/models/granitemoe.py

+1
Original file line numberDiff line numberDiff line change
@@ -348,6 +348,7 @@ def __init__(self, *, vllm_config: VllmConfig, prefix: str = ""):
348348

349349
self.config = config
350350
self.lora_config = lora_config
351+
self.quant_config = quant_config # Required by MixtralForCausalLM
351352

352353
self.model = GraniteMoeModel(vllm_config=vllm_config,
353354
prefix=maybe_prefix(prefix, "model"))

0 commit comments

Comments
 (0)