Issue when running latest Mistral model #4948


Closed · fciannella opened this issue Mar 31, 2025 · 15 comments

@fciannella commented Mar 31, 2025

I am trying to run mistralai/Mistral-Small-3.1-24B-Instruct-2503

root@batch-block7-00842:~/src# pip list | grep -i trans
hf_transfer                       0.1.9
transformers                      4.50.0
root@batch-block7-00842:~/src# pip list | grep -i sg
msgpack                           1.1.0
msgspec                           0.19.0
sgl-kernel                        0.0.5.post3
sglang                            0.4.4.post2         /root/src/sglang/python 

I am getting this error:

[2025-03-31 08:45:36 TP0] The following error message 'operation scheduled before its operands' can be ignored.
[2025-03-31 08:45:36 TP0] Scheduler hit an exception: Traceback (most recent call last):
  File "/app/sglang/python/sglang/srt/managers/scheduler.py", line 1999, in run_scheduler_process
    scheduler = Scheduler(server_args, port_args, gpu_id, tp_rank, dp_rank)
  File "/app/sglang/python/sglang/srt/managers/scheduler.py", line 249, in __init__
    self.tp_worker = TpWorkerClass(
  File "/app/sglang/python/sglang/srt/managers/tp_worker_overlap_thread.py", line 63, in __init__
    self.worker = TpModelWorker(server_args, gpu_id, tp_rank, dp_rank, nccl_port)
  File "/app/sglang/python/sglang/srt/managers/tp_worker.py", line 74, in __init__
    self.model_runner = ModelRunner(
  File "/app/sglang/python/sglang/srt/model_executor/model_runner.py", line 169, in __init__
    self.initialize(min_per_gpu_memory)
  File "/app/sglang/python/sglang/srt/model_executor/model_runner.py", line 179, in initialize
    self.load_model()
  File "/app/sglang/python/sglang/srt/model_executor/model_runner.py", line 392, in load_model
    self.model = get_model(
  File "/app/sglang/python/sglang/srt/model_loader/__init__.py", line 22, in get_model
    return loader.load_model(
  File "/app/sglang/python/sglang/srt/model_loader/loader.py", line 365, in load_model
    model = _initialize_model(
  File "/app/sglang/python/sglang/srt/model_loader/loader.py", line 144, in _initialize_model
    model_class, _ = get_model_architecture(model_config)
  File "/app/sglang/python/sglang/srt/model_loader/utils.py", line 37, in get_model_architecture
    return ModelRegistry.resolve_model_cls(architectures)
  File "/app/sglang/python/sglang/srt/models/registry.py", line 65, in resolve_model_cls
    return self._raise_for_unsupported(architectures)
  File "/app/sglang/python/sglang/srt/models/registry.py", line 32, in _raise_for_unsupported
    raise ValueError(
ValueError: Model architectures ['Mistral3ForConditionalGeneration'] are not supported for now. Supported architectures: dict_keys(['BaichuanForCausalLM', 'ChatGLMModel', 'CLIPModel', 'CohereForCausalLM', 'Cohere2ForCausalLM', 'DbrxForCausalLM', 'DeepseekForCausalLM', 'MultiModalityCausalLM', 'DeepseekV3ForCausalLMNextN', 'DeepseekV2ForCausalLM', 'DeepseekV3ForCausalLM', 'DeepseekVL2ForCausalLM', 'ExaoneForCausalLM', 'GemmaForCausalLM', 'Gemma2ForCausalLM', 'Gemma2ForSequenceClassification', 'Gemma3ForCausalLM', 'Gemma3ForConditionalGeneration', 'GPT2LMHeadModel', 'GPTBigCodeForCausalLM', 'GraniteForCausalLM', 'Grok1ForCausalLM', 'Grok1ModelForCausalLM', 'InternLM2ForCausalLM', 'InternLM2ForRewardModel', 'LlamaForCausalLM', 'Phi3ForCausalLM', 'InternLM3ForCausalLM', 'LlamaForClassification', 'LlamaForCausalLMEagle', 'LlamaForCausalLMEagle3', 'LlamaEmbeddingModel', 'MistralModel', 'LlamaForSequenceClassification', 'LlamaForSequenceClassificationWithNormal_Weights', 'LlavaLlamaForCausalLM', 'LlavaQwenForCausalLM', 'LlavaMistralForCausalLM', 'LlavaVidForCausalLM', 'MiniCPMForCausalLM', 'MiniCPM3ForCausalLM', 'MiniCPMO', 'MiniCPMV', 'MistralForCausalLM', 'MixtralForCausalLM', 'QuantMixtralForCausalLM', 'MllamaForConditionalGeneration', 'OlmoForCausalLM', 'Olmo2ForCausalLM', 'OlmoeForCausalLM', 'Phi3SmallForCausalLM', 'QWenLMHeadModel', 'Qwen2ForCausalLM', 'Qwen2_5_VLForConditionalGeneration', 'Qwen2ForSequenceClassification', 'Qwen2ForCausalLMEagle', 'Qwen2MoeForCausalLM', 'Qwen2ForRewardModel', 'Qwen2VLForConditionalGeneration', 'StableLmForCausalLM', 'TorchNativeLlamaForCausalLM', 'TorchNativePhi3ForCausalLM', 'XverseForCausalLM', 'XverseMoeForCausalLM', 'YiVLForCausalLM'])

Is this a model-support issue, or is it just a matter of updating some files? The 2501 version was working!

fciannella changed the title from "Issue when running latest Mistal model" to "Issue when running latest Mistral model" on Mar 31, 2025
@adarshxs (Contributor) commented Apr 1, 2025

Hey @fciannella, Mistral Small 3.1 is not yet supported; we are in the process of adding it. You can track updates related to it here: Model coverage

@KivenChen (Contributor)

I'm selecting a VLM to run on low-spec compute for a use case -- happy to help if support is needed.

@KivenChen (Contributor)

Hi @fciannella @adarshxs, I just created a PR branch for Mistral 3.1 support (#5099). It currently works for text generation, single-image input, and tensor-parallel serving; multi-image input is yet to be tested.

If you are interested, feel free to take a look and test it out. We'd greatly appreciate anyone's feedback.

@adarshxs (Contributor) commented Apr 7, 2025

Thanks @KivenChen. Cc @mickqian

@fciannella (Author) commented Apr 7, 2025 via email

Still not working for me. Can you add detailed instructions on how to run it?

I am using python3 -m sglang.launch_server --model-path mistralai/Mistral-Small-3.1-24B-Instruct-2503 --host 0.0.0.0 --port 30000 but getting the same error as before.

@KivenChen (Contributor)

Checking out the dev branch (kivenchen/kgl/kiv__m1stral) and installing from source with pip install -e should do the job. If not, the real cause can be found in the server's debug-level log as "Ignore import error...".
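
For concreteness, the steps would look roughly like this (a sketch: the PR number is #5099 from the comment above, the clone URL assumes the main sglang repo, and the local branch name kiv__m1stral is just illustrative):

# fetch the PR branch and install sglang from source in editable mode
git clone https://github.com/sgl-project/sglang.git
cd sglang
git fetch origin pull/5099/head:kiv__m1stral
git checkout kiv__m1stral
pip install -e "python[all]"

# then launch the server as before
python3 -m sglang.launch_server --model-path mistralai/Mistral-Small-3.1-24B-Instruct-2503 --host 0.0.0.0 --port 30000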

@fciannella (Author) commented Apr 8, 2025 via email

@KivenChen (Contributor) commented Apr 8, 2025

@fciannella It seems the chat template isn't registered successfully.

You can either:

  1. Pass --chat-template llama-2 (it is compatible), or
  2. Convert the JSON template to Jinja: copy the template string from chat_template.json, print it so the \n escapes become real newlines, save it as a .jinja file, and use it with --chat-template xxx.jinja (see the sketch below)
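
Option 2 can be scripted in a couple of lines (a sketch: it assumes chat_template.json sits in the downloaded model directory and stores the template under a chat_template key, as Hugging Face processor configs usually do; the .jinja filename is illustrative):

# extract the template string; json.load already turns the \n escapes into real newlines
python3 -c "import json; print(json.load(open('chat_template.json'))['chat_template'])" > mistral-small-3.1.jinja

# pass the file at launch
python3 -m sglang.launch_server --model-path mistralai/Mistral-Small-3.1-24B-Instruct-2503 --chat-template mistral-small-3.1.jinja --host 0.0.0.0 --port 30000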

@fciannella (Author)

Works for me now (I only use it for text for now).

Thank you so much!

@fciannella (Author) commented Apr 10, 2025 via email

@KivenChen (Contributor)

I haven't tested that yet. Since Mistral has its own standards, did their docs mention anything special?

@justicel

FYI using tools doesn't work with the branch as-is.

@KivenChen (Contributor) commented Apr 16, 2025

@justicel It seems this involves sglang's tool call parsers (same for structured output, @fciannella). I'll be back with details. Meanwhile, I'm working on clearing Mistral 3.1's upstream dependencies: #5084

@KivenChen (Contributor)

> FYI using tools doesn't work with the branch as-is.

SGLang actually has a Mistral tool call parser. Have you tried this tool calling template?
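
For reference, the parser is selected at launch time; a sketch (the --tool-call-parser flag and its mistral option are from sglang's function-calling docs; I haven't verified them on this branch):

python3 -m sglang.launch_server --model-path mistralai/Mistral-Small-3.1-24B-Instruct-2503 --tool-call-parser mistral --host 0.0.0.0 --port 30000

Tools are then passed in the standard tools field of an OpenAI-compatible /v1/chat/completions request.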

@KivenChen (Contributor)

FYI, structured output works as expected; tested with the official example.
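
For anyone following along, a sketch of such a request against sglang's OpenAI-compatible endpoint (the schema below is illustrative, not the official example; it assumes sglang accepts a json_schema response_format on /v1/chat/completions, as its structured-output docs describe):

curl http://localhost:30000/v1/chat/completions -H "Content-Type: application/json" -d '{
  "model": "mistralai/Mistral-Small-3.1-24B-Instruct-2503",
  "messages": [{"role": "user", "content": "Name the capital of France as JSON."}],
  "response_format": {"type": "json_schema", "json_schema": {"name": "capital", "schema": {"type": "object", "properties": {"capital": {"type": "string"}}, "required": ["capital"]}}}
}'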

b8zhong closed this as completed May 22, 2025