llama-model : fix the reported size class for nomic-embed-text-v2-moe #13223

cebtenzzre · 2025-04-30T21:41:00Z

These models don't come in multiple sizes, but this is not a 137M model.

Before:

...
print_info: model type       = 137M
print_info: model params     = 475.29 M
...

After:

...
print_info: model type       = 475M
print_info: model params     = 475.29 M
...
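For readers unfamiliar with how a size label can drift from the real parameter count: the label is typically chosen from a few hyperparameters, not computed from the weights. The following is a minimal sketch of that idea, not the code from this PR. Every name in it (`Arch`, `HParams`, `SizeClass`, `classify`, `size_class_name`) is hypothetical, and the 12-layer / 768-wide shape is an assumption used only for illustration.

```cpp
// Hypothetical sketch: these names are illustrative and are not llama.cpp's API.
#include <cstdint>

enum class Arch { NomicBert, NomicBertMoe };

enum class SizeClass { Unknown, S137M, S475M };

struct HParams {
    std::uint32_t n_layer; // number of transformer blocks
    std::uint32_t n_embd;  // hidden size
};

// Choose the size label from the architecture and shape.
// The shape values (12 layers, 768 wide) are assumed for illustration.
constexpr SizeClass classify(Arch arch, const HParams & hp) {
    if (hp.n_layer == 12 && hp.n_embd == 768) {
        // A dense and a MoE model can share the same shape, so the
        // architecture has to take part in the decision as well.
        if (arch == Arch::NomicBert)    { return SizeClass::S137M; }
        if (arch == Arch::NomicBertMoe) { return SizeClass::S475M; }
    }
    return SizeClass::Unknown;
}

// Map the size class to the string shown in the "model type" log line.
constexpr const char * size_class_name(SizeClass s) {
    switch (s) {
        case SizeClass::S137M: return "137M";
        case SizeClass::S475M: return "475M";
        default:               return "?";
    }
}
```

If the selection looked only at the shape, the MoE variant would inherit the dense model's label, which matches the mismatch visible in the "Before" log above.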

* origin/master:
  sync : ggml
  whisper : add check that target name exists (whisper/3103)
  ggml : suppress Windows compiler warnings (whisper/3075)
  mtmd : add vision support for Mistral Small 3.1 (ggml-org#13231)
  arg : remove CURLINFO_EFFECTIVE_METHOD (ggml-org#13228)
  llama-model : fix the reported size class for nomic-embed-text-v2-moe (ggml-org#13223)
  sync : ggml
  ggml : fix ggml_gallocr_ptr type (ggml/1205)
  cuda : fix unused variable compile warning (whisper/0)
  CUDA: batched+noncont MMQ, refactor bs>1 MoE code (ggml-org#13199)
  arg : -hf do not fail if url mismatch (ggml-org#13219)
  fix typo: `n_ctx_pre_seq` -> `n_ctx_per_seq` (ggml-org#13221)
  convert : improve model arch handling (ggml-org#13122)
  llava : remove duplicate include (ggml-org#13207)
  common : add -jf / --json-schema-file flag (ggml-org#12011)

llama-model : fix the reported size class for nomic-embed-text-v2-moe

e065e8b

cebtenzzre requested a review from ggerganov April 30, 2025 21:41

ggerganov approved these changes May 1, 2025

ggerganov merged commit a70183e into master May 1, 2025
48 checks passed

ggerganov deleted the jared/fix-nomic-embed-v2-size branch May 1, 2025 07:09
