llama-model : fix the reported size class for nomic-embed-text-v2-moe #13223

Merged: 1 commit, May 1, 2025

Conversation

cebtenzzre
Collaborator

nomic-embed-text-v2-moe doesn't come in multiple sizes, but it is not a 137M model: it has about 475M parameters, so the reported model type was wrong.

Before:

```
...
print_info: model type       = 137M
print_info: model params     = 475.29 M
...
```

After:

```
...
print_info: model type       = 475M
print_info: model params     = 475.29 M
...
```
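For readers unfamiliar with how a size class can drift from the real parameter count, here is a minimal, self-contained sketch of one plausible cause: the label is picked from the layer shape alone, so a dense model and an MoE model with the same shape get the same label. Everything in it is an assumption made for illustration. `SizeClass`, `HParams`, `is_moe`, `size_class_before`, `size_class_after`, and the 12-layer / 768-wide shape are hypothetical names and values, not llama.cpp's actual types or the actual diff in this PR.

```cpp
#include <cstdint>
#include <string>

// Hypothetical stand-ins for illustration only; NOT llama.cpp's real types.
enum class SizeClass { Unknown, S137M, S475M };

struct HParams {
    uint32_t n_layer;  // number of transformer layers (assumed value below)
    uint32_t n_embd;   // embedding width (assumed value below)
    bool     is_moe;   // assumed flag marking the mixture-of-experts variant
};

// "Before" sketch: the label depends on the layer shape only, so the MoE
// variant inherits the dense model's 137M label.
SizeClass size_class_before(const HParams & hp) {
    if (hp.n_layer == 12 && hp.n_embd == 768) {
        return SizeClass::S137M;
    }
    return SizeClass::Unknown;
}

// "After" sketch: the same shape check, but the MoE variant gets its own label.
SizeClass size_class_after(const HParams & hp) {
    if (hp.n_layer == 12 && hp.n_embd == 768) {
        return hp.is_moe ? SizeClass::S475M : SizeClass::S137M;
    }
    return SizeClass::Unknown;
}

// Human-readable label, analogous to the "model type" line in the log.
std::string size_class_name(SizeClass c) {
    switch (c) {
        case SizeClass::S137M: return "137M";
        case SizeClass::S475M: return "475M";
        default:               return "?";
    }
}
```

With these assumed hyperparameters, `size_class_name(size_class_before(HParams{12, 768, true}))` yields `137M`, while `size_class_after` yields `475M` for the same input and still `137M` for the dense case. The parameter count itself (`475.29 M` in the log) is computed separately and was already correct; only the label changed.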

@cebtenzzre cebtenzzre requested a review from ggerganov April 30, 2025 21:41
@ggerganov ggerganov merged commit a70183e into master May 1, 2025
48 checks passed
@ggerganov ggerganov deleted the jared/fix-nomic-embed-v2-size branch May 1, 2025 07:09
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request May 1, 2025
* origin/master:
sync : ggml
whisper : add check that target name exists (whisper/3103)
ggml : suppress Windows compiler warnings (whisper/3075)
mtmd : add **vision** support for Mistral Small 3.1 (ggml-org#13231)
arg : remove CURLINFO_EFFECTIVE_METHOD (ggml-org#13228)
llama-model : fix the reported size class for nomic-embed-text-v2-moe (ggml-org#13223)
sync : ggml
ggml : fix ggml_gallocr_ptr type (ggml/1205)
cuda : fix unused variable compile warning (whisper/0)
CUDA: batched+noncont MMQ, refactor bs>1 MoE code (ggml-org#13199)
arg : -hf do not fail if url mismatch (ggml-org#13219)
fix typo: `n_ctx_pre_seq` -> `n_ctx_per_seq` (ggml-org#13221)
convert : improve model arch handling (ggml-org#13122)
llava : remove duplicate include (ggml-org#13207)
common : add -jf / --json-schema-file flag (ggml-org#12011)