Skip to content

Nomic Embed integration via llama.cpp BERT implementation #2086

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 41 commits into from
Mar 13, 2024
Merged
Show file tree
Hide file tree
Changes from 33 commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
3077b97
llmodel_c: simplify casts and callbacks
cebtenzzre Feb 28, 2024
3d8159b
modellist: fix a memory leak
cebtenzzre Feb 29, 2024
536d666
modellist: remove modelDirPath(), which was never used
cebtenzzre Feb 29, 2024
520b3af
llamamodel: initial LLamaModel::embed implementation
cebtenzzre Feb 15, 2024
7b37051
s/prompts/texts/
cebtenzzre Feb 28, 2024
4f11471
python: adapt to the new LLModel::embed API
cebtenzzre Feb 28, 2024
54e8fbe
llamamodel: whitelist BERT and Nomic BERT
cebtenzzre Feb 28, 2024
e38140e
python: Nomic Embed is actually working now
cebtenzzre Feb 28, 2024
735127a
llamamodel: blacklist old MiniLM quant
cebtenzzre Feb 28, 2024
9134410
llamamodel: only allow LLamaModel::embed on embedding models
cebtenzzre Feb 28, 2024
5671c0e
chat: initial integration of new embedding code with chat UI
cebtenzzre Feb 29, 2024
4751cd3
llamamodel: use embedding code from "fix embeddings" PR
cebtenzzre Feb 29, 2024
99f1e93
matryoshka
cebtenzzre Mar 4, 2024
b998791
fix n_batch for embedding models
cebtenzzre Mar 4, 2024
979b947
update to latest changes from PR 5796 (merged)
cebtenzzre Mar 4, 2024
9b51271
better alignment with Atlas API
cebtenzzre Mar 6, 2024
8e5645c
closer adherence to Atlas API
cebtenzzre Mar 7, 2024
d00a9e0
it compiles!
cebtenzzre Mar 7, 2024
6c10840
python fixup
cebtenzzre Mar 7, 2024
5c10a2d
chat: don't do anything with pre-GGUF .bin files
cebtenzzre Mar 8, 2024
56768ff
chat: restrict local embed to all-MiniLM-L6-v2-gguf2.f16.gguf for now
cebtenzzre Mar 8, 2024
910b472
models3.json: add new MiniLM quant
cebtenzzre Mar 8, 2024
17fe0b3
models3.json: add Nomic Embed v1 and v1.5
cebtenzzre Mar 8, 2024
ab8bfd9
Merge branch 'main' into new-bert
cebtenzzre Mar 8, 2024
5f7103e
llmodel_c: fix typo
cebtenzzre Mar 8, 2024
bb1fc75
Merge branch 'main' into new-bert
cebtenzzre Mar 8, 2024
6ee8c76
fix botched merge
cebtenzzre Mar 8, 2024
eabc9cd
chat: fix filename bug caused by merge
cebtenzzre Mar 8, 2024
122ae59
chat: update isEmbeddingModel logic for models.json
cebtenzzre Mar 8, 2024
c39fa52
modellist: also whitelist nomic-embed-text-v1.txt
cebtenzzre Mar 8, 2024
33a0cd6
python: use slice to index embedding_ptr
cebtenzzre Mar 10, 2024
db10b7d
python: change dimensionality exception to a warning
cebtenzzre Mar 12, 2024
cb214df
Merge branch 'main' into new-bert
cebtenzzre Mar 12, 2024
46d64d6
style: do not use braces with single-line ifs
cebtenzzre Mar 12, 2024
39fb564
embllm: make sendAtlasRequest private
cebtenzzre Mar 12, 2024
e3fde3b
modellist: clarify arument to EmbeddingModels
cebtenzzre Mar 12, 2024
223090b
llamamodel: handle null d_ptr->model in auto-prefix embed()
cebtenzzre Mar 12, 2024
5c1ddb0
llamamodel: fix missing const
cebtenzzre Mar 12, 2024
9b93ab1
llamamodel: make magic constant less magic
cebtenzzre Mar 12, 2024
967616e
models3.json: assume these changes will be released in v2.7.4
cebtenzzre Mar 13, 2024
ad86bf5
modellist: never list or use models with disableGUI set
cebtenzzre Mar 13, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 0 additions & 5 deletions gpt4all-backend/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -97,11 +97,6 @@ foreach(BUILD_VARIANT IN LISTS BUILD_VARIANTS)
add_library(gptj-${BUILD_VARIANT} SHARED
gptj.cpp utils.h utils.cpp llmodel_shared.cpp llmodel_shared.h)
prepare_target(gptj llama-mainline)

add_library(bert-${BUILD_VARIANT} SHARED
bert.cpp utils.h utils.cpp llmodel_shared.cpp llmodel_shared.h)
target_compile_definitions(bert-${BUILD_VARIANT} PRIVATE LLAMA_VERSIONS=>=3 LLAMA_DATE=999999)
prepare_target(bert llama-mainline)
endif()
endforeach()

Expand Down
Loading