Add modelId and modelText to KnnVectorQueryBuilder #106068
Conversation
Use `QueryVectorBuilder` within `KnnVectorQueryBuilder` to make it possible to perform knn queries even when a query vector is not immediately available. Supplying a `text_embedding` `query_vector_builder` with `model_text` and `model_id` instead of the `query_vector` will result in the generation of a `query_vector` by calling inference on the specified `model_id` with the supplied `model_text` (during query rewrite). This is consistent with the way query vectors are built from `model_id` / `model_text` in `KnnSearchBuilder` (DFS phase).
Hi @tteofili, I've created a changelog YAML for you.
query_vector_builder:
  text_embedding:
    model_id: text_embedding_model
    model_text: "the octopus comforter smells"
I realize that this is how the top level `knn` is done, but now that `inference_id` is becoming a thing, and being used in the core server code, maybe we can switch to it?
@davidkyle what do you think? Can we make an inference call directly from server now, so that our query DSL turns into:

"knn": {
  "inference_id": "foo",
  "my_dense_vector_field": "what did the fox jump over?"
}
The reasoning for the `query_vector_builder` stuff (the `QueryVectorBuilder` interface, same as top level `knn`) is that we cannot introduce a direct dependency between `KnnVectorQueryBuilder` (from `server`) and `TextEmbeddingQueryVectorBuilder` (from `xpack/plugin/ml`) or `InferenceAction` (from `xpack/plugin/core`).
Also, it seems desirable to have consistent behavior between top level and DSL `knn` queries.
I now see we have `SemanticTextModelSettings` in `server`, so if we want to "switch" to `inference_id` being picked up by some `ml` code at runtime (which is anyway the case with `query_vector_builder`), we can probably do that, but I'd consider doing that for both top level and DSL queries eventually.
In the near future we will want to rename `model_id` -> `inference_id` and `model_text` -> `inference_text`(?), but there are many places it needs to be done, and we should change all uses at the same time. I expect to keep the `model_id` option for compatibility.
`server` now contains the `InferenceServiceRegistry`, which you can use to look up inference objects configured in `_inference` and call infer on those objects directly, but it doesn't have access to the models in `_ml/trained_models`. I'm assuming the purpose of this PR is to create parity between the top level knn and the knn query, in which case it is best to use the same implementation for now.
In the future we will automatically create `_inference` configurations for text embedding models uploaded with Eland; at that point using the `InferenceServiceRegistry` becomes an option.
Pinging @elastic/es-search (Team:Search)
not both. Refer to <<knn-semantic-search>> to learn more.
end::knn-query-vector-builder[]
I think you need to update `docs/reference/query-dsl/knn-query.asciidoc` as well to include `knn-query-vector-builder`.
Can I use the `include` directive inside `docs/reference/query-dsl/knn-query.asciidoc`?
You should be able to use `include`.
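For example, mirroring the include lines already used for the common params in this diff (the tag name comes from this PR):

```asciidoc
include::{es-repo-dir}/rest-api/common-parms.asciidoc[tag=knn-query-vector-builder]
```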
(Required, array of floats)
(Optional, array of floats)
include::{es-repo-dir}/rest-api/common-parms.asciidoc[tag=knn-query-vector]
====

`query_vector_builder`::
(Optional, object)
include::{es-repo-dir}/rest-api/common-parms.asciidoc[tag=knn-query-vector-builder]
This API is deprecated. We shouldn't really add features to it. Maybe it occurs by proxy by adding it to the `knn` query.
On top of modifying `docs/reference/query-dsl/knn-query.asciidoc`, should I just revert this change, then?
I think so @tteofili. This API is deprecated and folks shouldn't be using it. Everything is under `_search` now, either in the `knn` clause or the `knn` query.
run elasticsearch-ci/part-4
LGTM thanks for reorganising the tests
protected void doXContent(XContentBuilder builder, Params params) throws IOException {
    if (queryVectorSupplier != null) {
        throw new IllegalStateException("missing a rewriteAndFetch?");
This needs to be done in wire serialization (StreamOutput) as well.
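For illustration, a minimal sketch of the same guard on the wire-serialization side, assuming the builder overrides `doWriteTo` from `AbstractQueryBuilder` (the trailing field writes are placeholders):

```java
@Override
protected void doWriteTo(StreamOutput out) throws IOException {
    // A non-null supplier means the query vector was never materialized,
    // i.e. rewriteAndFetch did not run before serialization.
    if (queryVectorSupplier != null) {
        throw new IllegalStateException("missing a rewriteAndFetch?");
    }
    // ... write the remaining fields (field name, query vector, num_candidates, ...)
}
```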
This looks good.
Could you test inference with nested vectors & inner hits? I suspect we will infer the model twice: once for gathering the nearest docs and another time for inner_hits. This is OK for now; fixing it will be a future optimization.
I just want to make sure this works OK when querying nested vectors & gathering inner_hits.
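For reference, such a test could send a request shaped roughly like this (the index, path, and field names here are hypothetical; the builder values mirror the test snippet above):

```json
POST my-index/_search
{
  "query": {
    "nested": {
      "path": "paragraphs",
      "query": {
        "knn": {
          "field": "paragraphs.vector",
          "query_vector_builder": {
            "text_embedding": {
              "model_id": "text_embedding_model",
              "model_text": "the octopus comforter smells"
            }
          },
          "num_candidates": 10
        }
      },
      "inner_hits": { "size": 1 }
    }
  }
}
```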
@tteofili do you mind adding a highlight to the release note associated with this PR?
@tteofili according to this PR's labels, I need to update the changelog YAML, but I can't because the PR is closed. Please either update the changelog yourself on the appropriate branch, or adjust the labels. Specifically:
Make it possible to perform KNN queries by supplying `model_text` and `model_id` instead of the `query_vector`.

This makes use of a `QueryVectorBuilder`. Supplying a `text_embedding` `query_vector_builder` with `model_text` and `model_id` instead of the `query_vector` will result in the generation of a `query_vector` by calling inference (during query rewrite) on the specified `model_id` with the supplied `model_text`.

This is consistent with the way query vectors are built from `model_id` / `model_text` in `KnnSearchBuilder` (DFS phase).

Sample query:
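A sketch of what such a query looks like (the index and field names here are illustrative; the builder values mirror the test snippet earlier in the thread):

```json
POST my-index/_search
{
  "query": {
    "knn": {
      "field": "my_dense_vector_field",
      "query_vector_builder": {
        "text_embedding": {
          "model_id": "text_embedding_model",
          "model_text": "the octopus comforter smells"
        }
      },
      "num_candidates": 100
    }
  }
}
```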
See also https://docs.google.com/document/d/12SYyHbPbCzhPYQ65HiRMesANObvzWDrBDm0Qn9QSwFQ/edit#heading=h.r3mn4wd2it4e