Support Sparse Embedding Retrieval #7355

anakin87 · 2024-03-13T16:35:28Z

It is a feature the community asks for and is currently supported by Qdrant and Pinecone.

Update
I experimented with the complete round trip: from Document to sparse embedding stored in Qdrant/Pinecone and then querying (notebook).

What we need to do:

- [x] Investigate/design the integration
- [x] Introduce SparseEmbedding class and add it to Document
- [ ] https://github.com/deepset-ai/haystack-core-integrations/issues/604
- [x] release the SparseEmbedding class in 2.0.1
- [x] Introduce a first Sparse Embedder (https://github.com/deepset-ai/haystack-core-integrations/pull/579)
- [x] Make Qdrant write sparse embeddings (https://github.com/deepset-ai/haystack-core-integrations/pull/578)
- [x] Introduce Qdrant Sparse Embedding Retriever (https://github.com/deepset-ai/haystack-core-integrations/pull/578)
- [x] non-urgent: understand the problems related to Qdrant Hybrid Retriever
- [ ] https://github.com/deepset-ai/haystack-core-integrations/issues/695
- [ ] https://github.com/deepset-ai/haystack-core-integrations/issues/660
- [ ] https://github.com/deepset-ai/haystack-core-integrations/pull/675
- [x] The feature was announced through social media

The text was updated successfully, but these errors were encountered:

lambda-science · 2024-03-13T20:52:19Z

Note:
As a 1st step: we now have working Sparse embedder in Haystack through FastEmbed integration
deepset-ai/haystack-core-integrations#579

lambda-science · 2024-03-13T23:23:25Z

Btw it would be cool to have a general BM25 Embedder in core haystack repo instead of relying only on Splade Embedder from FastEmbed :) As you already have a haystack-bm25

lambda-science · 2024-03-14T10:22:12Z

Note:
As a 2nd step: Qdrant integration could now support Sparse vector and be compatible with the FastEmbed sparse embedder from above 👀 deepset-ai/haystack-core-integrations#578

anakin87 self-assigned this Mar 13, 2024

anakin87 added the type:feature New feature or request label Mar 19, 2024

anakin87 mentioned this issue Mar 19, 2024

feat: introduce SparseEmbedding #7382

Merged

anakin87 changed the title ~~Design the support for Sparse Embedding Retrieval~~ Support for Sparse Embedding Retrieval Mar 22, 2024

anakin87 changed the title ~~Support for Sparse Embedding Retrieval~~ Support Sparse Embedding Retrieval Mar 22, 2024

anakin87 closed this as completed Apr 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Sparse Embedding Retrieval #7355

Support Sparse Embedding Retrieval #7355

anakin87 commented Mar 13, 2024 •

edited

Loading

lambda-science commented Mar 13, 2024

lambda-science commented Mar 13, 2024

lambda-science commented Mar 14, 2024

Support Sparse Embedding Retrieval #7355

Support Sparse Embedding Retrieval #7355

Comments

anakin87 commented Mar 13, 2024 • edited Loading

lambda-science commented Mar 13, 2024

lambda-science commented Mar 13, 2024

lambda-science commented Mar 14, 2024

anakin87 commented Mar 13, 2024 •

edited

Loading