Qdrant Wrapper issue: _document_from_score_point exposes incorrect key for content #1087

rishabh-ti · 2023-02-16T13:18:41Z

pydantic.error_wrappers.ValidationError: 1 validation error for Document
page_content
  none is not an allowed value (type=type_error.none.not_allowed)

The text was updated successfully, but these errors were encountered:

Fixes #1087

kacperlukawski · 2023-02-16T17:17:53Z

@rishabh-ti Could you please share some more context? How do you create a Qdrant collection? Are there any chances you provide None as one of the texts?

The #1088 introduced a bug in Qdrant integration. That PR reverts those changes and provides class attributes to ensure consistent payload keys. In addition to that, an exception will be thrown if any of texts is None (that could have been an issue reported in #1087)

rishabh-ti · 2023-02-17T11:12:35Z

@kacperlukawski

client = QdrantClient(host="localhost", port=6333)
embeddings = OpenAIEmbeddings()
qdrant = Qdrant(client=client,collection_name=collection_name,embedding_function=embeddings.embed_query)
llm = OpenAI(model_name="text-davinci-003", temperature=0.0, max_tokens=1500)
qa_chain = ChatVectorDBChain.from_llm(
    llm=llm,
    vectorstore=qdrant,
    condense_question_prompt=question_prompt,
    qa_prompt=answer_prompt,
    chain_type="stuff"
)

So my collection & records already exist in Qdrant. I was passing it in ChatVectorDBChain which was giving me issues when I ran:
result = qa_chain({"chat_history": chat_history,"question": query})

kacperlukawski · 2023-02-17T11:36:55Z

@rishabh-ti Great to see the whole context. Thanks a lot! I assume you uploaded the records on your own and just used the "content" payload key to store the texts. If so, we can consider introducing two additional parameters into the __init__ of Qdrant:

Document content payload key
Metadata payload key

The default values would be set to "page_content" and "metadata" respectively, but in your case, you'll be able to change them so they work as intended. Would that be ok?

rishabh-ti · 2023-02-17T11:58:29Z

@rishabh-ti Great to see the whole context. Thanks a lot! I assume you uploaded the records on your own and just used the "content" payload key to store the texts. If so, we can consider introducing two additional parameters into the __init__ of Qdrant:

Document content payload key

Metadata payload key

The default values would be set to "page_content" and "metadata" respectively, but in your case, you'll be able to change them so they work as intended. Would that be ok?

Yes that would work!

Fixes langchain-ai#1087

This PR: - Increases `qdrant-client` version to 1.0.4 - Introduces custom content and metadata keys (as requested in #1087) - Moves all the `QdrantClient` parameters into the method parameters to simplify code completion

Fixes langchain-ai#1087

…in-ai#1093) The langchain-ai#1088 introduced a bug in Qdrant integration. That PR reverts those changes and provides class attributes to ensure consistent payload keys. In addition to that, an exception will be thrown if any of texts is None (that could have been an issue reported in langchain-ai#1087)

This PR: - Increases `qdrant-client` version to 1.0.4 - Introduces custom content and metadata keys (as requested in langchain-ai#1087) - Moves all the `QdrantClient` parameters into the method parameters to simplify code completion

Thiru-GVT · 2023-12-11T02:44:08Z

Hello, I have this exact same issue. Could I ask you to expand more on this metadata?

I have my qdrant server with data inside, but when i try to use it as a retriever, i keep getting this validation error

rishabh-ti mentioned this issue Feb 16, 2023

Update qdrant.py #1088

Merged

hwchase17 closed this as completed in #1088 Feb 16, 2023

hwchase17 pushed a commit that referenced this issue Feb 16, 2023

Update qdrant.py (#1088)

5d11e5d

Fixes #1087

kacperlukawski mentioned this issue Feb 16, 2023

Hotfix: Qdrant content retrieval (revert: #1088) #1093

Merged

dongreenberg pushed a commit to dongreenberg/langchain that referenced this issue Feb 17, 2023

Update qdrant.py (langchain-ai#1088)

b1b0d00

Fixes langchain-ai#1087

kacperlukawski mentioned this issue Mar 2, 2023

Add Qdrant named arguments #1386

Merged

zachschillaci27 pushed a commit to zachschillaci27/langchain that referenced this issue Mar 8, 2023

Update qdrant.py (langchain-ai#1088)

9b07c4c

Fixes langchain-ai#1087

dosubot bot mentioned this issue Dec 11, 2023

Qdrant retriever with existing data leads to pydantic.error_wrappers.ValidationError: 1 validation error for Document page_content none is not an allowed value (type=type_error.none.not_allowed) #14515

Closed

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qdrant Wrapper issue: _document_from_score_point exposes incorrect key for content #1087

Qdrant Wrapper issue: _document_from_score_point exposes incorrect key for content #1087

rishabh-ti commented Feb 16, 2023

kacperlukawski commented Feb 16, 2023 •

edited

Loading

rishabh-ti commented Feb 17, 2023 •

edited

Loading

kacperlukawski commented Feb 17, 2023 •

edited

Loading

rishabh-ti commented Feb 17, 2023

Thiru-GVT commented Dec 11, 2023

Qdrant Wrapper issue: _document_from_score_point exposes incorrect key for content #1087

Qdrant Wrapper issue: _document_from_score_point exposes incorrect key for content #1087

Comments

rishabh-ti commented Feb 16, 2023

kacperlukawski commented Feb 16, 2023 • edited Loading

rishabh-ti commented Feb 17, 2023 • edited Loading

kacperlukawski commented Feb 17, 2023 • edited Loading

rishabh-ti commented Feb 17, 2023

Thiru-GVT commented Dec 11, 2023

kacperlukawski commented Feb 16, 2023 •

edited

Loading

rishabh-ti commented Feb 17, 2023 •

edited

Loading

kacperlukawski commented Feb 17, 2023 •

edited

Loading