feat: custom web RAG with Firecrawl #196

justinh-rahb (Maintainer) · 2024-05-31T19:03:56Z

Firecrawl is well suited to custom web Retrieval-Augmented Generation (RAG) pipelines because of its feature set and flexibility. Here are the key highlights:

Smart LLM Scraping: Converts websites into clean, LLM-ready markdown, so the data is already in a form language models handle well, which improves the accuracy and relevance of RAG outputs.
Comprehensive Capabilities: Handles JavaScript-heavy sites and complex site navigation, so retrieval is thorough, which is critical for feeding accurate information into RAG systems.
Self-Hostable and Scalable: Offers both a hosted version and a self-hostable option, giving you control over deployment and scaling, which suits the large-scale data needs of RAG pipelines.
Efficient and Cost-Effective: Reported to save significant compute cost and time, which matters in large-scale deployments.

This combination of smart scraping, comprehensive data handling, scalability, and cost-efficiency makes Firecrawl an excellent choice for enhancing custom RAG pipelines.
Firecrawl.dev | Introduction | Scrape Docs

tjbck (Maintainer) · 2024-05-31T20:34:36Z

I guess the issue here is knowing when to search the web. One solution is to ask users to prefix their prompt with a designated command (e.g. !search what's open webui?).
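A minimal sketch of how the prefix idea could be detected, assuming the command is spelled exactly `!search` as in the example above. The function name `split_search_command` is hypothetical and not part of Open WebUI; it only decides whether a prompt asks for a web search and does not call Firecrawl itself.

```python
from typing import Tuple

# Assumption: the command spelling comes from the example in this comment.
SEARCH_PREFIX = "!search"


def split_search_command(prompt: str) -> Tuple[bool, str]:
    """Return (wants_search, text).

    wants_search is True only when the first whitespace-separated token
    is exactly the prefix (case-insensitive). text is the prompt with the
    prefix removed, or the stripped prompt when no prefix is present.
    """
    parts = prompt.strip().split(None, 1)
    if parts and parts[0].lower() == SEARCH_PREFIX:
        query = parts[1].strip() if len(parts) > 1 else ""
        return True, query
    return False, prompt.strip()


if __name__ == "__main__":
    for example in ["!search what's open webui?", "what's open webui?"]:
        print(split_search_command(example))
```

Matching the whole first token rather than using a plain `startswith` check keeps a prompt such as `!searching for ideas` from triggering a search by accident.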


bannert1337 · 2024-06-10T16:14:47Z

Why not expose it as a tool the LLM can call whenever it wants or needs it?
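A rough sketch of what the tool route could look like, assuming an OpenAI-style function-calling schema. The tool name `web_search`, the `dispatch_tool_call` helper, and the injected `fetch_markdown` callable are all hypothetical; `fetch_markdown` stands in for whatever actually calls Firecrawl, which is not shown here.

```python
import json
from typing import Callable

# Assumption: an OpenAI-style function-calling schema; the tool name and
# description are illustrative only.
WEB_SEARCH_TOOL = {
    "type": "function",
    "function": {
        "name": "web_search",
        "description": "Fetch web content as markdown when the answer "
                       "needs information the model does not have.",
        "parameters": {
            "type": "object",
            "properties": {
                "query": {"type": "string", "description": "What to look up."}
            },
            "required": ["query"],
        },
    },
}


def dispatch_tool_call(
    name: str, arguments_json: str, fetch_markdown: Callable[[str], str]
) -> str:
    """Run a tool call emitted by the model and return text to feed back to it."""
    if name != WEB_SEARCH_TOOL["function"]["name"]:
        raise ValueError(f"unknown tool: {name}")
    try:
        args = json.loads(arguments_json)
    except json.JSONDecodeError:
        # Models sometimes emit malformed arguments; report it instead of crashing.
        return "error: tool arguments were not valid JSON"
    query = args.get("query") if isinstance(args, dict) else None
    if not isinstance(query, str) or not query.strip():
        return "error: missing or empty 'query'"
    return fetch_markdown(query.strip())


if __name__ == "__main__":
    fake_fetch = lambda q: f"# Results for {q}"  # stand-in for a real scraper
    print(dispatch_tool_call("web_search", '{"query": "open webui"}', fake_fetch))
```

Returning an error string for bad arguments, rather than raising, lets the model see what went wrong and retry the call.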


atgehrhardt · 2024-06-18T04:15:02Z

One issue I can think of with it being a tool is that it's going to be really hit or miss. A tool implementation would be cool, but smaller models would really struggle with it, and this feature would be so useful that I think more people would benefit from being able to invoke it explicitly.

It can always be implemented both ways, but while the tool would be cooler, it would also be less applicable across the board.

