What this does Receives a URL via webhook, uses Firecrawl to scrape the page into clean markdown, and stores it as vector embeddings in Supabase pgvector. A visual, self-hosted ingestion pipeline for RAG knowledge bases. Adding a new source is as simple as sending a URL. The second part of the workflow exposes a chat interface where an AI Agent queries the stored knowledge base to answer questions, with Cohere reranking for better retrieval quality. How it works Part 1: Ingestion Pipeline 1. Web

What this does Receives a URL via webhook, uses Firecrawl to scrape the page into clean markdown, and stores it as vector embeddings in Supabase pgvector. A visual, self-hosted ingestion pipeline for RAG knowledge bases. Adding a new source is as simple as sending a URL. The second part of the workflow exposes a chat interface where an AI Agent queries the stored knowledge base to answer questions, with Cohere reranking for better retrieval quality. How it works Part 1: Ingestion Pipeline 1. Webhook receives a POST request with a url field 2. Verify URL validates and normalizes the domain 3. Supabase checks if the URL was already ingested (deduplication) 4. If the URL already exists, ingestion is skipped; otherwise it continues 5. Firecrawl fetches the page and converts it to clean markdown 6. OpenAI generates vector embeddings from the scraped content 7. Default Data Loader attaches the source URL as metadata 8. Supabase Vector Store inserts the content and embeddings into pgvector 9. Respond to Webhook confirms how many items were added Part 2: RAG Chat Agent 1. Chat trigger receives a user question 2. AI Agent (OpenRouter) queries the Supabase vector store filtered by URL 3. Cohere Reranker improves retrieval quality before the agent responds 4. Agent answers based solely on the ingested knowledge base Requirements - Firecrawl API key - OpenAI API key (for embeddings) - OpenRouter API key (for the chat agent) - Cohere API key (for reranking) - Supabase project with pgvector enabled Setup 1. Create a Supabase project and run the following SQL in the SQL editor: 2. Add your Firecrawl API key as a credential in n8n 3. Add your OpenAI API key as a credential (for embeddings) 4. Add your OpenRouter API key as a credential (for the chat agent) 5. Add your Cohere API key as a credential (for reranking) 6. Activate the workflow How to use Send a POST request to the webhook URL: Then open the chat interface in n8n to ask questions about the ingested content.
Download the workflow JSON file after purchase.
Open n8n → click the menu → Import from File.
Select the downloaded JSON and import.
Set up credentials for each node that requires them.
Click Execute Workflow to test, then activate.
Setup guide included
Purchase to unlock the full step-by-step guide
No reviews yet
Be the first to buy and share your experience.
Leave a review
Sign in to share your experience with this workflow.
Create a free account to purchase workflows.
Need help setting this up?
Book a 3-hour live setup session with an Agility consultant.