
Evaluate AI agent response relevance using OpenAI and cosine similarity

n8n · 17 modules · v1.0

OpenAI

About this workflow

This n8n template demonstrates how to calculate the evaluation metric "Relevance", which in this scenario measures how relevant the agent's response is to the user's question. The scoring approach is adapted from the open-source evaluation project RAGAS (the original listing links to the source).

How it works

This evaluation works best for Q&A agents. To score a response, we analyse the agent's answer and ask another AI model to generate a question from it. The generated question is then compared to the original question using cosine similarity. A high score indicates that the response is relevant and that the agent answered the question. A low score suggests the agent may have added too much irrelevant information, gone off script, or hallucinated.

Requirements

n8n version 1.94+

Sample data is available in a Google Sheet linked from the original listing.
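The scoring idea described in this section (generate a question from the agent's answer, then compare it to the original question with cosine similarity) can be sketched in Python. This is a minimal illustration under stated assumptions, not the workflow's actual implementation: `generate_question` and `embed` are hypothetical callables that you would back with your own LLM and embedding model (for example, OpenAI), and `relevance_score` is a name introduced here for illustration.

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # A zero vector has no direction; treat it as "no similarity".
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def relevance_score(
    question: str,
    answer: str,
    generate_question: Callable[[str], str],  # hypothetical: LLM call
    embed: Callable[[str], Vector],           # hypothetical: embedding call
) -> float:
    """Score how relevant `answer` is to `question`.

    Assumed flow: derive a question from the answer, embed both
    questions, and compare the two embeddings with cosine similarity.
    """
    generated = generate_question(answer)
    return cosine_similarity(embed(question), embed(generated))
```

Passing the model calls in as functions keeps the scoring logic testable without network access. In practice, scores near 1 suggest the answer addresses the question, while lower scores suggest drift or irrelevant content.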

How to import this n8n workflow

1. Download the workflow JSON file after purchase.
2. Open n8n → click the menu → Import from File.
3. Select the downloaded JSON and import.
4. Set up credentials for each node that requires them.
5. Click Execute Workflow to test, then activate.


