Home/Browse/Evaluation metric example: Check if tool was called

Evaluation metric example: Check if tool was called

n8n11 modulesv1.0

agentchatTriggerevaluationevaluationTriggerhttpRequestToollmChatOpenAinoOpsettoolCalculator

Evaluation metric example: Check if tool was called workflow diagram

About this workflow

AI evaluation in n8n This is a template for n8n's evaluation feature. Evaluation is a technique for getting confidence that your AI workflow performs reliably, by running a test dataset containing different inputs through the workflow. By calculating a metric (score) for each input, you can see where the workflow is performing well and where it isn't. How it works This template shows how to calculate a workflow evaluation metric: whether a specific tool was called by an agent. - We use an evaluation trigger to read in our dataset - It is wired up in parallel with the regular trigger so that the workflow can be started from either one. More info - We make sure that the agent outputs the list of tools that it used - We then check whether the expected tool (from the dataset) is in that list - Finally we pass this information back to n8n as a metric

How to import this n8n workflow

1
Download the workflow JSON file after purchase.
2
Open n8n → click the menu → Import from File.
3
Select the downloaded JSON and import.
4
Set up credentials for each node that requires them.
5
Click Execute Workflow to test, then activate.

Setup guide

Setup guide included

Purchase to unlock the full step-by-step guide

Reviews

No reviews yet

Be the first to buy and share your experience.

Leave a review

Free

No ratings yet

Create a free account to purchase workflows.

JSON blueprint — instant download
Setup guide PDF included
5 downloads · valid 30 days
Works with n8n

Evaluation metric example: Check if tool was called

About this workflow

How to import this n8n workflow

Setup guide

Step 1: Connect your apps

Step 2: Import the blueprint

Reviews