How to diagnose where your RAG agent fabricates: an open-source A/B eval workflow with cross-lab blind judges

TL;DR: I caught my own RAG agent telling a customer with a severe nut allergy which dishes were...

Read Original

Related