I open-sourced a 3-agent blind eval team. Any agent runtime can call it for pre-commitment review of its own plans.

Shipped this weekend: a 3-agent blind cross-lab evaluation workflow on heym, MIT licensed, callable...
