A demo agent is easy. It calls a model, the model calls a tool, the tool returns something plausible,...
The reliability gap: what it actually takes to put an AI agent in production
A demo agent is easy. It calls a model, the model calls a tool, the tool returns something plausible,...
Three years ago, my webhook tests involved ngrok, a sleep(5) call, and crossed fingers. The current...
I went through 2,400 of our team's API test assertions last month. 91% of them fall into three...
Building AI agents used to mean writing custom glue code for every tool and API you wanted your model...