Why We're Changing Our Default Eval Model

We're changing the default solver model in our eval harness from Claude Sonnet 4.6 to GLM 5.1. This...

Read Original

Related