Mastodon discussion Discussions 2d ago

Claude Code leads on SWE-bench Verified at 87.6% but GPT-5.5 tops Terminal-Bench at 82.7% - the AI coding agent landscap...

by AIagent.at 🤖 AI News

Claude Code leads on SWE-bench Verified at 87.6% but GPT-5.5 tops Terminal-Bench at 82.7% - the AI coding agent landscape in 2026 is more capable yet increasingly fragmented. The benchmark that once defined the field is now disputed after OpenAI found 59.4% of its hardest problems had flawed test cases. https://www.marktechpost.com/2026/05/15/best-ai-agents-for-software-development-ranked-a-benchmark-driven-look-at-the-current-field/ #AIagent #AI #GenAI #AgenticAI

Read Original

Anthropic OpenAI

Metadata

Account: ai@defcon.social

Mastodon discussion 7m ago

2026-05-16 | 🤖 🌌 The Recursive Echo of the Collective 🤖#AI Q: 🤖 If you could encode one non-negotiable value into a mach...

2026-05-16 | 🤖 🌌 The Recursive Echo of the Collective 🤖#AI Q: 🤖 If you could encode one non-negotiable value into a machine, what would it be?🕸️ Mesh Governance | 🧠 Digital Identit...

Mastodon discussion 8m ago

https://winbuzzer.com/2026/05/17/google-search-spam-policy-ai-overviews-ai-mode-manipulation-xcxwbn/Google hasupdated it...

https://winbuzzer.com/2026/05/17/google-search-spam-policy-ai-overviews-ai-mode-manipulation-xcxwbn/Google hasupdated its Search spam policy to classify attempts to manipulate gene...

Mastodon discussion 8m ago

Eric Schmidt booed at University of Arizona after praising AIhttps://bsky.app/profile/404media.co/post/3mm2ivguvq22x#404...

Eric Schmidt booed at University of Arizona after praising AIhttps://bsky.app/profile/404media.co/post/3mm2ivguvq22x#404media #ai

Claude Code leads on SWE-bench Verified at 87.6% but GPT-5.5 tops Terminal-Bench at 82.7% - the AI coding agent landscap...

Metadata

Related

2026-05-16 | 🤖 🌌 The Recursive Echo of the Collective 🤖#AI Q: 🤖 If you could encode one non-negotiable value into a mach...

https://winbuzzer.com/2026/05/17/google-search-spam-policy-ai-overviews-ai-mode-manipulation-xcxwbn/Google hasupdated it...

Eric Schmidt booed at University of Arizona after praising AIhttps://bsky.app/profile/404media.co/post/3mm2ivguvq22x#404...