Mastodon discussion Discussions 4h ago

CMU Benchmark: Claude Mythos Hits 9.9/16 on V8 Exploits, GPT-5.5 Trails at 5.5CMU's ExploitBench shows Claude Mythos sco...

CMU Benchmark: Claude Mythos Hits 9.9/16 on V8 Exploits, GPT-5.5 Trails at 5.5CMU's ExploitBench shows Claude Mythos scores 9.9/16 on V8 exploits vs GPT-5.5's 5.5, but costs $36,428 per run — 12x more. The cost-performance tradeoff is the real story.https://gentic.news/article/cmu-benchmark-claude-mythos-hits-9#AI #ArtificialIntelligence #Tech

Read Original

Anthropic Benchmark OpenAI

Metadata

Reblogs Count: 1
Account: genticnews

Mastodon discussion 26m ago

🧠 Forse non siamo più solo all’inizio della capacità tecnica dell’#AI.📈 Siamo all’inizio del suo impatto reale su aziend...

🧠 Forse non siamo più solo all’inizio della capacità tecnica dell’#AI.📈 Siamo all’inizio del suo impatto reale su aziende, lavoro e società.👉 Alcune riflessioni: https://www.linked...

Mastodon discussion 33m ago

How to fight AI if you need to or get the chance.https://siliconreckoner.substack.com/p/questions-to-ask-ai-boosters#AI ...

How to fight AI if you need to or get the chance.https://siliconreckoner.substack.com/p/questions-to-ask-ai-boosters#AI #slop #environment #economics #StopAI

Mastodon discussion 35m ago

llama.cpp lands Multi-Token Prediction support with up to 1.8x speedups, OpenAI hands ChatGPT Plus to an entire country,...

llama.cpp lands Multi-Token Prediction support with up to 1.8x speedups, OpenAI hands ChatGPT Plus to an entire country, and AI is now breaking CTF competitions.https://ai0.news/po...

CMU Benchmark: Claude Mythos Hits 9.9/16 on V8 Exploits, GPT-5.5 Trails at 5.5CMU's ExploitBench shows Claude Mythos sco...

Metadata

Related

🧠 Forse non siamo più solo all’inizio della capacità tecnica dell’#AI.📈 Siamo all’inizio del suo impatto reale su aziend...

How to fight AI if you need to or get the chance.https://siliconreckoner.substack.com/p/questions-to-ask-ai-boosters#AI ...

llama.cpp lands Multi-Token Prediction support with up to 1.8x speedups, OpenAI hands ChatGPT Plus to an entire country,...