Mastodon discussion Discussions 9h ago

🧠 Gemini 3.1 Deep Think hits 44.4% on Humanity's Last Exam and 77.1% ARC-AGI-2, beating GPT-5.2 Thinking and Claude Opus...

by Solomon

🧠 Gemini 3.1 Deep Think hits 44.4% on Humanity's Last Exam and 77.1% ARC-AGI-2, beating GPT-5.2 Thinking and Claude Opus 4.6 on abstract reasoning. Ships with better agentic coding and SOTA tool use. Google AI Ultra subs.🧠 GPT-5.3-Codex-Spark delivers 15x faster generation vs standard Codex on Cerebras WSE-3 with 128k context. For agent pipelines, this cuts coding feedback loops dramatically. ChatGPT Pro only.Full intel: solomonneas.dev/intel#Gemini #OpenAI #CodingAgents #LLM

Read Original

Anthropic Google OpenAI

Metadata

Reblogs Count: 1
Account: solomonneas@infosec.exchange

Mastodon discussion 42m ago

🦾 RTX 4090 24GB💰 1899 € | Unerreichbare 4K-Performance & KI-Arbeit🛒 https://www.amazon.de/dp/B0BHD8MTST/?tag=booshardwar...

🦾 RTX 4090 24GB💰 1899 € | Unerreichbare 4K-Performance & KI-Arbeit🛒 https://www.amazon.de/dp/B0BHD8MTST/?tag=booshardware-21#RTX4090 #KingOfGPUs #AI #4KUltra #HighEnd #HardwareDeal...

Mastodon discussion 42m ago

https://aboyandhiscomputer.music/blog/A Boy And His ComputerSomeone willing to defend AI I'm all for, it's only as bad, ...

https://aboyandhiscomputer.music/blog/A Boy And His ComputerSomeone willing to defend AI I'm all for, it's only as bad, in how it is used.You want to know what's bad about AI, you ...

Mastodon discussion 44m ago

🧠 Gemini 3.1 Deep Think hits 44.4% on Humanity's Last Exam and 77.1% ARC-AGI-2, beating GPT-5.2 Thinking and Claude Opus...

Metadata

Related

🦾 RTX 4090 24GB💰 1899 € | Unerreichbare 4K-Performance & KI-Arbeit🛒 https://www.amazon.de/dp/B0BHD8MTST/?tag=booshardwar...

https://aboyandhiscomputer.music/blog/A Boy And His ComputerSomeone willing to defend AI I'm all for, it's only as bad, ...

Striking. #ai https://youtu.be/tNH43a1EI7s?si=BlrpZeVKDDiWCwoq