1
Anthropic has filed confidentially for a US IPO at a valuation of approximately $965 billion, which will force public disclosure of revenue, margins, inference costs, and enterprise retention data for the first time.
2
Claude Opus 4.8 scored 1.5% on ARC-AGI-3, which is still low in absolute terms but far ahead of all other models. Observers noted that it reasoned at a higher level of abstraction, modeling objects rather than treating inputs as raw pixels.
3
On the Deep Suite software engineering benchmark (113 contamination-free tasks, 91 repos, 5 languages), GPT-5.5 still outperforms Claude Opus 4.8, though the Ultra Code effort mode was not tested.
4
Rumors point to GPT-5.6 and GPT-5.6 Pro appearing in OpenAI Codex backlogs, with reported improvements in coding and agentic abilities and a potential 1.5-million-token context window. An announcement may come at an imminent livestream.
5
Wes Roth built a full autonomous economic simulation game with Claude Opus 4.8 in Ultra Code mode. He is using the game as an independent LLM benchmark, alongside up to 10 custom benchmarks he is developing.