3 items

All three pieces are about the same underlying problem: frontier AI labs now hold capabilities that outpace the governance infrastructure built to oversee them, and the institutions trying to fill that gap — independent safety researchers, government evaluators, enterprise buyers — are being shaped by lab decisions made for other reasons entirely.

tanyaverma.sh 2026-04-13-1

The Closing of the Frontier

Two-thirds of MATS symposium research posters ran on Chinese open-source models because Anthropic's Mythos restrictions closed off Western frontier access to independent safety researchers. The safety case for restricted access is degrading the safety research pipeline it claims to protect. The policy question isn't content moderation: it's whether frontier model access needs due process obligations the way utilities do.

The Verge 2026-04-13-2

OpenAI CRO Memo: Platform War Thesis, Amazon Distribution, and the Anthropic Revenue Accounting Battle

OpenAI's CRO spending four paragraphs rebutting Anthropic's 'fear, restriction, elites' positioning in a Q2 sales memo is revealed preference: you don't rebut what isn't landing with enterprise buyers. The more consequential line is buried: 'the biggest bottleneck is no longer whether the technology works, it's whether companies can deploy it successfully.' That's OpenAI officially declaring the deployment race primary, with the $8B run rate attack on Anthropic reading as pre-IPO narrative anchoring, falsifiable when both S-1s drop.

UK AI Security Institute 2026-04-13-3

AISI Evaluation of Claude Mythos Preview's Cyber Capabilities

A UK government lab confirmed Mythos can autonomously execute a 32-step corporate network attack end-to-end, outperforming every tested model including GPT-5, with performance still scaling at the 100M token ceiling. The evaluation tested capability against undefended ranges, so what AISI validated is threat potential, not operational impact against a real defended environment. The structural shift is that government evaluation infrastructure is becoming the third-party verification layer for frontier AI claims, sitting between self-reported lab benchmarks and the market the way FDA trials sit between pharma and prescribers.