scaling-laws

6 items

WIRED · 2026-04-28 2026-05-01-w2

The Man Behind AlphaGo Thinks AI Is Taking the Wrong Path

David Silver raised $1.1B at a $5.1B valuation on the argument that LLMs are bounded by the human-data manifold, and that the only way out is RL-trained agents operating in simulation. The architectural evidence is real: AlphaGo's Move 37 came from outside the space of human play, and Sutton's Turing Award validates the theoretical foundation Silver is building on. What this week's picks clarify is that the capability argument is almost beside the point: the OpenAI goblin postmortem shows that even current systems can't reliably control what they're optimizing for, and Karpathy's MenuGen demo shows that the harness around the model is already more consequential than the model itself. Silver's unpriced bottleneck, reliable verifiers for unbounded domains, is also the missing piece in both of those stories. The next value pool isn't in bigger models or better prompts; it's in the infrastructure that tells you whether the output was actually right.

WIRED 2026-04-28-1

The Man Behind AlphaGo Thinks AI Is Taking the Wrong Path

David Silver left DeepMind to raise $1.1B at a $5.1B valuation for Ineffable Intelligence on the thesis that LLMs hit a ceiling defined by the human-data manifold, and that only RL-trained agents in simulation can break through. The architectural argument has teeth: AlphaGo's Move 37 came from outside human play, and Sutton just won the Turing Award for the foundational work. If Silver is right, the unspoken bottleneck isn't compute or data; it's verifiers, the reliable scoring functions needed for unbounded domains like science, governance, and novel discovery. That is the quiet investable category nobody is pricing yet.
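To make that category concrete, here is a minimal sketch of what a verifier reduces to in RL training. Every name is hypothetical; the point is that bounded domains admit cheap, exact scorers like the unit-test example below, while science and governance do not, yet.

```python
from typing import Callable, List, Protocol


class Verifier(Protocol):
    """Hypothetical interface for a 'reliable scoring function'."""

    def score(self, task: str, output: str) -> float:
        """Return a reward in [0, 1]; RL quality is bounded by how trustworthy this is."""
        ...


class UnitTestVerifier:
    """Bounded-domain case: code tasks, reward = fraction of tests passed."""

    def __init__(self, tests: List[Callable[[str], bool]]):
        self.tests = tests  # each test maps a candidate output to pass/fail

    def score(self, task: str, output: str) -> float:
        passed = sum(1 for test in self.tests if test(output))
        return passed / len(self.tests)


# Easy to write for Go positions or unit-tested code; nobody knows how to
# write the equivalent for 'was this a good scientific hypothesis'.
```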

LinkedIn 2026-04-12-2

The AI Discourse Gap: When Pundit Narratives Decouple from Verifiable Architecture

Gary Marcus found a 3,167-line TypeScript file that handles terminal output formatting and declared it proof that the neurosymbolic paradigm has arrived. The actual architecture, as documented in community analysis, is multi-agent orchestration, KAIROS scaffolding, and structured reasoning pipelines: good engineering around a model, a description that is both accurate and completely banal. Capital follows narratives before architecture, which is how the SoftBank/OpenAI mega-round closed on a scaling story months after practitioners had already documented diminishing pre-training returns.
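For readers unsure what separates "good engineering around a model" from a genuinely neurosymbolic system, a minimal hypothetical sketch (not KAIROS's actual design; call_model is a stand-in): control flow, prompt templates, and output formatting live in ordinary code, while all of the reasoning still happens inside the model call.

```python
def call_model(prompt: str) -> str:
    """Stand-in for an LLM API call (hypothetical)."""
    raise NotImplementedError


def orchestrate(task: str) -> str:
    """Multi-step pipeline: plan, execute each step, format the output.

    This is scaffolding around a model: the control flow and prompt
    templates are ordinary code, and every bit of 'reasoning' happens
    inside call_model. There is no symbol-manipulation engine here,
    however many lines the formatting layer grows to.
    """
    plan = call_model(f"Break this task into numbered steps:\n{task}")
    results = [
        call_model(f"Task: {task}\nStep: {step}\nDo this step.")
        for step in plan.splitlines()
        if step.strip()
    ]
    # Terminal output formatting: the banal part that can still run to
    # thousands of lines in a production codebase.
    return "\n".join(f"[{i + 1}] {r}" for i, r in enumerate(results))
```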

The Twenty Minute VC (20VC) 2026-04-08-1

Demis Hassabis on 20VC: AGI Timeline, LLM Non-Commoditization, and the Algorithmic Innovation Thesis

Hassabis argues frontier models won't commoditize because algorithmic innovation, not scaling spend, is the new differentiator: only 3-4 labs can still invent. What he conspicuously omits is inference economics; collapsing costs commoditize models at the useful-capability threshold regardless of what happens at the absolute frontier. The real signal is his "jagged intelligence" admission: if foundation models remain inconsistent, the durable moat lives in application-layer reliability engineering, not model access.
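The threshold argument is just arithmetic. A hedged sketch with illustrative numbers (none of them from the episode): at any steady cost decline, every model above a fixed usefulness bar converges to price competition on a predictable schedule.

```python
import math


def years_until_commodity(cost_now: float, cost_floor: float,
                          annual_decline: float = 0.5) -> float:
    """Years until cost_now decays below cost_floor at the given decline rate.

    Illustrative only: once per-token cost drops under the point where
    buyers stop caring, models at the useful-capability threshold become
    interchangeable on price, whatever the absolute frontier does.
    """
    return math.log(cost_floor / cost_now) / math.log(1 - annual_decline)


# e.g. $10 per 1M tokens today, buyers indifferent below $0.10:
print(round(years_until_commodity(10.0, 0.10), 1))  # -> 6.6 years at 50%/yr
```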

MIT CSAIL · 2026-03-19 2026-03-20-w1

MIT CSAIL: 80-90% of Frontier AI Performance Is Just Compute

The week's most clarifying number wasn't a revenue figure or a benchmark score: it was 40x, the compute efficiency variance MIT CSAIL found within individual labs producing frontier models, meaning a single lab can't reliably reproduce its own results even when it controls the spend. That internal inconsistency quietly dissolves the moat thesis from both directions: if the frontier is a spending race and the spending doesn't produce consistent outcomes, neither scale nor safety restrictions reliably compound into durable advantage. That framing lands harder alongside Ramp's transaction data, where the more expensive, supply-constrained product is growing fastest precisely because product differentiation has become so hard to verify that buyers are using price as a trust proxy. And it reframes the Morningstar moat downgrades: if 37 application-layer moats narrowed because AI compresses the cost of performing expertise, the labs producing the underlying models face the same compression one layer down. Pre-training scale is now a commodity floor, not a ceiling; the differentiation that actually moves enterprise purchasing decisions has migrated to post-training alignment and inference-time compute, layers that don't appear in any scaling regression.
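A hedged sketch of why that variance matters statistically, using synthetic data (nothing below is from the CSAIL study): a cross-sectional scaling regression recovers the compute exponent cleanly, while the residual spread, the part a single lab actually experiences run to run, stays on the order of the reported 40x.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic illustration: score follows a power law in effective compute,
# where effective compute = spend * a lognormal per-run efficiency term.
# sigma = ln(40)/4 makes a +/-2-sigma pair of runs differ by ~40x in
# effective compute, mirroring the reported within-lab variance.
sigma = np.log(40) / 4
spend = 10 ** rng.uniform(22, 26, size=200)          # training FLOPs
efficiency = rng.lognormal(mean=0.0, sigma=sigma, size=200)
score = 0.05 * (spend * efficiency) ** 0.1           # arbitrary exponent

# The cross-sectional regression still recovers the scaling exponent...
slope, intercept = np.polyfit(np.log(spend), np.log(score), 1)
print(f"fitted exponent ~ {slope:.2f}")              # ~ 0.10

# ...but the residuals are what a single lab lives with run to run:
residuals = np.log(score) - (slope * np.log(spend) + intercept)
implied_eff = np.exp(residuals / slope)              # per-run effective-compute multiplier
lo, hi = np.percentile(implied_eff, [2.5, 97.5])
print(f"central-95% effective-compute spread ~ {hi / lo:.0f}x")
```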

MIT CSAIL 2026-03-19-3

MIT CSAIL: 80-90% of Frontier AI Performance Is Just Compute

The study's headline finding confirms what everyone suspects: scale drives frontier performance. The buried finding inverts it: individual labs produce models with 40x compute efficiency variance, meaning they can't reliably reproduce their own results. If the frontier is a spending race and the spending doesn't produce consistent outcomes, the moat thesis weakens from both directions. The entire analysis is also blind to where differentiation actually moved: post-training alignment, tool use, and inference-time compute are now the layers where product quality diverges, and none of them show up in a pre-training scaling regression.
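One concrete instance of inference-time compute that a pre-training regression cannot see is best-of-n sampling against a scorer. A minimal sketch; call_model and score are hypothetical stand-ins, not any lab's API.

```python
def call_model(prompt: str) -> str:
    """Stand-in for one sampled LLM completion (hypothetical)."""
    raise NotImplementedError


def score(prompt: str, candidate: str) -> float:
    """Stand-in for a reward model or task verifier (hypothetical)."""
    raise NotImplementedError


def best_of_n(prompt: str, n: int = 8) -> str:
    """Spend compute at inference instead of pre-training: sample n
    candidates and keep the one the scorer prefers. Output quality
    scales with n and with scorer reliability, and neither variable
    exists in a pre-training scaling regression."""
    candidates = [call_model(prompt) for _ in range(n)]
    return max(candidates, key=lambda c: score(prompt, c))
```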