agent-gating

5 items · chronological order

2026-03-12

Financial Times 2026-03-12-1

The AI pension advisers are already here

50%+ of UK adults already use AI for financial guidance, yet the article buries the structural story: the marginal cost of personalized financial advice is collapsing to zero. JPMorgan's Bilton warns "always use a human adviser" — from a firm that killed Nutmeg and has $3T+ AUM to protect. The real question isn't whether AI gives wrong pension advice; it's whether a £15K/year advisory fee can survive a free alternative that improves with every interaction.

OpenAI JPMorgan Lloyds Banking Group Altruist FCA Perplexity Google Scottish Widows Nutmeg

permalink

2026-03-23

Fortune 2026-03-23-2

The Karpathy Loop: Autonomous Agent Optimization as Research Pattern

Karpathy's autoresearch ran 700 experiments in two days on a 630-line codebase: the result matters less than the pattern. The Karpathy Loop (agent + single file + testable metric + time limit) is the atomic unit of constrained autonomous optimization, and it generalizes to any problem with a measurable output and a modifiable code surface. The real competitive shift isn't building better agents; it's designing better constraints, metrics, and stopping criteria: taste becomes the bottleneck, not compute.

Andrej Karpathy Karpathy Loop autoresearch Tobias Lütke Shopify AutoML Eureka Labs

permalink

2026-04-26

Wall Street Journal 2026-04-26-3

AI Is Cannibalizing Human Intelligence (Vivienne Ming, WSJ)

Ming's Polymarket experiment splits human-AI usage into three measurable patterns: oracle (use the answer), validator (use AI to confirm priors), cyborg (use AI as sparring partner). Validators perform worse than AI alone — sycophancy laundered as evidence — while the 5-10% of cyborgs match or beat prediction-market consensus. The unbuilt premium category is AI that disagrees with you on purpose; today's benchmarks measure what AI does alone, not whether the product is building human capacity or consuming it.

Vivienne Ming Polymarket ChatGPT Gemini Anthropic OpenAI WSJ

permalink

2026-05-13

VentureBeat 2026-05-13-3

Anthropic Reinstates OpenClaw with Metered Agent SDK Credits: Compute Arbitrage Ends, Caching Becomes Pricing Substrate

Anthropic published the metering template every frontier lab will run by year-end. The May 13 restoration locks third-party agentic usage to API rates inside a non-rollover Agent SDK credit ($20 Pro, $100 Max 5x, $200 Max 20x), ending compute arbitrage and naming prompt cache hit rate, in Boris Cherny's words, as the published pricing primitive that separates flat-rate from metered inference. OpenAI and Google face identical inference economics; the lab that meters last bleeds margin.

Anthropic Claude OpenClaw Boris Cherny Claude Code Lydia Hallie Theo Browne Kun Chen Ben Hylak OpenAI Google Conductor Zed Raindrop.ai Cursor

permalink

2026-05-13

404 Media 2026-05-13-1

404 Media: Software Developers Say AI Is Rotting Their Brains

Performance reviews at FAANG and mid-tech now grade AI adoption, with one UX designer naming the dynamic exactly: "the actual quality of output doesn't matter as much as our willingness to participate." The "X percent of code is AI-generated" metric tech executives cite on earnings calls measures HR obedience contaminated by Goodhart at org-design scale, not output throughput. Almost no company is measuring the number that actually matters: production value net of verification cost.

404 Media Emanuel Maiberg Meta Microsoft Snap FAANG Cursor JetBrains Anthropic Claude Code Goodhart's Law MIT Reddit Hacker News

permalink