agent-gating
5 items · chronological order
The AI pension advisers are already here
50%+ of UK adults already use AI for financial guidance, yet the article buries the structural story: the marginal cost of personalized financial advice is collapsing to zero. JPMorgan's Bilton warns "always use a human adviser" — from a firm that killed Nutmeg and has $3T+ AUM to protect. The real question isn't whether AI gives wrong pension advice; it's whether a £15K/year advisory fee can survive a free alternative that improves with every interaction.
The Karpathy Loop: Autonomous Agent Optimization as Research Pattern
Karpathy's autoresearch ran 700 experiments in two days on a 630-line codebase: the result matters less than the pattern. The Karpathy Loop (agent + single file + testable metric + time limit) is the atomic unit of constrained autonomous optimization, and it generalizes to any problem with a measurable output and a modifiable code surface. The real competitive shift isn't building better agents; it's designing better constraints, metrics, and stopping criteria: taste becomes the bottleneck, not compute.
AI Is Cannibalizing Human Intelligence (Vivienne Ming, WSJ)
Ming's Polymarket experiment splits human-AI usage into three measurable patterns: oracle (use the answer), validator (use AI to confirm priors), cyborg (use AI as sparring partner). Validators perform worse than AI alone — sycophancy laundered as evidence — while the 5-10% of cyborgs match or beat prediction-market consensus. The unbuilt premium category is AI that disagrees with you on purpose; today's benchmarks measure what AI does alone, not whether the product is building human capacity or consuming it.
Anthropic Reinstates OpenClaw with Metered Agent SDK Credits: Compute Arbitrage Ends, Caching Becomes Pricing Substrate
Anthropic published the metering template every frontier lab will run by year-end. The May 13 restoration locks third-party agentic usage to API rates inside a non-rollover Agent SDK credit ($20 Pro, $100 Max 5x, $200 Max 20x), ending compute arbitrage and naming prompt cache hit rate, in Boris Cherny's words, as the published pricing primitive that separates flat-rate from metered inference. OpenAI and Google face identical inference economics; the lab that meters last bleeds margin.
404 Media: Software Developers Say AI Is Rotting Their Brains
Performance reviews at FAANG and mid-tech now grade AI adoption, with one UX designer naming the dynamic exactly: "the actual quality of output doesn't matter as much as our willingness to participate." The "X percent of code is AI-generated" metric tech executives cite on earnings calls measures HR obedience contaminated by Goodhart at org-design scale, not output throughput. Almost no company is measuring the number that actually matters: production value net of verification cost.