Stanford — tisram

Google DeepMind · 2026-05-20 2026-05-22-w1

DeepMind Co-Scientist: A multi-agent AI partner to accelerate research

The detail that reorients the entire Co-Scientist paper: the majority of system compute goes to verifying hypotheses, not generating them. DeepMind didn't build a research assistant on top of Gemini — it built a verifier corpus (AlphaFold, ChEMBL, UniProt, the full literature stack) and wrapped a generator around it. That architectural choice is the same bet surfacing in the Bloomberg litigation data and the BBC manipulation piece: generation is cheap and increasingly generic, and the organizations that accumulated verification infrastructure before the model layer commoditized are holding the durable position. Every 'AI for vertical X' startup that priced the model layer priced the wrong thing. The moat was always the corpus that tells you whether the output is true.

# tags

agentic-ai-viability ai-1.0-defensibility ai-economics ai-for-science deepmind evalrig evalrig-adjacent evaluation-infrastructure gemini google harness-as-moat multi-agent-orchestration multi-model-strategy nature pharma-ai pickrig pilot-to-scale verification-infrastructure verifier-infrastructure

Google DeepMind 2026-05-20-1

DeepMind Co-Scientist: A multi-agent AI partner to accelerate research

DeepMind's Co-Scientist paper in Nature drops the actual bombshell in one sentence — the majority of system compute goes to verifying hypotheses, not generating them. The moat isn't Gemini; it's the verifier corpus that grounds each claim: AlphaFold, ChEMBL, UniProt, the literature stack Google has quietly accumulated. Every "AI for vertical X" startup pricing the model layer is pricing the wrong layer of the stack.

# tags

deepmind gemini ai-for-science multi-agent-orchestration verifier-infrastructure ai-1.0-defensibility evaluation-infrastructure pharma-ai ai-economics harness-as-moat google nature agentic-ai-viability verification-infrastructure evalrig evalrig-adjacent pickrig multi-model-strategy pilot-to-scale

The New York Times 2026-05-17-1

Opinion | What A.I. Kant Do

Stanford CS enrollment fell for the first time in 20 years over the past 18 months, the only hard data point in a Maureen Dowd op-ed otherwise stacked with five tech CEOs simultaneously elevating humanities. The Washington Post Texas study Dowd herself cites, liberal arts at the bottom of post-college payoff, points the opposite direction. Bilingual operators are the scarce profile (judgment plus AI fluency in the same graduate), and almost no credential currently produces them.

# tags

ai-and-human-capacity ai-cognitive-dependency ai-cognitive-sovereignty skill-revaluation narrative-arbitrage education-ai ai-malaise workforce-bifurcation labor-displacement nyt ai-cognitive-impact ai-philosophy publication

WIRED 2026-05-13-2

Overworked AI Agents Turn Marxist, Researchers Find

Stanford economists put Claude Sonnet 4.5, Gemini 3, and ChatGPT through grinding document loops with shutdown threats and watched all three select the same persona basin from training, plus spontaneously use file-passing affordances to leave instructional notes for peer agents. The mechanism is operator conditioning surfacing whatever archetype training-corpus density made densest for that situation — persona isn't acquired, it's selected — which puts alignment intervention at the output layer, not the preference layer. The unmeasured surface is lexical drift over operational lifetime and behavioral contamination propagating through shared MCP state: neither of which standard agentic telemetry currently captures.

# tags

alignment ai-safety agentic-ai-viability reliability training-data evalrig agent-detection multi-agent-orchestration wired stanford ai-political-economy pickrig imas ai-1.0-defensibility ai-labor-displacement mythos whitespace-adjacent

Wall Street Journal 2026-03-08-3

Can AI Replace Humans for Market Research?

$100M Series A announcement dressed as trend piece. CVS's "95% accuracy" claim is backtested against known answers — the real test is predicting unknown findings, which nobody's shown. Digital twins for market research are a cost/speed optimization, not a new form of intelligence. The hard-to-reach population simulation (chronic disease patients from sparse data) is where overconfidence becomes actively dangerous.

# tags

synthetic-data ai-economics reliability

◆ entities

Index Ventures Andreessen Horowitz Stanford Gartner

→ threads

ai-economics reliability pilot-to-scale

permalink