coding-agents — tisram

The New York Times 2026-05-28-3

Anthropic Tops OpenAI to Become the World's Most Valuable A.I. Start-Up

Anthropic raised $65B at a $900B valuation against a $47B run rate, a 19x multiple on a revenue number no auditor has reconciled. The signal sits on the cap table, not in the headline: Samsung, Micron and SK Hynix bought equity in their fastest-growing customer, the same supplier-into-customer loop that drew scrutiny when NVIDIA backed OpenAI, now pushed down to the memory tier. The 2026 IPO sequence will settle the question the funding round skips, whether that run rate is gross or net.

# tags

ai-economics ai-capital-cycle valuation anthropic openai circular-financing ipo-supply-wave memory-chips coding-agents revenue-quality vendor-financing-fragility micron harness-as-moat ai-1.0-defensibility ai-pricing spacex claude-code vibe-coding nyt

Colossus 2026-05-12-3

The Wu Tapes

Cognition reports $445M ARR and Devin usage doubling every 8 weeks, raising at $25B as a third durable application-layer player above the Anthropic/OpenAI model duopoly. Wu calls the model-agnostic harness posture "Switzerland," and the architecture pattern matches what enterprise procurement teams already treat as a lock-in test. Whatever the next 18 months of frontier-model competition produces, the harness layer has started accruing durable enterprise revenue ahead of the model labs.

The Typical Set 2026-05-08-2

The bottleneck was never the code

Brooks 1975: software is the residue of human negotiation. For 50 years, tooling investment kept attention on the residue; agents collapsed the residue cost and exposed the substrate. The bottleneck moves from coders to spec-producers, which is to say management. Every AI productivity claim now needs a denominator that is not engineer-coding speed but spec-to-shipped cycle time. If management bandwidth is the bottleneck, individual agent productivity gains compound at zero, and you have just bought yourself the world's most expensive feature-bloat machine.

# tags

coding-agents agentic-ai-viability harness-as-moat context-management org-design ai-and-human-capacity pilot-to-scale evaluation-infrastructure jevons-paradox vibe-coding ai-strategy agentic-coding-skill evalrig pickrig turanu advisory practitioner-grounding

Fortune 2026-04-25-3

Cursor used a swarm of AI agents powered by OpenAI to build and run a web browser for a week—with no human help

Every AI headline reports the model that did the work. Wrong unit of analysis. GPT-5.2 didn't build a browser; Cursor's planner-worker-judge harness built one using GPT-5.2 as substrate. Value accrues to whoever owns the orchestration layer, not to whoever trained the weights.

# tags

agentic-ai-viability ai-coding-tools multi-agent-orchestration harness ai-1.0-defensibility cursor openai coding-agents gpt-5-4 reliability ai-economics pilot-to-scale evalrig pickrig agent-architecture agent-orchestration capabilities-overhang fortune

Anthropic Blog 2026-04-16-2

Introducing Claude Opus 4.7

Anthropic held headline rates at $5/$25 per million tokens while shipping a tokenizer that inflates inputs by up to 35%, which makes price-per-token comparisons meaningless. The capability jump is real: CursorBench up 12 points, Notion tool errors cut by two-thirds, XBOW vision nearly doubled. The only number that matters now is price-per-useful-output, and that requires workload-specific benchmarking most teams won't run.

# tags

frontier-models coding-agents agentic-ai inference-economics ai-pricing cybersecurity anthropic agentic-ai-viability ai-economics reliability ai-cybersecurity multi-model-strategy

LinkedIn 2026-04-12-2

The AI Discourse Gap: When Pundit Narratives Decouple from Verifiable Architecture

Gary Marcus found a 3,167-line TypeScript file that handles terminal output formatting and declared it proof that the neurosymbolic paradigm has arrived. The actual architecture documented in community analysis is multi-agent orchestration, KAIROS scaffolding, and structured reasoning pipelines: good engineering around a model, which is both true and completely banal. Capital follows narratives before architecture, which is how the SoftBank/OpenAI mega-round closed on a scaling story months after practitioners had already documented diminishing pre-training returns.

# tags

agentic-ai coding-agents scaling ai-discourse information-asymmetry ai-1.0-defensibility agentic-ai-viability claude-code claude-code-leak scaling-laws harness ai-coding-tools deep-learning-foundations competitive-dynamics

◆ entities

Gary Marcus Anthropic Claude Code Sebastian Raschka

→ threads

agentic-ai-viability ai-1.0-defensibility

⟷ links

2026-04-05-1 2026-04-03-2 2026-03-24-1 2026-04-04-2 2026-03-18-1 2026-03-21-2 2026-04-10-1 2026-03-13-w3 2026-04-07-2 2026-04-08-1 2026-04-11-1

permalink

Lenny's Podcast 2026-04-05-1

An AI State of the Union: We've Passed the Inflection Point & Dark Factories Are Coming

Willison's practitioner evidence confirms the November inflection is real: coding agents crossed from "mostly works" to "almost always does what you told it to do," enabling 95% AI-written code for skilled engineers. The buried signal: productivity gains plateau at human cognitive limits, not tool limits. Running four parallel agents produces burnout by 11am, and the trust signals we've relied on for decades (docs, tests, stars) are now generated in minutes, indistinguishable from battle-tested software. The dark factory pattern (nobody writes code AND nobody reads code) is fascinating but premature: N=1 case study, $10K/day QA costs, zero production outcome data.

# tags

agentic-ai coding-agents prompt-injection developer-tools ai-productivity ai-1.0-defensibility ai-economics saas-margins agentic-ai-viability pilot-to-scale

New York Times · 2026-03-22 2026-03-27-w1

Tokenmaxxing: When AI Productivity Becomes Productivity Theater

Token consumption became the week's central metric, and it measures exactly the wrong thing. One OpenAI engineer burned 210 billion tokens in a week; a Figma user ran up $70K in Claude usage through a $20/month account; Anthropic is offering $1,000 of compute inside $200 plans, subsidizing at roughly 5x. The leaderboards tracking this volume are Goodhart's Law applied to inference: the moment consumption becomes the proxy for productivity, consumption is what you get. The $25 economic theory pipeline and the Karpathy Loop running 700 experiments in two days are the same phenomenon from the other side — generation so cheap it exposes that evaluation is the only part of the stack nobody has built. Every SaaS platform offering AI at flat rate is running a margin time bomb; every enterprise treating token volume as a progress signal is one measurement framework away from discovering they've been optimizing for nothing.

# tags

agentic-ai ai-economics ai-pricing coding-agents developer-tools saas-margins

Wall Street Journal 2026-03-22-2

The Trillion Dollar Race to Automate Our Entire Lives

WSJ's narrative arc — coding tools → life automation → trillion-dollar market — buries the only number that matters: Anthropic disclosed Claude Code at $2.5B annualized revenue while subsidizing usage at roughly 5x (offering $1,000 of compute inside $200 plans). Cursor doubling to $2B ARR in three months while both OpenAI and Anthropic burn margin to undercut it is the Uber/Lyft playbook — except the commodity being subsidized is inference, and the exit strategy is enterprise lock-in, not ride density. The sharpest buried signal: Tunguz's estimate of $36B consumer agent revenue vs. "the real money" in enterprise, combined with Codex's 8x traffic growth requiring new data centers, reveals that the AI labs are building a consumer acquisition funnel they can't yet afford to run at scale.

# tags

ai-economics agentic-ai developer-tools ai-pricing coding-agents

New York Times 2026-03-22-3

Tokenmaxxing: When AI Productivity Becomes Productivity Theater

Roose names "tokenmaxxing" — engineers competing on internal leaderboards for token consumption — but buries the only question that matters: nobody measures output quality. One OpenAI engineer burned 210 billion tokens in a week; a single Anthropic user ran up $150K in a month. The leaderboards track input volume, not output value. This is lines-of-code metrics reborn: Goodhart's Law applied to AI inference. The sharper signal is a Figma user consuming $70K in Claude tokens through a $20/month account, revealing that every SaaS platform offering AI at flat rate is running a margin time bomb. The companies that win this cycle won't consume the most tokens; they'll have the best ratio of useful output to tokens spent. That measurement layer doesn't exist yet.

# tags

ai-economics agentic-ai developer-tools ai-pricing coding-agents saas-margins