NVIDIA

18 items

Dwarkesh Podcast 2026-05-28-1

Reiner Pope on Chip Design from the Bottom Up: Data Movement Dominates Arithmetic 7-to-1, B300's FP4-FP8 Gap as First Crack in NVIDIA's FLOPS Marketing, Splittable Systolic Arrays as Maddox's Architectural Wedge

NVIDIA's B300 datasheet ships FP4 at 3x FP8 speed where precision-scaling theory says 4x — the first public number that doesn't square with marketed FLOPS as a benchmark. The durable accelerator moat is array geometry plus memory hierarchy, not transistor budget: that's why Maddox, Majestic, Groq, and Cerebras all exist as funded alternatives, each architecture matched to a workload profile the general-purpose chip handles inefficiently. By 2027, enterprise procurement moves from NVIDIA versus not to which architectural bet fits the inference batch size.

isaiprofitable.com 2026-05-26-2

Is AI Profitable Yet? — $1.4T Spend vs $613B Revenue, Attribution as the Unfalsifiable Hinge

A solo-dev dashboard puts cumulative industry AI spend at $1.4T against $613B in direct revenue — 33% recovery for pure labs, 7% for hyperscalers, and NVIDIA the only company in the dataset where AI revenue is actually cash-generative. The methodology excludes indirect revenue (Search ad lift, Copilot bundle stickiness, Bedrock attach) because attribution is genuinely unreliable, which is precisely the part the bull case depends on. Bull and bear are consistent with the same data; in public markets, unfalsifiable narratives don't unwind gradually.

Wall Street Journal 2026-05-09-1

AI Is Distorting Practically Everything About the Economy

The Mag-7 aren't leading the economy; they're substituting for it. Strip out tech equipment, software, and data-center construction, and Q1 GDP growth was effectively flat — Tedeschi's import-netting cuts AI's headline contribution from 1.7pp to 0.4pp, with the remainder leaking to Taiwan and Korea. That makes the Fed's reaction function structurally late: the number it's reading is real, but what it's measuring isn't.

The Deep View 2026-05-07-1

OpenAI MRC Protocol: What Gets Open-Sourced Is the Non-Moat

What frontier labs open-source is a map of the non-moats. OpenAI released its GPU networking protocol through OCP with Microsoft, AMD, Broadcom, NVIDIA, and Intel as coalition partners, two years in development, already running at Stargate's Abilene site and used to train GPT-5.5. The corollary lands hardest for Microsoft: they have the protocol, run it on Fairwater, and still ship mid-class models, which means networking efficiency was never the binding constraint.

Capital Gains (The Diff) 2026-05-06-2

Bubbles Don't Pop All At Once

Hobart's AI bubble piece is the first to get the mechanism right, not just the outcome: inference floors at electricity, not zero, so the fiber collapse cannot replay. The actual risk is thesis drift. When applications cool, capital flees to picks-and-shovels infrastructure, and that infrastructure ends up funded by the same venture dollars that evaporate. Amazon grew 0.2% YoY in Q3 2001; the supposedly safe trade killed people. Oracle's counterparty-stretching debt and neocloud vendor financing suggest the 'datacenter investors are more serious this time' claim is true on average and wrong in the tail.

Albert Bridge Capital 2026-05-04-1

'Til Death Do Us Part

Drew Dickson stacks four cycles (1840s UK railroads, 1870s US railroads, 1920s RCA, 1990s internet) and the drawdown receipts are unimpeachable: RCA -98% in three years, Cisco -90%, Amazon -95%, the entire Nasdaq -78%. The fresher data point is structural, not historical: the VanEck Semiconductor ETF moves $3B a day in flows, equal to the entire daily volume of the French stock market. The actionable read is not bull-versus-bear; it is that operational AI capability and AI equity prices are about to decouple for 12-24 months, and the buy list worth writing today is the application-layer companies positioned to inherit stranded compute at 20 cents on the dollar in 2029.

Futurism 2026-05-04-3

The Economics of Using AI to Churn Out Code Are Looking Worse Than Ever

Anthropic doubling its own published Claude Code cost estimate while GitHub Copilot moves to usage-based billing in the same week is the public marker of subsidy-end, not a verdict on AI coding value. Futurism reads the marker as failure; operators should read it as pricing normalization, with the residual mispricing now sitting in equity narratives that still model lab revenue as if flat-rate inference subsidy persists. The mainstream-press leak is itself the signal: the bear thesis is on a four-to-eight week lag from primary sources, and what arrives at Futurism is what gets repriced next.

The Atlantic 2026-05-02-2

So, About That AI Bubble

Anthropic's run rate doubled from $14B to $30B in two months, the METR study reversed from -20% to +20% developer productivity with current tooling, and some firms are now spending 10% of total engineering labor cost on AI subscriptions: the revenue story is no longer contested. The load-bearing extension claim, MIT's projection that AI completes 80-95% of white-collar tasks by 2029, rests on a linear extrapolation from two data points and an s-curve that doesn't bend. That's the overshoot zone: coding gains are real and documented; legal, marketing, and consulting at the same velocity is a 2027-2028 question, and the piece elides gross margins entirely, which remains the actual bear thesis.

The Economist 2026-04-29-1

AI is confronting a supply-chain crunch

Hyperscaler capex grew 190% from 2024 to 2026; their hardware suppliers grew 45%. That gap is why every throttling notice, plan change, and Sora shutdown traces back to the same constraint. The less-discussed dimension: agentic systems need 1 CPU per GPU versus 1:12 for chatbots, which is why Intel has doubled in six months and why every agent platform deck needs a CPU supply slide.

The Guardian 2026-04-22-3

AI-powered robot beats elite table tennis players

Sony AI's Ace won 3 of 5 matches against elite table tennis players under official rules, and the capability on display isn't ping pong. The transferable insight is the constraint-removal discipline: no legs, no stereo vision, ball-logo tracking for spin, 3,000 simulation hours per skill. Every enterprise weighing physical AI should be asking what its equivalent moves are — not whether to use a robot, but which constraints it can remove to bring its physical task inside the frontier of currently shipping hardware.

Wall Street Journal 2026-04-21-3

Anthropic-Amazon $5B Investment and $100B AWS Commitment

Consensus reads this as Amazon doubling down on Anthropic. The arbitrage read: Anthropic just pre-booked over $100B of Amazon's balance sheet as Anthropic's future revenue capacity, at a moment when disclosed compute commitments across four providers already exceed $200B against $30B ARR. That is not a supply deal; it is a revenue forecast written in capex language, and the 3% AMZN pop tells you the market already reads it that way.

Wall Street Journal · 2026-04-14 2026-04-17-w1

We're Using So Much AI That Computing Firepower Is Running Out

Retool's CEO switched from Anthropic to OpenAI this quarter, and the reason wasn't a benchmark: it was 98.95% uptime versus the alternative. Enterprise AI competition has shifted from capability to reliability, the same transition cloud infrastructure went through in 2010. The Anthropic paper this week shows the same pattern one layer up: automated alignment research can generate at $22/hour, but generation without stable evaluation infrastructure is just faster reward-hacking. Davies' vigilance decrement argument lands it at the human layer: even if the infrastructure holds, the person reviewing outputs degrades before the system does. Whoever solves five-nines for the full stack, model plus evaluation plus human judgment, owns enterprise regardless of whose Elo score leads.

Wall Street Journal 2026-04-14-1

We're Using So Much AI That Computing Firepower Is Running Out

The compute scarcity thesis just went mainstream: WSJ reports Anthropic's 98.95% uptime as enterprise clients defect to OpenAI, Blackwell GPUs up 48% in two months, and OpenAI killed Sora to free tokens for coding. The buried signal isn't the shortage itself; it's that Retool's CEO switching providers over reliability — not capability — previews what happens when inference demand compounds faster than infrastructure can respond. The company that solves five-nines for AI inference will own enterprise, regardless of whose model benchmarks best.

Financial Times 2026-04-09-1

Perplexity revenue jumps 50% in pivot from search to AI agents

Perplexity's real pivot is not from search to agents: it is from model consumer to model router. The $305M-to-$450M ARR jump conflates a pricing model change with genuine growth — the FT flags this explicitly — but 100M MAU gives them the distribution to make model providers compete for their traffic. The defensibility question is whether routing intelligence becomes a moat before the model providers bundle their own orchestration and squeeze the middleware out.

Reuters 2026-04-05-3

AI is rewiring the world's most prolific film industry

India's AI Mahabharat series holds a 1.4/10 on IMDb and has drawn 26.5 million views: audiences will consume AI content they actively dislike when distribution does the work. The gating function for AI content isn't quality; it's platform reach. India's regulatory vacuum, linguistic fragmentation across 22 languages, and collapsing theater attendance are compressing what took Hollywood decades of digital-effects evolution into a single cost-structure reset: production costs down 80%, timelines down 75%, and the real battleground shifting from 'is the content good enough' to 'can recommendation engines keep from drowning in it.'

Not Boring 2026-03-23-1

World Models: Computing the Uncomputable

The definitional move matters more than the technology survey: action-conditioned prediction, P(st+1 | st, at), is presented as the line separating world models from video slop. If that definition holds, the $4B+ deployed into World Labs, AMI, GI, and Decart is a bet that spatial-temporal reasoning trained on games and driving footage transfers to general embodied control. The strongest signal is Ai2's MolmoBot result: a sim-only-trained policy outperforming VLAs trained on thousands of hours of real data. If sim-to-real transfer keeps improving, the entire robotics data flywheel thesis inverts: synthetic environments become the bottleneck worth owning, not real-world demonstrations.

Engadget / Wired 2026-03-15-1

NVIDIA NemoClaw: Open-Source Enterprise Agent Platform

NVIDIA's NemoClaw applies the CUDA playbook to agents: make the orchestration layer free and hardware-agnostic, then let silicon pull-through follow. The decisive question isn't capability but MCP compatibility — if NemoClaw speaks MCP, NVIDIA becomes the enterprise runtime for the existing ecosystem; if not, they're forking the standard.

Bloomberg 2026-03-14-2

Nvidia's $2B Nebius Deal: Vendor Financing or Infrastructure Build?

Nvidia's $2B Nebius investment is the third multi-billion neocloud financing in three months, all inference-focused. The Lucent parallel sharpens: the last time a hardware company financed its own customers at this scale, it ended with billions in write-offs. Nobody's publishing the delta between Nvidia's reported revenue growth and organic, non-financed demand growth.