semiconductor

3 items

Dwarkesh Podcast 2026-05-28-1

Reiner Pope on Chip Design from the Bottom Up: Data Movement Dominates Arithmetic 7-to-1, B300's FP4-FP8 Gap as First Crack in NVIDIA's FLOPS Marketing, Splittable Systolic Arrays as Maddox's Architectural Wedge

NVIDIA's B300 datasheet ships FP4 at 3x FP8 speed where precision-scaling theory says 4x — the first public number that doesn't square with marketed FLOPS as a benchmark. The durable accelerator moat is array geometry plus memory hierarchy, not transistor budget: that's why Maddox, Majestic, Groq, and Cerebras all exist as funded alternatives, each architecture matched to a workload profile the general-purpose chip handles inefficiently. By 2027, enterprise procurement moves from NVIDIA versus not to which architectural bet fits the inference batch size.

Bloomberg 2026-04-25-2

Meta Strikes Multibillion-Dollar Deal to Use Amazon Chips for AI Projects

Meta is renting hundreds of thousands of Graviton chips from AWS for multiple billions; Graviton is a CPU, not an accelerator. The consensus is measuring AI capex by GPU count, but at production scale the CPU layer, which handles feature serving, retrieval, ranking, and orchestration, runs roughly 5-10x the accelerator unit count. This deal is the first explicit public signal that reframes general-purpose CPU compute as a distinct AI infrastructure category, and it means the total AI infrastructure commitment envelope is materially larger than accelerator-only framings capture.

Financial Times 2026-03-28-3

Memory chip stocks shed $100bn as AI-driven shortage trade unwinds

A single Google Research paper on model compression wiped $100 billion from memory chip stocks in five days. Micron dropped 15%; SanDisk, the best S&P 500 performer in 2025, shed $15 billion in market cap. Morgan Stanley's defense was textbook Jevons: efficiency expands demand. But the market just revealed a new risk class: AI efficiency research as a first-order investment catalyst. The next compression paper is already being written; the question is whether you see it before or after the sell-off.