Nvidia

18 items

The Verge 2026-06-02-1

Microsoft to unveil new AI models and Windows improvements at Build

Build 2026 is a developer-trust-repair operation with a second plot running underneath it. Microsoft is assembling the full OpenAI-independence stack: its first reasoning model trained without distillation, its own image models, a new agent, and a hard push toward local inference on Windows silicon. The "no distillation" detail is the tell — Microsoft wants to prove it can train reasoning without learning from another model's outputs.

⟷ links
art_20260403_microsoft-mid-class-model-admission-compart_20260529_engadget-microsoft-s-buttoned-up-copilot2026-03-27-32026-04-07-2
The Verge 2026-06-02-3

Microsoft and OpenAI broke up — now they're ready to fight

At Build 2026, Suleyman did the rarest thing an AI exec can do: ranked his own company outside the top tier. The humility is the strategy, not a weakness. Microsoft is shipping from-scratch models, custom silicon, and a vendor-neutral Windows-native harness while explicitly competing on cost, distribution, and 11,000-model optionality rather than capability. The frontier-lab leaderboard the press scores is the wrong scoreboard; whoever owns enterprise distribution, governance, and the cheapest good-enough model captures the value, and Microsoft is deliberately choosing to fight there.

WIRED 2026-05-27-3

AI Agents Plunged the Tech World Into Chaos. Here's Exactly How That Happened

OpenClaw plus NemoClaw is Linux Foundation plus Red Hat compressed from decades to months: 366K GitHub stars in under six months, Jensen Huang allocating 10 minutes of GTC 2026 to it, Nvidia shipping a 'more secure' enterprise variant before the upstream OSS turned one year old, and OpenAI capturing the founder talent that Anthropic answered with legal notices. The new agent-strategy question for every enterprise is now binary: upstream OSS, enterprise hardener, or neither, with 'neither' the dead zone. WIRED's 4,000-word canonization names the verification gap in a single closing sentence, which is the signal: verification, governance, and FinOps are the 12-24 month accumulation window the celebration forgot.

Axios 2026-05-21-2

Two hours that changed AI

Anthropic's first profitable quarter is the wrong headline. The $559M of operating profit will fund $1.25B per month of compute commitments to Elon Musk's SpaceX through 2029 — roughly $15B per year flowing to a single counterparty who also runs xAI. Lab IPO valuations need a compute-supplier-concentration discount that nobody is modeling, and Axios packaging six scheduled disclosures as "two hours that changed AI" is itself the late-cycle consensus marker.

Financial Times 2026-05-20-2

Klement: The Impossible Maths of the AI Boom

Klement's FT op-ed makes the cleanest bear case to date: hyperscaler capex grows 20 percent annually through 2030 against 15 percent revenue growth, and under a zero-cost assumption the implied ROI is highly negative for every hyperscaler except Amazon. Clearing a 10 percent return requires 2 to 5 trillion in additional annual revenue against a current 1.5 trillion base. The methodology is opaque and the Amazon exception goes unexplained, but the piece's real signal is positional: when the bear case migrates from Substack to FT op-ed pages, with Chancellor, Constan, WSJ Heard on the Street, and Munster all aligned within five weeks, the consensus has moved. The contrarian trade is now bull on capex sustainability, contingent on smooth IPO absorption and one quarter of hyperscaler AI revenue acceleration outpacing capex growth.

⟷ links
art_20260520_klement-impossible-maths-ai-boom-ftart_20260514_andy-constan-on-investing-through-bubbleart_20260514_edward-chancellor-on-ai-capital-cycle-caart_20260430_clock-ticking-big-tech-ai-payart_20260519_munster-clinton-excess-returns-ai-19952026-03-08-12026-04-14-22026-03-27-22026-03-26-32026-04-17-w32026-04-05-12026-03-27-w22026-04-08-12026-04-10-12026-04-17-32026-04-25-32026-04-30-12026-05-01-22026-05-11-32026-05-13-2
NBC News · 2026-05-14 2026-05-15-w2

OpenEvidence: Most physicians quietly use this medical AI tool

OpenAI launched ChatGPT for Clinicians in April without licensing NEJM or JAMA. OpenEvidence has both, and the market repriced it from $1B to $12B in 15 months on the back of 65% US physician reach and 27 million April clinical encounters. The binding constraint for entering credentialed verticals was never model quality; it was licensed-data governance and the operational-regime approval that comes with it. The Deployment Company and the LF Networking pattern this week are structurally identical: the moat that holds isn't capability, it's the layer of credential, distribution, or implementation sitting above it. For frontier labs, that means the verticals with the clearest content-licensing moats (clinical, legal, financial) will reprice fastest against whoever shows up without the corpus.

NBC News 2026-05-14-2

OpenEvidence: Most physicians quietly use this medical AI tool

OpenAI launched ChatGPT for Clinicians in April without licensing NEJM or JAMA. OpenEvidence has both, hit 65% of US physicians across 27 million April clinical encounters, and got repriced from $1B to $12B in 15 months. The binding constraint for frontier labs entering credentialed verticals is content licensing, not model capability, and OpenAI just supplied the revealed-preference proof.

Wall Street Journal 2026-04-29-2

AI Worries Have Returned to Wall Street. Now Come Earnings.

April 28 was the first day the AI trade split in two: Oracle, CoreWeave, and SoftBank fell 4-9% on OpenAI's missed revenue and user targets while Adobe, Salesforce, and ServiceNow rose. Same news, opposite direction; the market stopped pricing OpenAI counterparties as cloud infrastructure stocks. They are receivables now, and the multiple compresses until non-OpenAI revenue concentration is demonstrated.

Bloomberg 2026-04-25-2

Meta Strikes Multibillion-Dollar Deal to Use Amazon Chips for AI Projects

Meta is renting hundreds of thousands of Graviton chips from AWS for multiple billions; Graviton is a CPU, not an accelerator. The consensus is measuring AI capex by GPU count, but at production scale the CPU layer, which handles feature serving, retrieval, ranking, and orchestration, runs roughly 5-10x the accelerator unit count. This deal is the first explicit public signal that reframes general-purpose CPU compute as a distinct AI infrastructure category, and it means the total AI infrastructure commitment envelope is materially larger than accelerator-only framings capture.

FT Alphaville 2026-03-25-3

Charting the OpenAI 'ecosystem'

Morgan Stanley's forensic accounting team maps the OpenAI commitment web: $30B from Nvidia, $300B to Oracle, $100B from AMD with warrants, $250B to Azure. The accounting team's own conclusion: disclosures can't keep pace with transaction sophistication. Oracle didn't disclose that a single OpenAI contract drove most of its $318B RPO growth. The investable question isn't whether AI infrastructure is a bubble; it's whether the accounting can even tell you. AMD's 160M warrants to OpenAI mean headline deal values include equity sweeteners that distort real compute pricing. Every contract number needs decomposing into cash-equivalent compute plus warrant component. If the people whose job is to evaluate this can't fully map the risk, enterprise buyers making multi-year compute commitments are flying blind.

CNBC 2026-03-24-2

Nvidia's Huang pitches AI tokens on top of salary as agents reshape how humans work

Jensen Huang isn't selling GPUs at GTC: he's selling the accounting category that makes buying them non-discretionary. Tokens-as-compensation reclassifies compute from IT discretionary to people cost; if that framing sticks, AI budgets become as unkillable as headcount. The buried lede is the 80-85% AI project failure rate since 2018 sitting in paragraph 25 while Huang envisions "hundreds of thousands of digital employees" in paragraph 7. That gap between aspiration and execution is the real signal: the demand narrative for compute is bulletproof, but agent reliability at scale remains the unpriced risk.

The Economist 2026-03-21-3

Nvidia's Full-Stack Reinvention: The $65B Portfolio Isn't a Moat, It's a Dependency Map

The Economist's GTC week profile frames Nvidia's expansion into networking, CPUs, models, and sovereign AI as a strategic reinvention; the article never asks the margin question. Nvidia's $216B revenue at ~73% gross margin is a GPU monopoly number: networking, CPU-only servers, and government bundles don't carry that margin. The $65B investment portfolio ($30B in OpenAI alone) is presented as ecosystem lock-in, but OpenAI already runs inference on Azure custom silicon. The portfolio isn't a moat; it's a subsidy that masks true cost-of-compute and unwinds the moment inference gets cheap enough on non-Nvidia hardware. The buried structural risk: three hyperscalers account for over half of receivables, and those same three are the ones building the substitutes.

CNBC 2026-03-17-1

Nvidia GTC Preview: Why the CPU is Taking Center Stage

Agentic AI creates genuine CPU demand expansion: orchestration is sequential, CPU-bound work that GPUs can't do. Nvidia's "standalone CPU" story is really a coprocessor story, though; Grace and Vera are optimized to feed GPUs, not compete for general-purpose workloads at 6.2% share and 72 cores vs. 128. The higher-signal play is NVLink licensing, where Nvidia captures networking value regardless of whose CPU fills the socket.

Wall Street Journal 2026-03-17-2

Can Nvidia's Dominance Survive the Sea Change Under Way in AI Computing?

Nvidia's 73% GPU margins are structurally incompatible with an efficiency-first inference economy, but the displacement story isn't "Cerebras replaces Nvidia." Inference is heterogeneous, and Nvidia is racing to sell all three form factors: GPU for training, CPU for orchestration, LPU for inference throughput. The transition from monopolist-margin chipmaker to platform-margin integrator is the real architectural bet at GTC this year.

New York Times 2026-03-17-3

Nvidia Built the A.I. Era. Now It Has to Defend It.

Nvidia is the first major chipmaker to unbundle training from inference at the architecture level, pairing its GPUs with Groq's inference-optimized LPUs in a $20B licensing deal. The supply chain math is as interesting as the product: Groq on Samsung fab with no HBM dependency sidesteps both TSMC allocation constraints and memory chip shortages. If inference grows to 70-80% of total AI compute spend, the companies building chip-agnostic inference routing will capture a new middleware layer that doesn't exist yet.

Meta 2026-03-14-1

Meta and AMD Partner for 6GW AI Infrastructure Agreement

The "6GW" ceiling is a negotiating lever, not an engineering plan: classic dual-sourcing to pressure Nvidia on price and allocation. Zuckerberg's precise language ("efficient inference compute") tells you AMD wins the commodity inference layer while Nvidia retains training. Two weeks later, Nvidia paid $150M to keep AMD GPUs out of the Stargate expansion; the training/inference hardware split is hardening into separate supply chains.

WIRED 2026-03-14-3

Nvidia Will Spend $26B to Build Open-Weight AI Models

Complement strategy disguised as frontier ambition: $26B in open-weight models optimized for Nvidia silicon, given away free to ensure the ecosystem stays on their hardware. The defensive trigger is visible; Chinese open models (DeepSeek, Qwen) are becoming the global default, and Meta's retreat from fully open Llama creates the US vacuum Nvidia is filling.

Bloomberg 2026-03-10-1

Oracle and OpenAI End Plans to Expand Flagship Stargate Data Center

Nvidia paid $150M to a DC developer to ensure its GPUs — not AMD's — fill the expansion, making it an infrastructure intermediary, not just a chip vendor. The deeper signal: OpenAI's "often-changing demand forecasting" suggests even the largest training compute buyer is uncertain about forward requirements, cracking the infinite-linear-scaling thesis. Cooling failures taking buildings offline in winter are the first concrete evidence of operational fragility at hyperscale AI density.