tpu

2 items

Dwarkesh Podcast 2026-05-28-1

Reiner Pope on Chip Design from the Bottom Up: Data Movement Dominates Arithmetic 7-to-1, B300's FP4-FP8 Gap as First Crack in NVIDIA's FLOPS Marketing, Splittable Systolic Arrays as Maddox's Architectural Wedge

NVIDIA's B300 datasheet lists FP4 at 3x FP8 speed where precision-scaling theory says 4x, the first public number that doesn't square with marketed FLOPS as a benchmark. The durable accelerator moat is array geometry plus memory hierarchy, not transistor budget. That is why Maddox, Majestic, Groq, and Cerebras all exist as funded alternatives, each matching its architecture to a workload profile the general-purpose chip handles inefficiently. By 2027, the enterprise procurement question moves from "NVIDIA or not" to which architectural bet fits the inference batch size.

Wall Street Journal 2026-05-25-1

Anthropic Q2: $10.9B Revenue, $559M Operating Profit, Compute-to-Revenue 71¢→56¢ — Cost-Structure Asymmetry Bifurcates the AI Bubble Thesis

Anthropic disclosed to investors, in projections WSJ reviewed, Q2 revenue of $10.9B versus $4.8B in Q1, with $559M operating profit and compute-to-revenue down from 71¢ to 56¢. The 56¢ ratio is the first published frontier-lab data point that materially decouples profitability from NVIDIA silicon and Microsoft-circular financing. The bubble call now applies to OpenAI-Microsoft specifically, not the sector. The reseller-gross accounting, which OpenAI's CRO already disputes, is the post-IPO short-report flashpoint to watch.
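
As a rough check on how the quoted figures relate, the sketch below derives a few ratios from the numbers in the summary above. It assumes the 56¢ figure means compute cost per dollar of Q2 revenue and that operating profit is revenue minus all costs. Neither definition is confirmed by the source, and the function and key names are illustrative only.

```python
# Figures as quoted in the summary (USD billions).
Q1_REVENUE = 4.8
Q2_REVENUE = 10.9
Q2_OPERATING_PROFIT = 0.559
# Assumption: 56 cents of compute cost per dollar of Q2 revenue.
Q2_COMPUTE_PER_REVENUE_DOLLAR = 0.56


def derived_metrics(q1_rev, q2_rev, q2_op_profit, compute_ratio):
    """Derive back-of-envelope ratios from the quoted figures."""
    compute_cost = compute_ratio * q2_rev
    return {
        # Quarter-over-quarter revenue multiple.
        "revenue_growth_multiple": q2_rev / q1_rev,
        # Operating profit as a fraction of revenue.
        "operating_margin": q2_op_profit / q2_rev,
        # Compute spend implied by the ratio (assumed definition).
        "implied_compute_cost": compute_cost,
        # Whatever is left after compute and operating profit.
        "implied_other_costs": q2_rev - compute_cost - q2_op_profit,
    }


metrics = derived_metrics(
    Q1_REVENUE, Q2_REVENUE, Q2_OPERATING_PROFIT, Q2_COMPUTE_PER_REVENUE_DOLLAR
)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under these assumptions the implied compute spend is about $6.1B, and the operating margin is thin relative to the revenue jump, so the summary's emphasis on the compute ratio rather than on profit itself is consistent with the numbers.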