NVIDIA Dominates MLPerf Inference v6 Benchmarks

NVIDIA submitted MLPerf Inference v6.0 results on April 1, 2026, reporting that its Blackwell Ultra platform delivered the highest throughput and lowest token cost across the suite. The company cites up to 2.77x tokens-per-second speedups over GB200 NVL72 and ninefold training wins over the nearest competitor, attributing the gains to hardware and software co-design. These results matter for large-scale inference deployments and total cost of ownership (TCO) calculations.
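The link between throughput and token cost can be made concrete with a back-of-the-envelope calculation: at a fixed hourly cost per system, cost per token scales inversely with tokens per second. The sketch below is illustrative only; the hourly rate and baseline throughput are hypothetical placeholders, not NVIDIA's published figures.

```python
# Illustrative sketch: how a throughput speedup feeds into cost per token.
# hourly_cost_usd and tokens_per_sec below are hypothetical placeholders.

def cost_per_million_tokens(hourly_cost_usd: float, tokens_per_sec: float) -> float:
    """Dollar cost to generate one million tokens at a given sustained throughput."""
    tokens_per_hour = tokens_per_sec * 3600
    return hourly_cost_usd / tokens_per_hour * 1_000_000

# Hypothetical baseline system: $98/hour, 10,000 tokens/sec sustained.
baseline = cost_per_million_tokens(hourly_cost_usd=98.0, tokens_per_sec=10_000)

# A 2.77x throughput gain at the same hourly cost cuts token cost by the same factor.
faster = cost_per_million_tokens(hourly_cost_usd=98.0, tokens_per_sec=10_000 * 2.77)

assert abs(baseline / faster - 2.77) < 1e-9
```

The takeaway is that a claimed 2.77x throughput gain translates directly into a 2.77x reduction in cost per token, provided the faster system's hourly cost is unchanged; in practice the newer hardware's price premium offsets part of that reduction.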
Scoring Rationale
High score: official MLPerf v6.0 submissions show broad workload coverage and industry-wide performance leadership, and credibility benefits from submitted benchmark results and NVIDIA's transparency. Novelty is moderate, since this is a performance improvement rather than a new architecture, and the score is slightly reduced for vendor-authored coverage with limited technical detail.
Sources
- NVIDIA Is Among the First to Submit MLPerf Inference v6.0 Benchmarks With Blackwell Ultra, and It’s Total Domination Over Competitors (wccftech.com)



