Nvidia Unveils RTX Spark AI Superchip for PCs

Nvidia unveiled the RTX Spark PC "superchip" at COMPUTEX, a system-on-chip designed to run local AI agents with 1 petaflop of AI compute and up to 128GB of unified memory, according to Nvidia's blog and Engadget. The chip was developed with Taiwan's MediaTek and in collaboration with Microsoft, per Reuters and Nvidia. Nvidia said OEMs including Dell, HP, Lenovo, ASUS, Microsoft Surface and MSI will ship RTX Spark systems this fall, Reuters reports. Nvidia also introduced the Vera CPU, which Reuters reports is already being tested by OpenAI, Anthropic and SpaceX. Market reaction included sharp falls in shares of rival chip makers, Reuters and Newser report. Jensen Huang said RTX Spark aims to enable PCs to run persistent agent workflows on-device, per Engadget.
What happened
Nvidia unveiled RTX Spark, a new Arm-based system-on-chip for Windows laptops and compact desktops, at COMPUTEX 2026, according to Nvidia's blog and reporting by Engadget. Nvidia and Engadget report the company claims 1 petaflop of AI compute and a unified memory pool configurable up to 128GB, with power use scalable up to 80W. Reuters reports the chip was developed with Taiwan's MediaTek and in collaboration with Microsoft, and that OEMs including Dell, HP, Lenovo, ASUS, Microsoft Surface and MSI will ship RTX Spark systems this fall. Reuters also reports Nvidia introduced a new CPU called Vera, which it says is being trialed by OpenAI, Anthropic and SpaceX.
Technical details
Nvidia's product blog and Engadget describe RTX Spark as combining an Arm CPU cluster with a Blackwell-derived GPU and tensor cores, and an NPU sufficient for Microsoft Copilot+ (a 40 TOPS target referenced by Nvidia). Engadget and Tom's Hardware report RTX Spark includes 6,144 Blackwell RTX cores and up to 128GB of unified memory, and is positioned as similar in performance to an RTX 5070 laptop GPU at lower power. Nvidia's blog highlights an ecosystem stack - including llama.cpp optimizations in NVIDIA OpenShell and Windows security primitives - intended to run local agents and developer toolkits on-device.
Industry context
Editorial analysis: The announcement fits a broader industry push to move inference and "agentic" workflows from cloud to edge. Companies deploying local inference typically prioritize memory bandwidth, unified memory architectures, and power-efficiency to host larger models or ensembles on-device. Observers quoted by Reuters, including Neil Shah of Counterpoint Research, framed the launch as a potential inflection for personal computing, while market moves reported by Reuters and Newser showed immediate investor sensitivity around incumbents in PC and mobile processors.
For practitioners
Editorial analysis: Developers and ML engineers should watch three technical tradeoffs driving adoption of on-device agents: model size versus quantization and operator support; unified memory and zero-copy data paths for multi-modal workloads; and toolchain compatibility with existing Windows x86 and Arm application stacks. The Nvidia blog emphasizes NVIDIA OpenShell and integration with Microsoft security primitives, which will matter for reproducing cloud LLM behaviors locally while meeting platform security constraints.
What to watch
For practitioners: monitor real-world OEM system configurations and thermal/power limits once review units ship this fall, availability of model runtimes optimized for RTX Spark (including llama.cpp and upstream frameworks), and concrete benchmarks for agent-style multi-step tasks. Also track Vera's ecosystem adoption and any independent benchmarks from reviewers that measure inference throughput, memory pressure for multi-model agents, and latency for conversational/agent workloads.
Quoted material
Jensen Huang said onstage at COMPUTEX, "Today, when you think about your phone, the one thing you don't do with it is make phone calls. You do just about everything else," as reported by Engadget. Reuters quoted Counterpoint Research co-founder Neil Shah saying the RTX Spark "looks to transform the traditional app-centric PC to a real useful Agentic AI personal computer."
Scoring Rationale
This is a major hardware announcement targeting inference and agentic workflows on consumer PCs, which directly affects deployment choices, tooling, and performance tradeoffs for practitioners. The story is industry-shaking but not paradigm-altering on its own.
Practice with real Health & Insurance data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Health & Insurance problems


