Microsoft unveils Surface RTX Spark Dev Box for local AI

Microsoft has unveiled the Surface RTX Spark Dev Box, a compact desktop aimed at local-first AI development, according to Microsoft's devices blog and coverage from Tom's Hardware, VentureBeat, and TechSpot. The machine is built on Nvidia's RTX Spark superchip, which pairs a Blackwell-architecture GPU with an Arm-based Grace CPU to deliver up to one petaflop of AI compute and 128GB of unified memory. Microsoft and reviewers say that is enough to load and run models exceeding 120 billion parameters locally, with large context windows, without sending data to the cloud. An anodized aluminum chassis doubles as the cooling system within a 100-watt thermal envelope, and the box ships with developer tools such as Visual Studio Code, WSL, and PowerShell 7 preinstalled. Microsoft says it will be available in the United States later this year; pricing has not been announced, though analysts estimate roughly $3,000 to $3,500.
What happened
Microsoft has unveiled the Surface RTX Spark Dev Box, a compact desktop aimed at local-first AI development, according to Microsoft's devices blog and coverage from Tom's Hardware, VentureBeat, and TechSpot. The system is built on Nvidia's RTX Spark superchip, which combines a Blackwell-architecture GPU with an Arm-based Grace CPU.
Technical details
Microsoft and reviewers report up to one petaflop of AI compute and 128GB of unified memory, enough to load and run models exceeding 120 billion parameters locally with large context windows, without sending data to the cloud. An anodized aluminum chassis doubles as the cooling system within a 100-watt thermal envelope, larger than the 45-to-80-watt envelopes of RTX Spark laptops. The box ships with developer tooling such as Visual Studio Code, WSL, and PowerShell 7 preinstalled. Microsoft says it will be available in the United States later this year; pricing is unannounced, with analyst estimates around $3,000 to $3,500.
Editorial analysis (generic industry)
Hardware vendors are increasingly targeting developer workflows for on-device AI, pairing high unified memory with sustained thermal designs to extend local inference and lightweight fine-tuning. For teams concerned about latency, data sovereignty, or cloud cost during iterative development, machines like this make local-first experimentation more practical, though real value depends on pricing relative to cloud instances and on the maturity of Arm-native developer tooling and model runtimes.
Scoring Rationale
A compact developer machine that can run 120B-plus parameter models locally meaningfully lowers the friction and cost of on-device experimentation, which matters to ML practitioners and developer teams. It is a notable hardware product rather than a new model or paradigm, and final impact will depend on pricing and software-stack maturity.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

