Apple restricts on-device AI to select hardware, subscription

At WWDC 2026, Apple unveiled a Gemini-powered Siri AI and a next generation of Apple Intelligence; Apple says Siri AI will arrive in English later this year. 9to5Mac, MacStories, and TechRadar report that Apple's most powerful on-device model is limited to iPhone 17 Pro, iPhone 17 Pro Max, and iPhone Air, plus iPads with M4 or later, Macs with M3 or later, and Apple Vision Pro, each with at least 12 GB of RAM. Apple runs foundation models both on-device and through Private Cloud Compute, and at a WWDC media event Craig Federighi said, "We believe privacy in AI is non-negotiable" (The Verge; The Next Web). Server-side image generation carries daily limits that users can raise with an iCloud+ subscription, and Apple's new foundation models were custom-built in collaboration with Google's Gemini, according to TechCrunch and The Next Web.
What happened
At its WWDC 2026 keynote, Apple announced a rebuilt, Gemini-powered Siri AI and a next generation of Apple Intelligence, with Siri AI slated to arrive in English later this year (Apple). Multiple outlets describe it as Apple's most significant Siri overhaul in years, spanning iOS 27, iPadOS 27, and macOS "Golden Gate" (The Verge; TechRadar).
Hardware and memory gates
Apple's most powerful on-device model, along with the features it enables such as expressive voices and advanced dictation, is restricted to newer, higher-memory hardware: iPhone 17 Pro, iPhone 17 Pro Max, and iPhone Air; iPads with M4 or later; Macs with M3 or later; and Apple Vision Pro, each with at least 12 GB of RAM (9to5Mac; MacStories; TechRadar). Apple frames the memory floor as necessary to preserve latency, energy, and reliability for local model execution (MacStories).
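For readers who want the reported gate in concrete terms, the rules above can be written as a small eligibility check. This is a minimal sketch and not an Apple API: the `Device` type, the `supports_top_on_device_model` function, and the way chip generations are encoded as integers are all hypothetical, chosen only to mirror the device list and 12 GB floor reported by the outlets cited.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical encoding of the reported gate. These names and values are
# illustrative and are not Apple identifiers.
MIN_RAM_GB = 12
ELIGIBLE_IPHONES = {"iPhone 17 Pro", "iPhone 17 Pro Max", "iPhone Air"}
MIN_M_SERIES = {"iPad": 4, "Mac": 3}  # iPads: M4 or later; Macs: M3 or later


@dataclass(frozen=True)
class Device:
    family: str              # "iPhone", "iPad", "Mac", or "Vision Pro"
    model: str               # marketing name, e.g. "iPhone 17 Pro"
    m_series: Optional[int]  # M-series generation, or None if not applicable
    ram_gb: int


def supports_top_on_device_model(device: Device) -> bool:
    """Return True if the device passes the reported hardware and RAM gate."""
    # The memory floor applies to every family.
    if device.ram_gb < MIN_RAM_GB:
        return False
    # iPhones are gated by an explicit model list.
    if device.family == "iPhone":
        return device.model in ELIGIBLE_IPHONES
    # Vision Pro is listed without a chip condition in the reports.
    if device.family == "Vision Pro":
        return True
    # iPads and Macs are gated by a minimum M-series generation.
    floor = MIN_M_SERIES.get(device.family)
    return (
        floor is not None
        and device.m_series is not None
        and device.m_series >= floor
    )
```

A caller would build a `Device` from whatever hardware information it has and branch on the result. Apple has not been reported to expose the check in this form, so the sketch only restates the published criteria.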
Hybrid architecture and Gemini
Apple runs foundation models both on-device and through Private Cloud Compute, its server-side privacy tier (MacStories; The Verge). At a WWDC media event, Apple software chief Craig Federighi said, "We believe privacy in AI is non-negotiable," and said Apple uses none of the models Google deploys to its own customers (The Verge; The Next Web). Apple's new foundation models were custom-built in collaboration with Google's Gemini, and heavier generative features, including photorealistic image generation in Image Playground, rely on cloud models (TechCrunch; The Next Web).
Subscription and regional limits
Because server-side features such as image generation run on powerful cloud models, they carry daily usage limits that users can raise through most iCloud+ subscription plans (TechCrunch). Coverage also notes regional staggering: Notebookcheck reports Siri AI is initially restricted on iPhone and iPad in the EU while remaining usable on Macs set to English (Notebookcheck).
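The daily-limit model described above amounts to a per-user counter that resets each day, with the cap depending on subscription tier. The following is a minimal sketch under stated assumptions: the coverage cited here does not give the actual limits, so the numbers in `DAILY_CAPS` are placeholders, and the `DailyQuota` class and tier names are hypothetical rather than anything Apple has published.

```python
from collections import defaultdict
from datetime import date

# Placeholder caps. The reports say limits exist and that iCloud+ raises them,
# but give no figures; these numbers are invented for illustration only.
DAILY_CAPS = {"free": 5, "icloud_plus": 50}


class DailyQuota:
    """Track per-user, per-day usage against a tier-dependent cap."""

    def __init__(self, caps=None):
        self._caps = dict(DAILY_CAPS if caps is None else caps)
        self._used = defaultdict(int)  # (user_id, day) -> requests used

    def remaining(self, user_id: str, tier: str, day: date) -> int:
        """Requests left today for this user at the given tier."""
        return max(0, self._caps[tier] - self._used[(user_id, day)])

    def try_consume(self, user_id: str, tier: str, day: date) -> bool:
        """Record one request if the cap allows it; return whether it did."""
        if self.remaining(user_id, tier, day) == 0:
            return False
        self._used[(user_id, day)] += 1
        return True
```

Because usage is keyed by calendar day, a new day starts from zero without an explicit reset, and a user who upgrades mid-day gains the difference between the two caps. Teams building on frequent generation would likely need to plan around whichever tier most of their users hold.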
Why it matters
Apple is a major platform owner, so its hardware gates and delivery model shape developer expectations and purchase decisions. Many Apple Intelligence features still run on older hardware via cloud or Private Cloud Compute, while the highest-tier on-device capabilities are reserved for newer, higher-RAM machines, affecting latency, offline use, and feature availability. Apple's reliance on Gemini for parts of the stack also signals tolerance for third-party cloud dependencies rather than a fully in-house, on-device pipeline for every feature.
What to watch
- How Apple documents exact hardware and memory checks, and whether developers get APIs to detect on-device versus cloud-backed capability.
- How iCloud+ usage caps affect teams building features that lean on frequent model-backed generation.
- Regional rollout timing and any expansion of on-device support to additional devices.
Scoring Rationale
Apple's WWDC 2026 rollout of a Gemini-powered Siri AI and next-generation Apple Intelligence is a major platform-level AI release from a dominant vendor, with concrete, well-sourced hardware and subscription constraints that shape developer and device decisions. It is scored as notable rather than industry-shaking because this event covers the hardware and subscription-gating slice of a broader keynote already represented across multiple feed items.