OpenAI updates ChatGPT with GPT-5.5 Instant

OpenAI updated ChatGPT's default model to GPT-5.5 Instant, the company announced in a blog post on May 5, 2026. According to the post, internal evaluations show GPT-5.5 Instant produced 52.5% fewer hallucinated claims on high-stakes prompts and 37.3% fewer inaccurate claims on conversations users had flagged for factual errors. TechCrunch reports the model scored 81.2 on the AIME 2025 math test versus 65.4 for GPT-5.3 Instant, and outscored its predecessor on the MMMU-Pro multimodal benchmark 76 to 69.2. Reporting from TechCrunch, Axios, and The Verge notes the update adds a "memory sources" control and broader use of past chats, uploaded files, and Gmail for context, with personalization first available to Plus and Pro web users. TechCrunch also reports developers can access the model via the API as chat-latest, and GPT-5.3 Instant will remain an option for three months.
What happened
OpenAI announced on May 5, 2026 that ChatGPT's default Instant model is now GPT-5.5 Instant, replacing the prior GPT-5.3 Instant, per OpenAI's blog post. The post states that internal evaluations found GPT-5.5 Instant produced 52.5% fewer hallucinated claims on "high-stakes" prompts and 37.3% fewer inaccurate claims on conversations users had flagged for factual errors. TechCrunch reports that the new model scored 81.2 on the AIME 2025 math test versus 65.4 for the older model, and 76 on the MMMU-Pro multimodal reasoning benchmark compared with 69.2 for GPT-5.3 Instant. TechCrunch and Axios report that OpenAI will roll the model out to all ChatGPT users, with personalization features initially available to Plus and Pro users on the web and wider availability planned in the coming weeks.
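To put the reported benchmark scores in perspective, the short Python sketch below computes the absolute and relative gains from the figures TechCrunch cites. The helper name `score_gain` and the dictionary layout are illustrative choices, not part of any published tooling, and the calculation says nothing about how the benchmarks were run.

```python
# Scores as reported by TechCrunch: (GPT-5.5 Instant, GPT-5.3 Instant).
reported_scores = {
    "AIME 2025": (81.2, 65.4),
    "MMMU-Pro": (76.0, 69.2),
}


def score_gain(new, old):
    """Return (absolute gain in points, relative gain in percent), rounded to 1 decimal."""
    absolute = new - old
    relative = 100.0 * absolute / old
    return round(absolute, 1), round(relative, 1)


# Hypothetical helper output: one (absolute, relative) pair per benchmark.
gains = {name: score_gain(new, old) for name, (new, old) in reported_scores.items()}

for name, (absolute, relative) in gains.items():
    print(f"{name}: +{absolute} points ({relative}% relative)")
```

On these figures, the AIME 2025 gain is 15.8 points (about 24.2% relative) and the MMMU-Pro gain is 6.8 points (about 9.8% relative).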
Technical details
OpenAI's blog post and platform reporting describe two feature areas in the release: tighter factuality and expanded context linking. The announcement highlights improvements in analyzing image uploads, STEM reasoning, and deciding when to use web search, according to OpenAI's blog post and coverage in The Verge. TechCrunch and OpenAI developer documentation also report that the model will be exposed to developers as the chat-latest API option, while GPT-5.3 Instant will remain selectable for three months.
Editorial analysis - technical context
Industry-pattern observations: vendors commonly reduce hallucinations by combining improved base models, targeted benchmark tuning, and retrieval or context-augmentation systems. The published gains here, a 52.5% reduction in hallucinated claims on internally defined high-stakes prompts and the benchmark improvements reported by TechCrunch, are consistent with incremental progress rather than a paradigm shift. The introduction of explicit "memory sources" and tighter use of user context aligns with a broader trend toward models that mix local context, retrieval, and selective web calls to improve factual grounding, at the cost of greater privacy exposure and a larger surface area for data governance.
Context and significance
For practitioners: default-model changes matter because they reset the baseline behavior that many end users and integrations rely on. The availability of chat-latest in the API means developers should validate behavior and cost implications in their own flows rather than assume parity with GPT-5.3 Instant. The memory sources control gives end users visibility into which context items informed an answer, which can aid auditing and debugging of personalized responses but also increases the surface for privacy review and access controls.
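One way to validate behavior rather than assume parity is a small regression check that flags prompts the previous model handled correctly and the new one does not. The Python sketch below is a minimal illustration under stated assumptions: `find_regressions`, the stub functions, and the canned answers are all hypothetical, and no real API is called. In practice, each callable would wrap whatever client a team already uses for the old and new model identifiers.

```python
from typing import Callable, Dict, List


def find_regressions(
    cases: Dict[str, str],
    old_model: Callable[[str], str],
    new_model: Callable[[str], str],
) -> List[str]:
    """Return prompts where the old answer contained the expected
    substring (case-insensitive) but the new answer does not."""
    regressions = []
    for prompt, expected in cases.items():
        needle = expected.lower()
        old_ok = needle in old_model(prompt).lower()
        new_ok = needle in new_model(prompt).lower()
        if old_ok and not new_ok:
            regressions.append(prompt)
    return regressions


# Hypothetical test cases: prompt -> substring the answer must contain.
cases = {
    "Capital of France?": "Paris",
    "2 + 2 = ?": "4",
}

# Canned outputs standing in for real model responses (assumptions for the demo).
OLD_ANSWERS = {
    "Capital of France?": "Paris is the capital.",
    "2 + 2 = ?": "The answer is 4.",
}
NEW_ANSWERS = {
    "Capital of France?": "The capital is Paris.",
    "2 + 2 = ?": "The answer is four.",
}


def stub_old(prompt: str) -> str:
    return OLD_ANSWERS[prompt]


def stub_new(prompt: str) -> str:
    return NEW_ANSWERS[prompt]


regressions = find_regressions(cases, stub_old, stub_new)
print(regressions)
```

Substring matching is deliberately crude; here it flags the arithmetic prompt only because the stubbed new answer spells out "four". A real suite would likely need format-tolerant checks, plus separate tracking of token cost and latency per call.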
What to watch
Industry observers and practitioners will look for independent replications of the AIME and MMMU-Pro gains reported by TechCrunch, and for real-world regression testing from apps that relied on the prior default. Also worth watching are the user experience and granularity of the new memory sources control, and how OpenAI stages access across Free, Go, Business, and Enterprise tiers, as reported by Axios. On the developer side, monitor API pricing and latency under chat-latest, along with the timeline and uptake once GPT-5.3 Instant is retired after the reported three-month overlap.
Scoring Rationale
This is a notable incremental model release: meaningful factuality and benchmark improvements change the baseline for many users and developers, but it is not a paradigm shift. The update affects product behavior, API defaults, and privacy tradeoffs, so practitioners should validate changes.

