Google is rolling out a major update to Gemini 3 Deep Think, a reasoning-focused mode now available to Google AI Ultra subscribers and, through early API access, to select researchers. The model reports 48.4% on Humanity’s Last Exam, 84.6% on ARC-AGI-2, a Codeforces Elo of 3455, and gold-medal-level performance on the 2025 International Mathematical Olympiad, and it targets complex scientific and engineering tasks.

Key Points

1. Achieves 48.4% on Humanity's Last Exam, 84.6% on ARC-AGI-2, a Codeforces Elo of 3455, and IMO gold-medal-level performance
2. Demonstrates deep multi-step reasoning and logical error detection in complex scientific contexts
3. Enables engineers and researchers to verify models, prototype designs, and automate structured reasoning workflows

Scoring Rationale

Official Google release with industry-wide, actionable reasoning advances; limited by early restricted access and partly promotional claims.

Sources

Public references used for this report.

01 ghacks.net: Gemini 3 Deep Think Raises the Bar for Advanced AI Reasoning

02 techjuice.pk: Is This AGI? The Shocking New Reasoning Scores from Google’s Deep Think
