Google wins early enterprise users for Gemini Flash
Business Insider reports that Vercel CEO Guillermo Rauch told the outlet he is seeing rising demand for Google's Gemini models among Vercel customers, with the Gemini 3 Flash model overtaking Anthropic on Vercel's AI Gateway in early April after Anthropic led in March. Business Insider says Rauch told the reporter he called a top Google executive to request additional Gemini tokens, the platform's unit of usage. The shift appears ahead of Google I/O next week; Business Insider wrote the company is likely to unveil additional AI models and features at the event. The story is based on reporting from Business Insider and on comments attributed to Vercel's CEO.
What happened
Business Insider reports that Vercel CEO Guillermo Rauch told the outlet he is seeing markedly higher demand for Google models among Vercel customers. Per Business Insider, Vercel's internal traffic metrics on its AI Gateway show Anthropic leading in March, then Gemini 3 Flash jumping into the lead in early April and remaining the top model by token volume. Business Insider also reports Rauch said he contacted a senior Google executive to request more Gemini tokens, the platform's unit of AI usage. Business Insider notes this shift occurred ahead of Google I/O, scheduled for next week, and wrote the company is likely to announce additional AI models and tooling at the event.
Technical details
Editorial analysis - technical context: Companies that centralize model selection through gateways typically evaluate offerings on latency, cost-per-token, and available feature sets such as multimodal input or streaming. Models labeled Flash are commonly engineered to trade some model size for lower latency and lower serving cost, which can increase token throughput for embed-and-infer workloads. High token demand reported by an integrator like Vercel plausibly reflects a mix of developer experimentation and production inference load, but Business Insider's report is the only source for the observed token ranking.
Context and significance
Industry context: Public coverage in recent months has focused on Anthropic and OpenAI, yet Business Insider's reporting highlights that Google is gaining visible adoption among developer platforms. For practitioners, a move in token share at a gateway provider matters because it can presage changes in benchmarked latency, pricing pressure, and third-party integrations. This single-source report does not disclose enterprise contracts, SLA terms, or pricing changes; those remain unreported by Business Insider.
What to watch
For practitioners and platform engineers: monitor:
- •token-volume trends from multi-model gateways and API partners
- •latency and cost-per-token benchmarks for Gemini 3 Flash versus Anthropic deployments
- •Google I/O announcements next week for new model SKUs, pricing, or developer SDKs. Business Insider is the sole source for the specific Vercel metrics cited; readers should seek confirmation from additional telemetry or public releases before treating the ranking as definitive
Scoring Rationale
The report is notable because it signals model-share movement at a developer platform, which matters for deployment choices. It is sourced to a single Business Insider story and an executive's comments, so the finding is significant but not yet independently corroborated.
Practice with real Ad Tech data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Ad Tech problems


