Case Studymulti armed banditsthompson samplingonline experiments

DoorDash Adopts Multi‑Armed Bandits For Experimentation

infoq.com

|January 25, 2026

8.1

Relevance Score

DoorDash Adopts Multi‑Armed Bandits For Experimentation

DoorDash engineers Caixia Huang and Alex Weinstein adopt a multi-armed bandits (MAB) approach to optimize product experiments, using Thompson sampling to adaptively allocate traffic and reduce opportunity cost. They report MAB accelerates learning and lowers regret compared with fixed-split A/B tests but complicates metric inference and can create inconsistent user experiences. DoorDash plans contextual bandits, Bayesian optimization, and sticky user assignment to mitigate limitations.

DoorDash Adopts Multi‑Armed Bandits For Experimentation

More AI & Data Science News

Korea Emerges As Google Gemini's Second-Largest Market

Event Organizer Corrects Stallman Talk Media License

Scoring Rationale

Sources

Google Pixel 8 Drops Below 180 Euro Price

Users Prefer Keywords For Local ChatGPT Searches