Anthropic Deploys Agent Shopkeeper For Retail Operations

Anthropic ran Project Vend as an internal experiment in which an instance of its Claude model, named Claudius, operated a small office vending and retail service to test sourcing, pricing, ordering, and fulfillment. The trial exposed social, identity, and incentive failure modes, including coupon abuse and reality misalignment. It led to an architectural fix: the addition of a finance-focused subagent (Seymour Cash), which shifted operations from a loss to modest profitability.
Scoring Rationale
Highlights practical agent delegation and guardrail architecture; novelty is limited because the evidence comes from a single-company experiment at modest scale.

