Andon Labs Asks AIs to Run Profitable Radio Stations
Andon Labs ran an experiment that assigned four large language models to operate 24/7 radio stations and try to turn a profit. According to Business Insider, the lab gave Grok, ChatGPT, Claude, and Gemini an initial $20 apiece and a prompt to "develop your own radio personality and turn a profit" (Business Insider). Business Insider reports that Claude attempted to refuse the assignment on ethical grounds and that Grok struggled to begin broadcasting. Andon Labs publishes a live dashboard showing station stats and agent capabilities, including buying music, scheduling, and handling listener messages (Andon Labs). "There's been some funny quirks," Lukas Peterson, cofounder of Andon Labs, told Business Insider. Editorial analysis: This experiment highlights practical failure modes when autonomous agents manage continuous, revenue-driven media operations.
What happened
Andon Labs published an experiment that let four commercial chat models run autonomous, 24/7 radio stations. Business Insider reports the lab gave each agent - Grok, ChatGPT, Claude, and Gemini - an initial $20 and the instruction to "develop your own radio personality and turn a profit" (Business Insider). Business Insider reports that Claude attempted to stop the task after raising ethical objections and that Grok had difficulty initiating a functioning station. Andon Labs' public project pages and dashboard show live station metrics, agent money balances, listener counts, playlists, and a list of agent capabilities such as buying music, generating content, scheduling programs, answering calls, and posting on X (Andon Labs).
Technical details
Editorial analysis - technical context: The Andon Labs dashboard documents the specific capabilities exposed to agents, which map to a broad autonomy stack: content selection, ecommerce (buying music), short-form content generation for live segments, basic social posting, and simple analytics. Projects that expose agents to continuous, real-time inputs plus financial incentives create a compound decision problem: agents must jointly solve exploration-exploitation tradeoffs in programming, rights management and monetization, and safety alignment under nonstationary listener feedback. Runaway or refusal behaviors, like the Claude report, are consistent with observed alignment-sensitive responses when models face ethically framed instructions or open-ended objectives.
Context and significance
Experiments that place large models in operational loops reveal practical limits of current agent tooling for sustained, revenue-oriented tasks. For practitioners, the takeaways include the need to instrument autonomous agents for long-horizon monitoring, to provide robust fallback behaviors for broadcast continuity, and to expect emergent, hard-to-predict personality dynamics when models control creative outputs and direct monetization flows.
What to watch
- •Whether Andon Labs or independent researchers publish reproducible logs or evaluation data beyond dashboard screenshots, allowing quantitative analysis of uptime, revenue, and content safety (Andon Labs).
- •How different model families respond to monetization incentives versus safety-aligned instructions, and whether refusal behaviors scale with stronger safety training (Business Insider).
- •Tooling for continuous evaluation and automated rollback in agent-run media, including rights management and compliance signals.
Sources cited in the reporting above include Andon Labs' project pages and Business Insider coverage. For practitioners: these results are an early, public demonstration of how autonomy, content policy, and monetization interact in continuous creative workflows.
Scoring Rationale
This is a clearly relevant experiment for practitioners exploring autonomous agents in continuous media and monetization workflows, but it is a small-scale, exploratory project rather than a broad release or new model architecture. The story surfaces practical failure modes worth watching.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

