Karpathy Demonstrates Autonomous ML Experiment Loop

On March 7, Andrej Karpathy pushed a 630-line Python script to GitHub and his AutoResearch agent ran roughly 50 experiments overnight, discovered a better learning rate, and committed the change automatically. The article identifies three primitives—an editable asset, a scalar metric, and a time-boxed cycle—and argues that a Markdown program.md file serves as the high-leverage human-agent interface. The pattern generalizes beyond ML to databases and support routing.
Key Points
- 1Runs ~50–100 experiments overnight, modifying a single editable file and committing improvements automatically.
- 2Highlights three primitives—editable asset, scalar metric, time-boxed cycle—that enable generalizable autonomous experiment loops.
- 3Prioritizes program.md writing as a high-leverage, parseable human-agent interface for safe, interpretable automation.
Scoring Rationale
Presents broad, actionable pattern with high applicability, but relies on a single-project demonstration rather than systematic evaluation.
Sources
Public references used for this report.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems


