Anthropic's Claude Code Accelerates Developer Workflows
Cat Wu, who leads product for Claude Code and Cowork at Anthropic, said 90% of Anthropic's own codebase is now written by Claude Code, according to an O'Reilly Radar writeup of her fireside chat with Addy Osmani at O'Reilly's AI Codecon. Per O'Reilly, Anthropic engineers are producing roughly 200% more code than a year earlier, which shifted the team's bottleneck to code review; Anthropic's own separately released Code Review product reports substantive review comments on pull requests rose from 16% to 54% after rollout. The article also describes an internal dogfooding Slack channel that gets a new message every 5 to 10 minutes, and recounts an episode where a scheduled Claude Code task scanned that channel and opened a pull request to fix an issue before a human engineer got to it.
So what
The headline number is 90% of Anthropic's code being machine-written, but the more durable signal for practitioners is what Anthropic had to build around that fact: a load-bearing automated code-review layer, an ownership model that keeps a human accountable for every merged change, and a feedback loop tight enough to auto-generate its own bug-fix pull requests. Teams adopting agentic coding at any scale will hit the same review bottleneck before they hit any generation-quality ceiling.
What happened
O'Reilly reports that Cat Wu, who leads product for Claude Code and Cowork at Anthropic, said 90% of Anthropic's codebase is now written by Claude Code. The account comes from a fireside chat Wu held with Addy Osmani at O'Reilly's AI Codecon, written up by Tim O'Reilly on O'Reilly Radar (April 30, 2026). Per the writeup, Anthropic's internal "dog-fooding" Slack channel for Claude Code receives a new message every 5 to 10 minutes, and engineers ship new internal builds of the tool multiple times a day. O'Reilly recounts one specific episode: Wu introduced a small interaction bug between prompts and auto-suggestions, and before she got to fixing it, a teammate's scheduled Claude Code task - set to scan the feedback channel for anything unaddressed after 24 hours - had already found the issue and opened a pull request to fix it.
Technical context
Wu told O'Reilly that Anthropic engineers are now producing roughly 200% more code than a year earlier, and that the resulting bottleneck is review, not generation. Her team's response was to run the most comprehensive automated review configuration it could, tuning for high bug recall rather than a lighter touch, on the reasoning that a review system only earns trust as a load-bearing part of the pipeline if it is thorough. That internal practice lines up with Anthropic's separately shipped Code Review feature for Claude Code: the company has reported that substantive review comments on pull requests rose from 16% to 54% after deploying it, with engineers disagreeing with fewer than 1% of the tool's findings, per Anthropic's own product materials. O'Reilly's account adds that the review agents trace changes across files, catching bugs unrelated to the immediate diff - such as a cache-invalidation bug surfaced during an unrelated encryption refactor.
For practitioners
Three separate layers are doing distinct work here, and teams building agentic coding pipelines should instrument each one rather than treating "AI writes the code" as a single capability: the generation layer (the agent producing edits), the orchestration layer (scheduled or triggered agents that monitor channels and open PRs on their own), and the verification layer (automated review tuned for high recall, plus a human owner accountable end-to-end for each merged change, including post-deploy bugs). Anthropic's account suggests the ownership model matters as much as the tooling: PR authors, not reviewers, are on the hook for what ships, which is what makes it acceptable to lean on heavy automated review in the first place.
What to watch
Formal provenance capture for agent-authored changes (signed metadata, reproducible prompts), broader adoption of test-oracle patterns that check intent rather than surface output, and operational guardrails - rate limits, merge gating, security scanning - as more teams let scheduled agents open PRs unattended the way Anthropic's internal tooling already does.
Key Points
- 1Anthropic's Cat Wu says 90% of the company's code is now written by Claude Code, with output up roughly 200% year over year.
- 2The resulting bottleneck shifted to review, which Anthropic addressed with high-recall automated review that lifted substantive PR comments from 16% to 54%.
- 3A scheduled Claude Code task independently found and fixed a bug from an internal feedback channel before a human engineer reached it, showing agent-driven orchestration in production.
Scoring Rationale
A concrete, well-sourced data point on production-scale agentic coding at a leading AI lab, with a specific before/after review metric, but it is a single company's self-reported internal practice rather than a new model, product launch, or industry-wide shift.
Sources
Public references used for this report.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

