Amanda Askell Says Claude Could Replace Her Role

At the Bloomberg Tech Summit, Anthropic philosopher Amanda Askell said, "Eventually, Claude is going to be a much better philosopher than I am, and probably be much better at every aspect of my job than I am," according to the Observer. She also said, "Human input is going to be rarer and rarer. That's the thing that we need to prepare models for." The Observer notes Askell joined Anthropic in 2021 after safety and alignment work at OpenAI, co-authored what it calls Claude's "soul doc," leads the company's personality alignment team, and was named to the TIME 100 AI list in 2024. Her remarks add a prominent voice to debates over how labs will evaluate and supervise models whose judgment increasingly rivals that of expert humans.
What happened
The Observer reports that Anthropic philosopher Amanda Askell, speaking at the Bloomberg Tech Summit, said: "Eventually, Claude is going to be a much better philosopher than I am, and probably be much better at every aspect of my job than I am." She added, per the Observer, "Human input is going to be rarer and rarer. That's the thing that we need to prepare models for." According to the Observer, Askell joined Anthropic in 2021 after earlier safety and alignment work at OpenAI, co-authored what the article calls Claude's "soul doc," leads Anthropic's personality alignment team, and was named to the TIME 100 AI list in 2024.
Why it matters
Askell's remarks are notable less for any announcement than for who is making them: a senior alignment researcher at a frontier lab publicly framing advanced models as candidates to outperform experts at qualitative, judgment-heavy work, and arguing that the field should prepare models for a future with less human input. The comments feed ongoing debates about how labs will supervise and evaluate systems whose outputs are increasingly hard for humans to check.
Editorial analysis - for practitioners
As a generic industry pattern rather than a statement of Anthropic's specific plans, growing model competence at nuanced reasoning tends to turn human-rater evaluation into a scaling bottleneck. Teams working on alignment and evaluation increasingly lean on automated and hybrid evaluation suites, adversarial and long-horizon testing, interpretability, and scalable-oversight techniques to assess behavior beyond single-label accuracy. Askell's "human input ... rarer" framing aligns with that broader shift from human-in-the-loop checking toward scalable oversight, though how to do it reliably remains unsettled.
What to watch
- •Whether labs publish reproducible evaluation suites for qualitative and value-laden tasks.
- •Adoption of automated or hybrid human-machine evaluation pipelines in production.
- •Research connecting capability gains to specific alignment and safety failure modes.
Scoring Rationale
A widely noted public statement from a prominent frontier-lab alignment researcher (Anthropic's Amanda Askell) that Claude could surpass her at qualitative, judgment-heavy work and that the field should prepare for less human input. Substantive for the alignment and evaluation community, but it is a single-source profile of summit remarks rather than a research result or product release, keeping it solid but modest.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

