Skip to content

Andrej Karpathy Joined Anthropic. His First Job: Use Claude to Build the Next Claude.

DS
LDS Team
Let's Data Science
10 min
The OpenAI founding member paused his education startup Eureka Labs to join Anthropic's pre-training team under Nick Joseph. His new mandate: spin up a group that uses Claude itself to accelerate the next generation of Claude.

Andrej Karpathy opened X on Tuesday morning and typed seven sentences. The first one read: "Personal update: I've joined Anthropic."

By the time the news reached the Bloomberg terminal an hour later, the headline had already rewritten itself in the heads of every senior ML engineer who saw it. Karpathy is one of the original eleven people who started OpenAI in 2015. He ran computer vision at Tesla. He came back to OpenAI in 2023 to work on mid-training and synthetic data. He left again in February 2024 to found Eureka Labs, an AI-native education startup he had been promising the field for years.

This week, he started reporting to Nick Joseph, Anthropic's Head of Pretraining and another former OpenAI alumnus, on the team responsible for the massive compute runs that give Claude its core knowledge.

The hire matters for a reason that has very little to do with Karpathy's celebrity. According to a statement Anthropic gave TechCrunch, Karpathy will build a new team focused on using Claude to accelerate pre-training research. That is the part that should make practitioners stop scrolling. Anthropic is no longer just betting that bigger clusters will beat OpenAI. It is betting that the model itself, used as a research collaborator, is the path forward. And it has put one of the most respected pre-training minds alive in charge of proving it.

The Defection That Wasn't About Money

Karpathy is not a free agent who needed a job. Eureka Labs, the startup he had built around AI-native education, was healthy enough to be paused rather than wound down. He is independently wealthy from Tesla equity. He has been one of the most-watched independent voices in machine learning for two years running, with a YouTube channel that taught a generation of engineers how transformers actually work.

He went to Anthropic anyway.

In the X post that broke the news, Karpathy wrote: "I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time."

The phrase "get back to R&D" is the one to read twice. Karpathy spent the past two years explaining models to the world. He spent the year before that running an education company. He has not been inside a frontier lab during the buildout of the current generation of models. Tuesday's move ends that gap.

Anthropic confirmed to multiple outlets, including TechCrunch and CNBC, that he will sit inside the pre-training organization rather than the alignment or product teams. That is the most expensive, most compute-intensive part of building a frontier model, and the one where most of Anthropic's competitive position against OpenAI and Google is decided.

The Career That Made This Hire Inevitable

Karpathy's resume reads like a map of every major lab transition of the past decade. The reason Anthropic wanted him is buried in that timeline.

December 2015
OpenAI is founded with Karpathy as one of 11 original members
He joins as a research scientist, working alongside Ilya Sutskever, Greg Brockman, and Sam Altman on the lab's earliest deep learning work.
June 2017
Elon Musk recruits him to lead Tesla's AI division
Karpathy becomes Director of AI at Tesla, owning the neural network stack behind Autopilot and Full Self-Driving.
July 2022
Karpathy leaves Tesla after five years
He publishes an exit note saying he wants to return to long-form research and educational work.
February 2023
He returns to OpenAI
Karpathy rejoins to work on mid-training and synthetic data pipelines, both of which are now central to every frontier model build.
February 2024
He leaves OpenAI a second time, founds Eureka Labs
Eureka Labs is positioned as an "AI-native" education company. Karpathy spends two years building, teaching, and publishing the most-watched LLM lectures on the internet.
May 19, 2026
Karpathy announces he has joined Anthropic
He starts the week on Anthropic's pre-training team under Nick Joseph. A new team focused on using Claude to accelerate pre-training research is announced alongside the hire.

The signal in that chronology is not where Karpathy worked. It is what he worked on. Tesla taught him how to ship neural networks at production scale across millions of cars. OpenAI in 2023 gave him direct experience with mid-training and synthetic data, two of the levers labs now pull hardest when raw scaling stalls. Eureka Labs forced him to teach modern language modeling to people who do not already have PhDs, which is its own form of compression.

That combination is unusually well suited to running a team whose job is to figure out how to get a model to help train the next model. Anthropic could have hired a hundred competent infrastructure engineers. It hired the person whose career has been a tour of the exact problem.

Why Anthropic Wants Models Training Models

The phrase "using Claude to accelerate pre-training research" is the kind of corporate sentence that hides a strategic bet. Unpacked, it means three things.

First, Anthropic believes the frontier is no longer purely compute-bound. Every major lab now has access to enormous clusters. NVIDIA's recent equity push tied roughly $40 billion to top AI buyers including OpenAI in 2026 alone. Compute is necessary, but compute is not the moat.

Second, Anthropic believes the next moat is research velocity. Whoever can run more experiments per dollar of compute, find better data mixes faster, and pick the right architecture changes faster will pull ahead. That is a problem language models can actually help solve, by generating ablations, summarizing internal results, writing infrastructure code, and proposing training recipes.

Third, Anthropic believes its own model is good enough to be that collaborator. Claude Opus 4.7, released in April, leads several agentic coding benchmarks. Asking Claude to help build Claude only makes sense if Claude is already useful as a researcher. Tuesday's hire is the company betting publicly that it is.

Signal from the hireWhat it means for Anthropic's strategy
Karpathy on pre-training, not alignment or productCompute and data are still the company's most important investments
A new team specifically for AI-assisted researchAnthropic thinks model-augmented research compounds faster than linear hiring
Reporting to Nick Joseph, another ex-OpenAI hireContinued concentration of OpenAI alumni at the top of Anthropic's research stack
Eureka Labs is paused, not closedKarpathy left himself the option to leave again

There is a fourth, quieter signal that practitioners will notice. Karpathy has spent the last year publicly skeptical of the idea that AI agents are ready to replace engineering work. He coined the term "vibe coding" partly as a wry description of the messy reality of letting LLMs drive software. Anthropic just put him in charge of a team built around the idea that AI agents can drive research. The bet is that pre-training, which involves enormous amounts of grunge work over well-defined search spaces, is exactly the kind of domain where current models help most and current limitations matter least.

The Other Side of the Trade

Not every reaction was bullish on Anthropic. The argument cutting in the other direction is one Gizmodo made on Tuesday afternoon: hires this senior do not change roadmaps overnight. By the time Karpathy ramps up, builds a team, and produces meaningful results, OpenAI may have shipped the next two generations of GPT.

There is also a structural question about whether a research-collaborator team can really compound the way Anthropic hopes. Skeptics point out that LLMs accelerate routine engineering, but the hardest pre-training decisions, including data composition and architecture, still require taste and intuition that current models lack. The team Karpathy is being asked to build is essentially a wager that this gap closes faster than most ML researchers think.

And there is the simple optics question, raised by Axios and Fortune. Karpathy is the latest senior OpenAI alum to publicly join Anthropic, with Anthropic itself co-founded in 2021 by former OpenAI research leads Dario and Daniela Amodei. That is a recruiting story for Anthropic and a retention story for OpenAI. CEO Sam Altman has not commented publicly on Karpathy's move.

For OpenAI's part, the company has not lost a co-founder of Karpathy's profile to a direct competitor before. The next earnings cycle, and the next round of talent reports, will say more about whether Tuesday is one defection or the start of a pattern.

The Bottom Line

Anthropic just hired the man whose YouTube lectures taught half of the field how transformers work, and pointed him at the problem of making Claude help train Claude. He is doing it in the same week his new employer is reportedly closing a funding round at a roughly $900 billion valuation, a number that would put Anthropic above OpenAI in private markets for the first time.

That is the literal version of the story. The deeper version is that the AI frontier is starting to behave like a flywheel. The labs with the best models have the best AI-augmented research, which produces better models, which attract better researchers, which run more experiments. Anthropic is not the first lab to think this way. It is the first lab to put a co-founder of OpenAI in charge of executing it.

Karpathy's own line on the move was, as ever, the most quotable: "I think the next few years at the frontier of LLMs will be especially formative." That is a polite version of a much louder claim. The next few years are not going to be formative for the field in general. They are going to be formative for whichever lab gets model-assisted research to compound first. Anthropic just made that bet in public.

The most interesting question now is not whether Karpathy succeeds. It is what OpenAI does next.

Sources

Practice interview problems based on real data

1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems
Free Career Roadmaps8 PATHS

Step-by-step roadmaps from zero to job-ready — curated courses, salary data, and the exact learning order that gets you hired.

Explore all career paths