Skip to content
FAQ

Andrej Karpathy Joins Anthropic to Build AI-Accelerated Pretraining Team

OpenAI co-founder and former Tesla AI director Andrej Karpathy has joined Anthropic to lead a new team using Claude itself to accelerate pretraining research. The hire — arguably the most high-profile AI talent acquisition of 2026 — deepens Anthropic's push to make AI-assisted research the engine of its competitive advantage over OpenAI and Google.

5 min read

The AI industry’s talent war reached a new inflection point on May 19, 2026, when Andrej Karpathy — OpenAI co-founder, former Tesla AI director, and one of the most technically credible researchers of the deep learning era — announced he had joined Anthropic to lead pretraining research on Claude.

“I am very excited to join the team here and get back to R&D,” Karpathy posted on X. “I think the next few years at the frontier of LLMs will be especially formative.”

The announcement sent an immediate signal across the industry. Karpathy’s name carries a weight in AI research that few others match — a combination of foundational technical contributions, hard-won operational experience at two of the most consequential companies in the field, and an extraordinary ability to translate complex research into accessible teaching. His choice to join Anthropic, at this moment, says something about where he believes the most important work is happening.

The Pretraining Imperative

To understand why this hire matters, it helps to understand what pretraining actually is and why it remains the most consequential — and least publicized — battleground in large language model development.

Pretraining is the phase in which a model learns from vast quantities of text, code, scientific papers, and other data before any specialized instruction-tuning or fine-tuning begins. It is the process that instills raw capability: the ability to reason, recall facts, follow complex chains of logic, and generalize from limited examples. Everything that distinguishes a genuinely intelligent model from a sophisticated pattern-matcher traces back to the quality of this foundational training stage.

The largest frontier labs — OpenAI, Google DeepMind, Meta — have invested enormous resources into this phase, from data curation and filtering pipelines to the scaling laws that determine how much compute translates into how much capability. Anthropic, which has always been tight-lipped about the technical specifics of Claude’s architecture, has historically kept its pretraining research even more private than its competitors.

Bringing in Karpathy to lead this effort signals a step change in ambition.

Using Claude to Build Claude

What makes the hire particularly significant is the specific mandate Karpathy has been given. He will not simply join Anthropic’s existing pretraining team. He will build a new team focused on using Claude itself to accelerate pretraining research — a concrete implementation of what researchers have long theorized but few have systematically attempted.

The concept is sometimes called AI-assisted scientific research, and it is distinct from the science-fiction notion of recursive self-improvement. Rather than an AI rewriting its own weights or architecture, the approach involves deploying frontier AI assistants to do the labor-intensive scientific and engineering work that produces the next generation of models: designing experiments, writing analysis code, interpreting model behavior, generating and evaluating hypotheses, and surfacing patterns in large datasets that human researchers might miss.

Pretraining research is precisely the kind of work that can benefit from this approach. An experiment that would take a human researcher a week to design, run, and analyze can potentially be compressed into hours if a capable AI handles significant portions of the scaffolding work. Accumulated over months of research cycles, those time savings compound into a meaningful competitive advantage.

“Tapping him to build such a team is a clear sign from Anthropic that it believes AI-assisted research, rather than pure compute, is how it stays competitive with OpenAI and Google,” said one industry observer close to the company’s research direction. Karpathy will work under Nick Joseph, who leads Anthropic’s pretraining division.

A Career That Shaped an Industry

For those outside the AI research community, Karpathy’s resume requires some unpacking. He received his PhD under Fei-Fei Li at Stanford, where his work on convolutional neural networks and image captioning helped establish the empirical foundations of what would become the deep learning revolution. He was a founding member of OpenAI in 2015, contributing to some of the earliest large-scale language model research and participating in the scaling experiments that demonstrated the now-famous power laws governing model intelligence.

He left OpenAI in 2017 to become Tesla’s VP of Autopilot and AI, where he built the Autopilot team from around 40 engineers to over 150. The most consequential decision of his Tesla tenure was championing the transition from radar-plus-camera systems to a pure vision approach — a bet the company defended publicly under heavy criticism and which has since become the foundation of its Full Self-Driving product.

He returned to OpenAI briefly in 2023 before departing again in early 2024 to found Eureka Labs, a startup applying AI to education. The pivot was characteristically thoughtful — Karpathy had always maintained that education and research were the areas most likely to be transformed first by capable AI assistants. But Eureka Labs, only 18 months old, apparently gave way to what he described as an even more formative opportunity.

Beyond his industry work, Karpathy is perhaps the AI field’s most effective technical educator. His open-source courses — including a from-scratch neural network tutorial and a popular “microGPT” series — have directly trained tens of thousands of engineers who are now building at frontier labs and startups. The ability to explain things from first principles, with clarity and precision, is not incidental to his research contributions. It reflects the same underlying habit of mind.

Anthropic’s Talent Flywheel

Karpathy’s hire arrives at a remarkable moment for Anthropic. The company recently reported its first profitable quarter, with Q2 2026 revenue reaching $1.09 billion — a milestone that would have seemed improbable to most industry observers 18 months ago. Its annualized revenue run rate sits above $4 billion, driven heavily by Claude Code, which has become one of the most-used developer tools in software engineering.

Google has committed up to $40 billion in total investment at a $350 billion valuation. A separate deal gives Anthropic access to SpaceX’s Colossus supercomputing cluster for training runs. The company is no longer a well-funded safety lab operating in the shadow of more commercially successful competitors — it is increasingly defining what the frontier looks like.

The ability to attract Karpathy at this stage suggests Anthropic has reached the kind of momentum where talent recruitment becomes self-reinforcing. Researchers who have choices — and Karpathy could command interest from virtually any organization in the field — tend to choose environments where they believe the most important work is being done. His decision to join Anthropic is itself an endorsement of the company’s technical trajectory.

It is also worth noting what the hire says about the relationship between safety and capability research. Karpathy is not a researcher who chose Anthropic despite its safety focus — he has spoken repeatedly about the importance of building AI systems that are genuinely aligned with human values. Anthropic’s explicit commitment to responsible development appears to be a feature, not a footnote, of his decision.

The Next Claude

Karpathy has not disclosed specific research timelines or targets. But his arrival on the pretraining team, combined with Anthropic’s growing compute resources and the resources to run more ambitious training runs than ever before, suggests the next major Claude training run will be qualitatively different from its predecessors.

Whether that translates into a step-change improvement in Claude’s capabilities relative to GPT-5.5 and Gemini 3.5 Flash remains to be seen. What is clear is that Anthropic has assembled, in a short period of time, both the capital and the talent to make a serious run at shaping the next generation of frontier models.

Karpathy has chosen his table. The industry will be watching closely to see what he builds at it.

Anthropic Andrej Karpathy pretraining AI research talent Claude
Share

Related Stories