Reinforcement Learning Is A Lot Worse Than The Average Person Thinks: Andrej Karpathy

Reinforcement Learning Limitations

Andrej Karpathy, former Tesla AI Director and founding member of OpenAI, has raised concerns about the pitfalls of relying on Reinforcement Learning to achieve Artificial General Intelligence (AGI).

In a recent podcast appearance, Karpathy delivered a blunt assessment of Reinforcement Learning (RL), stating that it produces suboptimal outcomes.

Although RL has enabled breakthroughs such as AlphaGo and ChatGPT's conversational abilities, Karpathy's critique implies that the technique is more limited than those success stories suggest.

Author's summary: Karpathy critiques Reinforcement Learning's limitations.

Karpathy says Reinforcement Learning produces suboptimal outcomes
RL has enabled breakthroughs like AlphaGo and ChatGPT
Karpathy's critique suggests RL may be more limited than its successes imply

OfficeChai — 2025-10-18