Andrej Karpathy, former Tesla AI Director and founding member of OpenAI, has expressed concerns about the potential pitfalls of Reinforcement Learning approaches in achieving Artificial General Intelligence (AGI).
In a recent podcast appearance, Karpathy delivered a blunt assessment of Reinforcement Learning (RL), stating that it produces suboptimal outcomes.
Reinforcement Learning Is A Lot Worse Than The Average Person Thinks
Despite enabling breakthroughs such as AlphaGo and ChatGPT's conversational abilities, Karpathy's critique suggests that RL may be more limited than its recent success stories suggest.
Author's summary: Karpathy critiques Reinforcement Learning's limitations.