Richard Sutton – Father of RL thinks LLMs are a dead end

Dwarkesh Patel
1:07:08
2 months ago


Today I'm chatting with Richard Sutton, who is one of the founding fathers of reinforcement learning and inventor of many of the main techniques used there, like TD learning and policy gradient methods. For that, he received this year's Turing Award which, if you don’t know, is the Nobel Prize for computer science. Richard, congratulations.

Thank you, Dwarkesh.

Thanks for coming on the podcast.

It's my pleasure.

First question. My audience and I are familiar with the LLM way of thinking about AI. Conceptually, what are we missing in terms of thinking about AI from the RL perspective?

It's really quite a different point of view.
