Richard Sutton – Father of RL thinks LLMs are a dead end

Dwarkesh Patel•

1:07:08

•

396,232 views

•

2 months ago

Full Transcript

English

Today I'm chatting with Richard Sutton, who is one of the founding fathers of reinforcement learning and inventor of many of the main techniques used there,

like TD learning and policy gradient methods. For that, he received this year's Turing Award

which, if you don’t know, is the Nobel Prize for computer science. Richard, congratulations. Thank you, Dwarkesh. Thanks for coming on the podcast.

It's my pleasure. First question. My audience and I are familiar with the LLM way of thinking about AI. Conceptually, what are we missing in terms of

thinking about AI from the RL perspective? It's really quite a different point of view.

Unlock Full Transcript

You're viewing the first 5 lines

Register for free to access the complete transcript with timestamps

Register Free Sign In

Free forever

No credit card

Showing first 5 lines • 463 more lines available

Loading comment intelligence...

More from Dwarkesh Patel

Ilya Sutskever – We're moving from the age of scaling to the age of research

Ilya Sutskever – We're moving from the age of scaling to the age of research

Satya Nadella – How Microsoft thinks about AGI

Satya Nadella – How Microsoft thinks about AGI

Sarah Paine — How Russia sabotaged China's rise

Sarah Paine — How Russia sabotaged China's rise

Andrej Karpathy — “We’re summoning ghosts, not building animals”

Andrej Karpathy — “We’re summoning ghosts, not building animals”

“I find it almost disturbing that the universe favors life this strongly” – Nick Lane

“I find it almost disturbing that the universe favors life this strongly” – Nick Lane

Some thoughts on the Sutton interview

Some thoughts on the Sutton interview