Today I'm chatting with Richard Sutton, who is one of the founding fathers of reinforcement learning and inventor of many of the main techniques used there,
like TD learning and policy gradient methods. For that, he received this year's Turing Award
which, if you don’t know, is the Nobel Prize for computer science. Richard, congratulations. Thank you, Dwarkesh. Thanks for coming on the podcast.
It's my pleasure. First question. My audience and I are familiar with the LLM way of thinking about AI. Conceptually, what are we missing in terms of
thinking about AI from the RL perspective? It's really quite a different point of view.
You're viewing the first 5 lines
Register for free to access the complete transcript with timestamps





