Policy gradient method in the context of Richard S. Sutton


Policy gradient method in the context of Richard S. Sutton

Policy gradient method Study page number 1 of 1

Play TriviaQuestions Online!

or

Skip to study material about Policy gradient method in the context of "Richard S. Sutton"


HINT:

👉 Policy gradient method in the context of Richard S. Sutton

Richard Stuart Sutton FRS FRSC (born 1957 or 1958) is a Canadian computer scientist. He is a professor of computing science at the University of Alberta, fellow & Chief Scientific Advisor at the Alberta Machine Intelligence Institute, and a research scientist at Keen Technologies. Sutton is considered one of the founders of modern computational reinforcement learning. In particular, he contributed to temporal difference learning and policy gradient methods. He received the 2024 Turing Award with Andrew Barto.

↓ Explore More Topics
In this Dossier