Bertsekas, Dimitri P.

Reinforcement learning and optimal control - Massachusetts Athena Scientific 2019 - xiv, 373p.,

http://www.athenasc.com/rlbook_athena.html

This book considers large and challenging multistage decision problems, which can be solved in principle by dynamic programming (DP), but their exact solution is computationally intractable. We discuss solution methods that rely on approximations to produce suboptimal policies with adequate performance. These methods are collectively known by several essentially equivalent names: reinforcement learning, approximate dynamic programming, and neuro-dynamic programming. They have been at the forefront of research for the last 25 years, and they underlie, among others, the recent impressive successes of self-learning in the context of games such as chess and Go.

9781886529397 (hb.)


Mathematics
Mathematical optimization
Dynamic programming
Reinforcement learning

519.703 / BER