TY  - BOOK
AU  - Bertsekas, Dimitri P.
TI  - Reinforcement learning and optimal control
SN  - 9781886529397 (hb.)
U1  - 519.703 
PY  - 2019///
CY  - Massachusetts
PB  - Athena Scientific
KW  - Mathematics
KW  - Mathematical optimization
KW  - Dynamic programming
KW  - Reinforcement learning
N1  - http://www.athenasc.com/rlbook_athena.html
N2  - This book considers large and challenging multistage decision problems, which can be solved in principle by dynamic programming (DP), but their exact solution is computationally intractable. We discuss solution methods that rely on approximations to produce suboptimal policies with adequate performance. These methods are collectively known by several essentially equivalent names: reinforcement learning, approximate dynamic programming, and neuro-dynamic programming. They have been at the forefront of research for the last 25 years, and they underlie, among others, the recent impressive successes of self-learning in the context of games such as chess and Go
ER  -