MA 661 Dynamic Programming and Reinforcement Learning

The main purpose of this course is to present an introduction to dynamic programming as the most popular methodology for learning and control of dynamic stochastic systems. We discuss basic models, some theoretical results and numerical methods for these problems. They will be developed starting from basic models of dynamical systems, through finite-horizon stochastic problems, to infinite-horizon stochastic models of fully or partially observable systems. Throughout the class special attention will be paid on the application of dynamic programming to statistical learning. The class will include introduction to approximate dynamic programming techniques, which are used in statistical learning, such as tree-based methods for classification, Bayesian learning, etc. The concepts and methods will be illustrated by various applications including learning in stochastic networks, engineering, business, and finance.

Credits

3

Prerequisite

MA 540 and MA 611