Reinforcement learning, spring 2024

Spring Semester, Wednesday, from 13:55 to 16:10

ULK-1 No. 4.18-5.17

Course description

The course examines the principles of operation of the main OsP algorithms, which have made it possible to achieve breakthrough results in many tasks: from gaming artificial intelligence to robotics. All the necessary theoretical results are presented with proofs using a unified approach, unified designations and definitions.

The objectives of the course are to provide up—to-date information about reinforcement learning tasks and algorithms for solving them, as well as to explain the difference between algorithms of various types and the reasons for their presentation in specific forms. In the classroom, students will be able to discuss basic training issues with reinforcement, as well as analyze tasks with a teacher.

To master the course, the student needs to know the basics of probability theory, numerical optimization methods, programming in Python, as well as get acquainted with the packages of application programs for mathematical modeling in the Python programming language: SciPy, NumPy, Matplotlib, Scikit-learn, PyTorch, OpenAI Gym.

Instructors

Nikita Evgenievich Yudin

Course materials

1) Ivanov, Sergey. "Reinforcement Learning Textbook." arXiv preprint arXiv:2201.09746 (2022).

2) Sutton, Richard S., and Andrew G. Barto. Reinforcement learning: An introduction. MIT press, 2018;

3) Гасников, А. В., Э. А. Горбунов, and С. А. Гуз. "Лекции по случайным процессам: учебное пособие." Под ред. АВ Гасникова.–«Москва»: МФТИ (2019);

4) Agarwal, Alekh, et al. "Reinforcement learning: Theory and algorithms." CS Dept., UW Seattle, Seattle, WA, USA, Tech. Rep (2019): 10-4.