Note: This is the 2020–2021 eCalendar. Update the year in your browser's URL bar for the most recent version of this page, or jump to the newest eCalendar.
Overview
Computer Science (Sci) : Bandit algorithms, finite Markov decision processes, dynamic programming, Monte-Carlo Methods, temporal-difference learning, bootstrapping, planning, approximation methods, on versus off policy learning, policy gradient methods temporal abstraction and inverse reinforcement learning.
Terms: This course is not scheduled for the 2020-2021 academic year.
Instructors: There are no professors associated with this course for the 2020-2021 academic year.