Location: | McConnell Engineering room 13 |
Times: | Tuesday and Thursday, 4:05-5:25pm |
Instructor: | Doina Precup and Pierre-Luc Bacon, School of Computer Science |
Office: | McConnell Engineering building, room 111N and 107 |
Phone: | (514) 398-6443 (Doina) |
Email: | dprecup@cs.mcgill.ca and pbacon@cs.mcgill.ca |
Office hours:
|
See course home page
Meetings at other times by appointment only! |
Class web page: | http://www.cs.mcgill.ca/~dprecup/courses/rl.html
IMPORTANT: This is where class notes, announcements and homeworks are posted! |
The goal of this class is to provide an introduction to reinforcement learning, a very active part of machine learning. Reinforcement learning is concerned with building programs which learn how to predict and act in a stochastic environment, based on past experience. Applications of reinforcement learning range from classical control problems, such as powerplant optimization or dynamical system control, to game playing, inventory control, and many other fields. Notably, reinforcement learning has also produced very compelling models of animal and human learning. During this course, we will study theoretical properties and practical applications of reinforcement learning. We will follow the second edition of the classic textbook by Sutton & Barto (available online), and supplement it as needed with papers and other materials.
MyCourses will be used only for bulletin board, discussion groups and assignment submission and grading.
McGill University values academic integrity. Therefore all students must understand the meaning and consequences of cheating, plagiarism and other academic offenses under the Code of Student Conduct and Disciplinary Procedures (see www.mcgill.ca/students/srr/honest for more information).
In accord with McGill University's Charter of Students' Rights, students in this course have the right to submit in English or in French any written work that is to be graded.
In the event of extraordinary circumstances beyond the University's control, the content and/or evaluation scheme in this course is subject to change.