Lectures for Probabilistic Reasoning in AI (308-526B)

Winter 2002


All the lecture notes are linked to this web page, in PDF and postscript format. For convenience of printing, slides are also available in a 4 per page format (the links labeled PS4 and PDF4). The reader for PDF files is available free from Adobe for UNIX, Apple Macintosh, and Windows. You can view both postscript and PDF files using ghostview.

Tentative Schedule

Lec.
Date
Topic
Readings
Notes
1
Jan. 7
 Introduction  Pearl, Chapter 1
 Slides: PS, PDF, PS4, PDF4
 
2
Jan. 9
 Bayesian Inference  Pearl, Sections 2.1,2.3.1
 Slides: PS, PDF,PS4, PDF4
 Answers to puzzles: PS, PDF, PS4, PDF4
 Problem Set 1 Posted
PS, PDF
3
Jan. 14
 Bayesian Networks: Introduction, I-maps  Pearl, Sections 3.1, 3.3
 Slides: PS, PDF,PS4, PDF4
 
4
Jan. 16
 Bayesian Networks: d-separation, construction  Friedman and Koller's notes on Bayesian networks
 Slides: PS, PDF, PS4, PDF4
  Problem Set 1 Due!
5
Jan. 21
 Inference: Variable Elimination  Friedman and Koller's notes on Variable Elimination
 Slides: PS, PDF, PS4, PDF4
 Problem Set 2 Posted
PS, PDF
6
Jan. 23
 Inference: Clique tree construction, Cutset conditioning  Pearl, Sections 4.4.1, 4.4.2
 Friedman and Koller's notes on Clique tree inference
 Slides: PS, PDF, PS4, PDF4
 
7
Jan. 28
 Inference: Sampling  Pearl, Section 4.4.3;
Friedman and Koller's notes on Approximate Inference
 Slides: PS, PDF, PS4, PDF4
 
8
Jan. 30
 Learning: Parameter Estimation  Koller and Friedman's notes on Parameter Estimation
 Heckerman's tutorial on Learning with Bayesian Networks
 Slides: PS, PDF, PS4, PDF4
  Problem Set 2 Due!
9
Feb. 4
 Learning: Structure  Koller and Friedman's notes on Learning Structure
 Heckerman's tutorial on Learning with Bayesian Networks
 Slides:
 
-
Feb. 6
 No class  
 
 
10
Feb. 11
 Learning: Structure, Hidden Variables  TBA
 Slides:
 Problem Set 3 Posted
PS, PDF
12
Feb. 13
 Expectation Maximization (EM)  TBA
 Slides:
13
Feb. 18
 Dynamic Bayesian Networks and Hidden Markov Models  TBA
 Slides:
  Solution 2: PS, PDF
14
Feb. 20
   Pearl, Chapter 6
 Slides:
  Problem Set 3 Due!
-
Feb. 25
 Spring break  
 
 
-
Feb. 27
 Spring break  
 
 
15
Mar. 4
 Introduction to Decision Making  Pearl, Sections 6.1, 6.3, 6.4
 Slides: PS, PDF, PS4, PDF4
 Programming Assignment 1 Posted
 Reading 1 Posted
16
Mar. 6
 Markov Decision Processes  Slides: PS, PDF, PS4, PDF4
 R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, Chapters 3-4
 C. Boutilier, T. Dean and S. Hanks. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage, Sections 1-3.
 
17
Mar. 11
 Monte Carlo Methods  No slides
 R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, Chapter 5, Chapter 2 FYI
 
  Reading 1 Due!
18
Mar. 13
 Temporal Difference Learning  No slides
 R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, Sections 6.1-6.5
 
  Programming Assignment 1 Due!
19
Mar. 18
 Eligibility traces  No slides
 R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction Sections 7.1-7.6, 7.8-7.11
 
Project Info Posted
20
Mar. 20
 Approximation methods for MDPs  Slides: PS, PDF, PS4, PDF4
 R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, Chapter 8
 Problem Set 4 Posted
PS, PDF
21
Mar. 22
 Structured state spaces  
 
 
22
Mar. 27
 Hierarchical actions and temporal abstraction  
 
 
-
Apr. 1
 No class (Easter)  
 
 
23
Apr. 3
 Partially Observable Markov Decision Processes  No Slides
 L. Kaelbling, M. Littman, and A. Cassandra.
 
24
Apr. 8
 Memory-based methods for POMDPs  
 
 Programming Assignment 2 Posted
 Reading 2 Posted
25
Apr. 10
 Finite-state controllers for POMDPs
 Class evaluations
 
 
 
26
Apr. 15
 Wrap-up  
 
 

Prof. Doina PRECUP
Last modified: Mon Apr 8 12:49:09 EDT 2002