Reinforcement Learning (CS489, Spring 2019)


Time and Venue :

-   Time : 10:00 - 11:40 , Friday , Week 1 - 16

-   Venue : 东中院3-103

Instructor :

-   Prof : Junni Zou

-   Email : zou-jn@cs.sjtu.edu.cn

-   Office : 3-437, SEIEE Building

Teaching Assistant :

-   Teaching Assistant : Yuankun Jiang, Nuowen Kan

-   Email : yuankunjiang@sjtu.edu.cn, kannw_1230@sjtu.edu.cn

Reference Book :

-   Richard S. Sutton and Andrew G. Barto, Reinforcement Learning: An Introduction, Second Edition

Grading Policy :

-   Homework and Project: 50%

-   Final Exam : 50%

Lecture Slides :

We use the lecture slides of Prof. David Silver as a reference: David Silver
-   Lecture 0: Experiments Setup
-   Lecture 1: Introduction and Course Overview
-   Lecture 2: Markov Decision Processes
-   Lecture 3: Dynamic Programming
-   Lecture 4: Model-Free Prediction (1)
-   Lecture 5: Model-Free Prediction (2)
-   Lecture 6: Model-Free Control
-   Lecture 7: Value Function Approximation
-   Lecture 8: Convolutional Neural Network
-   Lecture 9: DQN Variants
-   Lecture 10: Policy Gradient
-   Lecture 11: Integrating Learning

Assignments :

-   Assignment 1: Dynamic Programming
-   Assignment 2: Model-Free Prediction
-   Assignment 3: Model-Free Control
-   Assignment 4: Value Function Approximation
-   Assignment 5: Policy Gradient

 

Discrete Mathematics (MA115, Spring 2018)


Time and Venue :

-   Time : 14:00 - 15:40 , Friday , Week 1 - 16

-   Venue : 东上院509

Instructor :

-   Prof : Junni Zou

-   Email : zou-jn@cs.sjtu.edu.cn

-   Office : 3-437, SEIEE Building

Teaching Assistant :

-   Teaching Assistant : Qiaoyu Lu

-   Email : luqiaoyu@sjtu.edu.cn

Reference Book :

-   Kenneth H.Rosen, Discrete Mathematics and Its Applications, Seventh Edition

-   Logic : Ch0-绪论
             Ch1-命题逻辑
             Ch2-命题演算
             Ch4-谓词逻辑
             Ch5-谓词演算

-   Graph : Ch1
               Ch2

Grading Policy :

-   Attendence and Homework : 30%

-   Final Exam : 70%

 


Last Updated: Dec. 5, 2017