Conference 2015
Top image

 
Home
Program LNMB Conference
Invited Speakers LNMB Conference
Program PhD presentations
Abstracts PhD presentations
Registration LNMB Conference
Announcement NGB/LNMB Seminar
Abstracts/Bios NGB/LNMB Seminar
Registration NGB/LNMB Seminar
Registered Participants
Conference Office
How to get there
 
Return to LNMB Site
 

Benjamin Van Roy: Learning to Optimize: Delayed Consequences

Abstract: Learning to make effective decisions that may influence observations appearing after subsequent decisions poses challenges beyond those faced when all consequences are immediate. In particular, observations must somehow be attributed to past actions. The area of reinforcement learning addresses this issue alongside the challenges of exploration and generalization. I will discuss reinforcement learning algorithms and results pertaining to them.