Zap Stochastic Approximation and Reinforcement Learning

When

07/10/2020    
11:00 am-12:00 pm
François Durand
Nokia Bell Labs France

Where

Paris-Rennes Room (EIT Digital)
23 avenue d'Italie, 75013 Paris

Event Type

Many reinforcement learning problems can be seen from the point of view of stochastic approximation. Unfortunately, classic stochastic approximation algorithms, such as Robbins-Monro, may have an infinite asymptotic variance. The class of “zap” algorithms aim at solving that problem. We then examine the application of zap algorithms to reinforcement learning, with the example of zap Q-learning.

Slides

Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.