Zap Stochastic Approximation and Reinforcement Learning

When

07/10/2020    
11:00 am-12:00 pm
François Durand
Nokia Bell Labs France

Where

Paris-Rennes Room (EIT Digital)
23 avenue d'Italie, 75013 Paris

Event Type

Many reinforcement learning problems can be seen from the point of view of stochastic approximation. Unfortunately, classic stochastic approximation algorithms, such as Robbins-Monro, may have an infinite asymptotic variance. The class of “zap” algorithms aim at solving that problem. We then examine the application of zap algorithms to reinforcement learning, with the example of zap Q-learning.

Slides