Seminário Matemática, Física & Aprendizagem Automática – Pedro Santos
“Two-time scale stochastic approximation for reinforcement learning with linear function approximation” - 14h...
“Two-time scale stochastic approximation for reinforcement learning with linear function approximation” - 14h...