Heuristically accelerated reinforcement learning: Theoretical and experimental results

Reinaldo Bianchi; RIBEIRO, C. H. C.; COSTA, A. H. R.

Heuristically accelerated reinforcement learning: Theoretical and experimental results

Citações na Scopus

21

Tipo de produção

Artigo de evento

Data

2012-08-05

Autores

Reinaldo Bianchi
RIBEIRO, C. H. C.
COSTA, A. H. R.

Periódico

Frontiers in Artificial Intelligence and Applications

Citação

BIANCHI, R.; RIBEIRO, C. H. C.; COSTA, A. H. R. Heuristically accelerated reinforcement learning: Theoretical and experimental results. Frontiers in Artificial Intelligence and Applications, v. 242, p. 169-174, Aug. 2012.

Texto completo (DOI)

10.3233/978-1-61499-098-7-169

URI

https://repositorio.fei.edu.br/handle/FEI/4151

Resumo

Since finding control policies using Reinforcement Learning (RL) can be very time consuming, in recent years several authors have investigated how to speed up RL algorithms by making improved action selections based on heuristics. In this work we present new theoretical results - convergence and a superior limit for value estimation errors - for the class that encompasses all heuristics-based algorithms, called Heuristically Accelerated Reinforcement Learning. We also expand this new class by proposing three new algorithms, the Heuristically Accelerated Q(λ), SARSA(λ) and TD(λ), the first algorithms that uses both heuristics and eligibility traces. Empirical evaluations were conducted in traditional control problems and results show that using heuristics significantly enhances the performance of the learning process. © 2012 The Author(s).

Coleções

Artigos

Página do item completo