Heuristically accelerated Q-learning: A new approach to speed up reinforcement learning

dc.contributor.advisorOrcidhttps://orcid.org/0000-0001-9097-827X
dc.contributor.authorReinaldo Bianchi
dc.contributor.authorRIBEIRO, C. H. C.
dc.contributor.authorCOSTA, A. H. R.
dc.date.accessioned2023-08-26T23:50:48Z
dc.date.available2023-08-26T23:50:48Z
dc.date.issued2004-01-05
dc.description.abstractThis work presents a new algorithm, called Heuristically Accelerated Q-Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algorithm Q-learning. A heuristic function H that influences the choice of the actions characterizes the HAQL algorithm. The heuristic function is strongly associated with the policy: it indicates that an action must be taken instead of another. This work also proposes an automatic method for the extraction of the heuristic function H from the learning process, called Heuristic from Exploration. Finally, experimental results shows that even a very simple heuristic results in a significant enhancement of performance of the reinforcement learning algorithm. © Springer-Verlag 2004.
dc.description.firstpage245
dc.description.lastpage254
dc.description.volume3171
dc.identifier.citationBIANCHI, R.; RIBEIRO, C. H. C.; COSTA, A. H. R. Heuristically accelerated Q-learning: A new approach to speed up reinforcement learning. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 3171, p. 245-254, 2004.
dc.identifier.issn0302-9743
dc.identifier.urihttps://repositorio.fei.edu.br/handle/FEI/5058
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.rightsAcesso Restrito
dc.subject.otherlanguageCognitive robotics
dc.subject.otherlanguageReinforcement learning
dc.titleHeuristically accelerated Q-learning: A new approach to speed up reinforcement learning
dc.typeArtigo
fei.scopus.citations43
fei.scopus.eid2-s2.0-33751369840
fei.scopus.subjectAutomatic method
fei.scopus.subjectCognitive robotics
fei.scopus.subjectHeuristic functions
fei.scopus.subjectLearning process
fei.scopus.subjectNew approaches
fei.scopus.subjectQ-learning
fei.scopus.subjectSpeed up
fei.scopus.updated2024-05-01
fei.scopus.urlhttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=33751369840&origin=inward
Arquivos
Coleções