Using cases as heuristics in reinforcement learning: A transfer learning application

CELIBERTO JUNIOR, L. A.; MATSUURA, J. P.; DE MANTARAS, R. L.; Reinaldo Bianchi

dc.contributor.author	CELIBERTO JUNIOR, L. A.
dc.contributor.author	MATSUURA, J. P.
dc.contributor.author	DE MANTARAS, R. L.
dc.contributor.author	Reinaldo Bianchi
dc.contributor.authorOrcid	https://orcid.org/0000-0001-9097-827X
dc.date.accessioned	2022-01-12T22:03:05Z
dc.date.available	2022-01-12T22:03:05Z
dc.date.issued	2011-07-02
dc.description.abstract	In this paper we propose combining three AI techniques to speed up a Reinforcement Learning algorithm in a Transfer Learning problem: Case-based Reasoning, Heuristically Accelerated Reinforcement Learning and Neural Networks. To do so, we propose a new algorithm, called L3, which works in three stages: in the first stage, it uses Reinforcement Learning to learn how to perform one task and stores the optimal policy for this problem as a case base; in the second stage, it uses a Neural Network to map actions from one domain to actions in the other domain; and in the third stage, it uses the case base learned in the first stage as heuristics to speed up learning in a related, but different, task. The RL algorithm used in the first stage is Q-learning, and the one used in the third stage is the recently proposed Case-based Heuristically Accelerated Q-learning. A set of empirical evaluations was conducted in transferring learning between two domains, the Acrobot and the RoboCup 3D: the policy learned while solving the Acrobot problem is transferred and used to speed up the learning of stability policies for a humanoid robot in the RoboCup 3D simulator. The results show that the use of this algorithm can lead to a significant improvement in the performance of the agent.
dc.description.firstpage	1211
dc.description.lastpage	1217
dc.identifier.citation	CELIBERTO JUNIOR, L. A.; MATSUURA, J. P.; DE MANTARAS, R. L.; BIANCHI, R. Using cases as heuristics in reinforcement learning: A transfer learning application. IJCAI International Joint Conference on Artificial Intelligence. p. 1211-1217, July, 2011.
dc.identifier.doi	10.5591/978-1-57735-516-8/IJCAI11-206
dc.identifier.issn	1045-0823
dc.identifier.uri	https://repositorio.fei.edu.br/handle/FEI/4180
dc.relation.ispartof	IJCAI International Joint Conference on Artificial Intelligence
dc.rights	Restricted access
dc.title	Using cases as heuristics in reinforcement learning: A transfer learning application
dc.type	Conference paper
fei.scopus.citations	25
fei.scopus.eid	2-s2.0-84871606206
fei.scopus.subject	AI techniques
fei.scopus.subject	Empirical evaluations
fei.scopus.subject	Humanoid robot
fei.scopus.subject	Learning performance
fei.scopus.subject	Optimal policies
fei.scopus.subject	Third phase
fei.scopus.subject	Transfer learning
fei.scopus.subject	Two domains
fei.scopus.updated	2024-07-01
fei.scopus.url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84871606206&origin=inward
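
The third stage described in the abstract, where stored cases act as heuristics for Q-learning, can be illustrated with a minimal sketch. It is not taken from the paper: the function names, the parameters `xi`, `eta`, `epsilon`, `alpha` and `gamma`, and the specific heuristic formula are assumptions chosen for illustration, loosely following the general idea of heuristically accelerated Q-learning, in which a heuristic term biases action selection while the Q-value update stays unchanged.

```python
import random


def heuristic_from_case(q_row, suggested_action, eta=1.0):
    """Build H(s, .) so the case's suggested action wins the greedy choice.

    Assumed form: H = max(Q) - Q(s, a) + eta for the suggested action, else 0.
    """
    best = max(q_row)
    return [best - q + eta if a == suggested_action else 0.0
            for a, q in enumerate(q_row)]


def choose_action(q_row, h_row, xi=1.0, epsilon=0.1, rng=random):
    """Epsilon-greedy choice over Q(s, a) + xi * H(s, a)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))  # explore uniformly
    scores = [q + xi * h for q, h in zip(q_row, h_row)]
    return max(range(len(scores)), key=scores.__getitem__)


def q_update(q_row, action, reward, next_q_row, alpha=0.1, gamma=0.9):
    """Standard Q-learning update; the heuristic does not enter it."""
    target = reward + gamma * max(next_q_row)
    q_row[action] += alpha * (target - q_row[action])


# Hypothetical example: a retrieved case suggests action 2 in this state.
q_row = [0.5, 0.2, 0.1]
h_row = heuristic_from_case(q_row, suggested_action=2)
action = choose_action(q_row, h_row, epsilon=0.0)
```

Because the heuristic only influences which action is tried, a poorly matching case would slow exploration in this sketch but would not change what the Q-values converge to.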
