Heuristically-accelerated multiagent reinforcement learning

Reinaldo Bianchi; MARTINS, M. F.; RIBEIRO, C. H. C.; COSTA, A. H. R.

dc.contributor.author	Reinaldo Bianchi
dc.contributor.author	MARTINS, M. F.
dc.contributor.author	RIBEIRO, C. H. C.
dc.contributor.author	COSTA, A. H. R.
dc.contributor.authorOrcid	https://orcid.org/0000-0001-9097-827X
dc.date.accessioned	2022-01-12T22:00:41Z
dc.date.available	2022-01-12T22:00:41Z
dc.date.issued	2014-02-05
dc.description.abstract	This paper presents a novel class of algorithms, called Heuristically-Accelerated Multiagent Reinforcement Learning (HAMRL), which allows the use of heuristics to speed up well-known multiagent reinforcement learning (RL) algorithms such as Minimax-Q. HAMRL algorithms are characterized by a heuristic function, which suggests the selection of particular actions over others. This function represents an initial action selection policy, which can be handcrafted, extracted from previous experience in distinct domains, or learnt from observation. To validate the proposal, a thorough theoretical analysis proving the convergence of four algorithms from the HAMRL class (HAMMQ, HAMQ(λ), HAMQS, and HAMS) is presented. In addition, a comprehensive, systematic evaluation was conducted in two distinct adversarial domains. The results show that even the most straightforward heuristics can produce virtually optimal action selection policies in far fewer episodes, significantly improving the performance of HAMRL algorithms over vanilla RL algorithms. © 2013 IEEE.
dc.description.firstpage	252
dc.description.issuenumber	2
dc.description.lastpage	265
dc.description.volume	44
dc.identifier.citation	BIANCHI, R.; MARTINS, M. F.; RIBEIRO, C. H. C.; COSTA, A. H. R. Heuristically-accelerated multiagent reinforcement learning. IEEE Transactions on Cybernetics, v. 44, n. 2, p. 252-265, Feb. 2014.
dc.identifier.doi	10.1109/TCYB.2013.2253094
dc.identifier.issn	2168-2267
dc.identifier.uri	https://repositorio.fei.edu.br/handle/FEI/4016
dc.relation.ispartof	IEEE Transactions on Cybernetics
dc.rights	Restricted access
dc.subject.otherlanguage	Artificial intelligence
dc.subject.otherlanguage	heuristic algorithms
dc.subject.otherlanguage	machine learning
dc.subject.otherlanguage	multiagent systems
dc.title	Heuristically-accelerated multiagent reinforcement learning
dc.type	Article
fei.scopus.citations	58
fei.scopus.eid	2-s2.0-84893355297
fei.scopus.subject	Action selection
fei.scopus.subject	Heuristic functions
fei.scopus.subject	Multi-agent reinforcement learning
fei.scopus.subject	Optimal actions
fei.scopus.subject	Speed up
fei.scopus.updated	2024-08-01
fei.scopus.url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84893355297&origin=inward
