Answer set programming for non-stationary Markov decision processes

Ferreira L.A.; C. Bianchi R.A.; Santos P.E.; de Mantaras R.L.

Answer set programming for non-stationary Markov decision processes

dc.contributor.author	Ferreira L.A.
dc.contributor.author	C. Bianchi R.A.
dc.contributor.author	Santos P.E.
dc.contributor.author	de Mantaras R.L.
dc.date.accessioned	2019-08-19T23:45:19Z
dc.date.available	2019-08-19T23:45:19Z
dc.date.issued	2017
dc.description.abstract	© 2017, Springer Science+Business Media New York.Non-stationary domains, where unforeseen changes happen, present a challenge for agents to find an optimal policy for a sequential decision making problem. This work investigates a solution to this problem that combines Markov Decision Processes (MDP) and Reinforcement Learning (RL) with Answer Set Programming (ASP) in a method we call ASP(RL). In this method, Answer Set Programming is used to find the possible trajectories of an MDP, from where Reinforcement Learning is applied to learn the optimal policy of the problem. Results show that ASP(RL) is capable of efficiently finding the optimal solution of an MDP representing non-stationary domains.
dc.description.firstpage	993
dc.description.issuenumber	4
dc.description.lastpage	1007
dc.description.volume	47
dc.identifier.citation	Anjoletto, L.; Bianchi; Santos, Paulo; De MANTARAS, R. L.. Answer Set Programming for Non-Stationary Markov Decision Processes. APPLIED INTELLIGENCE, v. 1, p. 1, 2017.
dc.identifier.doi	10.1007/s10489-017-0988-y
dc.identifier.issn	1573-7497
dc.identifier.uri	https://repositorio.fei.edu.br/handle/FEI/1210
dc.relation.ispartof	Applied Intelligence
dc.rights	Acesso Restrito
dc.subject.otherlanguage	Action languages
dc.subject.otherlanguage	Answer set programming
dc.subject.otherlanguage	Markov decision processes
dc.subject.otherlanguage	Non-determinism
dc.title	Answer set programming for non-stationary Markov decision processes
dc.type	Artigo
fei.scopus.citations	10
fei.scopus.eid	2-s2.0-85025441238
fei.scopus.subject	Action language
fei.scopus.subject	Answer set programming
fei.scopus.subject	Markov Decision Processes
fei.scopus.subject	Non Determinism
fei.scopus.subject	Nonstationary
fei.scopus.subject	Optimal policies
fei.scopus.subject	Optimal solutions
fei.scopus.subject	Sequential decision making
fei.scopus.updated	2024-07-01
fei.scopus.url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85025441238&origin=inward

Coleções

Artigos

Answer set programming for non-stationary Markov decision processes

Arquivos

Coleções