Reinforcement learning in a physics-inspired semi-Markov environment

By National Research Council of Canada

DOI	https://doi.org/10.1007/978-3-030-47358-7_6
Author	Bellinger, Colin¹; Coles, Rory; Crowley, Mark; Tamblyn, Isaac²
Affiliation	National Research Council of Canada. Digital Technologies; National Research Council of Canada. Security and Disruptive Technologies
Format	Text, Article
Conference	Canadian Conference on Artificial Intelligence, Canadian AI 2020: Advances in Artificial Intelligence, May 13–15, 2020, Ottawa, Ontario, Canada
Subject	materials science; reinforcement learning; semi-Markov decision processes
Abstract	Reinforcement learning (RL) has been demonstrated to have great potential in many applications of scientific discovery and design. Recent work includes, for example, the design of new structures and compositions of molecules for therapeutic drugs. Much of the existing work related to the application of RL to scientific domains, however, assumes that the available state representation obeys the Markov property. For reasons associated with time, cost, sensor accuracy, and gaps in scientific knowledge, many scientific design and discovery problems do not satisfy the Markov property. Thus, something other than a Markov decision process (MDP) should be used to plan/find the optimal policy. In this paper, we present a physics-inspired semi-Markov RL environment, namely the phase change environment. In addition, we evaluate the performance of value-based RL algorithms for both MDPs and partially observable MDPs (POMDPs) on the proposed environment. Our results demonstrate deep recurrent Q-networks (DRQN) significantly outperform deep Q-networks (DQN), and that DRQNs benefit from training with hindsight experience replay. Implications for the use of semi-Markovian RL and POMDPs for scientific laboratories are also discussed.
Publication date	2020-05-06
Publisher	Springer Nature Switzerland AG
In	Advances in Artificial Intelligence: 33rd Canadian Conference on Artificial Intelligence, Canadian AI 2020, Ottawa, ON, Canada, May 13–15, 2020, Proceedings, 239929: 55–66.
Series	Lecture Notes in Computer Science 12109.
Language	English
Peer reviewed publication	Yes
Record identifier	3f553680-18d6-429c-8fc0-4c3ae973c0aa
Record created	2020-07-24
Record modified	2022-01-14

Date modified: 2024-07-16