Hierarchical reinforcement learning for vehicle routing problems with time windows

Par Conseil national de recherches du Canada

Téléchargement	Voir la version finale : Hierarchical reinforcement learning for vehicle routing problems with time windows (PDF, 317 Kio)
DOI	Trouver le DOI : https://doi.org/10.21428/594757db.f0516e23
Auteur	Rechercher : Wang, Yunli¹; Rechercher : Sun, Sun¹; Rechercher : Li, Wei
Affiliation	Conseil national de recherches du Canada. Technologies numériques
Format	Texte, Article
Conférence	34th Canadian Conference on Artificial Intelligence, Canadian AI 2021, May 25-28, 2021, online
Sujet	vehicle routing problem with time windows; hierarchical reinforcement learning; generalization
Résumé	Vehicle routing problem with time windows (VRPTW) is a practical and complex vehicle routing problem (VRP) which is faced by thousands of companies in logistics and transportation. Usually, VRP is solved by traditional heuristic algorithms. Recently, deep learning models under the reinforcement learning (RL) framework have been proposed to solve variants of VRP. In our study, we propose to use the hierarchical RL to find an optimal policy for generating optimal routes in VRPTW. The hierarchical RL structure includes a low level which generates feasible solutions and a high level which further searches for an optimal solution. Experimental results show that the proposed hierarchical RL model outperforms the non-hierarchical RL model and the heuristic algorithms Google OR-Tools. The proposed model also shows generalization capability in three different scenarios: varied time window constraints, from small-scale to large-scale problems, and generalization across different datasets. The flexible framework of hierarchical RL can also be applied to solve other complex VRPs with multiple objectives.
Date de publication	2021-06-08
Maison d’édition	Canadian Artificial Intelligence Association
Licence	Creative Commons, Attribution 4.0 Générique (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/deed.fr
Dans	Proceedings of the 34th Canadian Conference on Artificial Intelligence (8 juin 2021).
Langue	anglais
Publications évaluées par des pairs	Oui
Exporter la notice	Exporter en format RIS
Signaler une correction	Signaler une correction (s'ouvre dans un nouvel onglet)
Identificateur de l’enregistrement	e02634fa-53d9-4666-8876-5db877efe04a
Enregistrement créé	2021-08-11
Enregistrement modifié	2021-08-11

Date de modification :: 2024-07-21