Téléchargement | - Voir la version finale : Hierarchical reinforcement learning for vehicle routing problems with time windows (PDF, 317 Kio)
|
---|
DOI | Trouver le DOI : https://doi.org/10.21428/594757db.f0516e23 |
---|
Auteur | Rechercher : Wang, Yunli1; Rechercher : Sun, Sun1; Rechercher : Li, Wei |
---|
Affiliation | - Conseil national de recherches du Canada. Technologies numériques
|
---|
Format | Texte, Article |
---|
Conférence | 34th Canadian Conference on Artificial Intelligence, Canadian AI 2021, May 25-28, 2021, online |
---|
Sujet | vehicle routing problem with time windows; hierarchical reinforcement learning; generalization |
---|
Résumé | Vehicle routing problem with time windows (VRPTW) is a practical and complex vehicle routing problem (VRP) which is faced by thousands of companies in logistics and transportation. Usually, VRP is solved by traditional heuristic algorithms. Recently, deep learning models under the reinforcement learning (RL) framework have been proposed to solve variants of VRP. In our study, we propose to use the hierarchical RL to find an optimal policy for generating optimal routes in VRPTW. The hierarchical RL structure includes a low level which generates feasible solutions and a high level which further searches for an optimal solution. Experimental results show that the proposed hierarchical RL model outperforms the non-hierarchical RL model and the heuristic algorithms Google OR-Tools. The proposed model also shows generalization capability in three different scenarios: varied time window constraints, from small-scale to large-scale problems, and generalization across different datasets. The flexible framework of hierarchical RL can also be applied to solve other complex VRPs with multiple objectives. |
---|
Date de publication | 2021-06-08 |
---|
Maison d’édition | Canadian Artificial Intelligence Association |
---|
Licence | |
---|
Dans | |
---|
Langue | anglais |
---|
Publications évaluées par des pairs | Oui |
---|
Exporter la notice | Exporter en format RIS |
---|
Signaler une correction | Signaler une correction (s'ouvre dans un nouvel onglet) |
---|
Identificateur de l’enregistrement | e02634fa-53d9-4666-8876-5db877efe04a |
---|
Enregistrement créé | 2021-08-11 |
---|
Enregistrement modifié | 2021-08-11 |
---|