Name: Collaborative Deep Reinforcement Learning for Solving Multi-Objective Vehicle Routing Problems
Keywords: Deep reinforcement learning, Artificial Intelligence and Robotics, Attention network, Collaborative active search, Multi-objective vehicle routing problems, 004

descriptionPublicationkeyboard_double_arrow_right Conference object , Article 06 May 2024 English Publisher:International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)

Authors: WU, Yaoxin; FAN, Mingfeng; CAO, Zhiguang; GAO, Ruobin; HOU, Yaqing; SARTORETTI, Guillaume;

Collaborative Deep Reinforcement Learning for Solving Multi-Objective Vehicle Routing Problems

- Summary
- Subjects
- Metrics

Abstract

Existing deep reinforcement learning (DRL) methods for multi-objective vehicle routing problems (MOVRPs) typically decompose an MOVRP into subproblems with respective preferences and then train policies to solve corresponding subproblems. However, such a paradigm is still less effective in tackling the intricate interactions among subproblems, thus holding back the quality of the Pareto solutions. To counteract this limitation, we introduce a collaborative deep reinforcement learning method. We first propose a preference-based attention network (PAN) that allows the DRL agents to reason out solutions to subproblems in parallel, where a shared encoder learns the instance embedding and a decoder is tailored for each agent by preference intervention to construct respective solutions. Then, we design a collaborative active search (CAS) to further improve the solution quality, which updates only a part of the decoder parameters per instance during inference. In the CAS process, we also explicitly foster the interactions of neighboring DRL agents by imitation learning, empowering them to exchange insights of elite solutions to similar subproblems. Extensive results on random and benchmark instances verified the efficacy of PAN and CAS, which is particularly pronounced on the configurations (i.e., problem sizes or node distributions) beyond the training ones. Our code is available at https://github.com/marmotlab/PAN-CAS.

Related Organizations

Singapore Management University
Singapore
Technical University Eindhoven
Netherlands

Keywords

Deep reinforcement learning, Artificial Intelligence and Robotics, Attention network, Collaborative active search, Multi-objective vehicle routing problems, 004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Related to Research communities

Netherlands Research Portal