descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jun 2024Embargo end date: 01 Jan 2022Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Neural Networks and Learning Systems, volume 35, pages 8,557-8,569 (issn: 2162-237X, eissn: 2162-2388,

Authors: Donghan Xie; Zhi Wang; Chunlin Chen; Daoyi Dong;

doi: 10.1109/tnnls.2022.3230701 , 10.48550/arxiv.2203.02896

pmid: 37015645

arXiv: 2203.02896

Depthwise Convolution for Multi-Agent Communication With Enhanced Mean-Field Approximation

- Summary
- Subjects
- External Databases
  (1)
- Metrics

Abstract

Multi-agent settings remain a fundamental challenge in the reinforcement learning (RL) domain due to the partial observability and the lack of accurate real-time interactions across agents. In this paper, we propose a new method based on local communication learning to tackle the multi-agent RL (MARL) challenge within a large number of agents coexisting. First, we design a new communication protocol that exploits the ability of depthwise convolution to efficiently extract local relations and learn local communication between neighboring agents. To facilitate multi-agent coordination, we explicitly learn the effect of joint actions by taking the policies of neighboring agents as inputs. Second, we introduce the mean-field approximation into our method to reduce the scale of agent interactions. To more effectively coordinate behaviors of neighboring agents, we enhance the mean-field approximation by a supervised policy rectification network (PRN) for rectifying real-time agent interactions and by a learnable compensation term for correcting the approximation bias. The proposed method enables efficient coordination as well as outperforms several baseline approaches on the adaptive traffic signal control (ATSC) task and the StarCraft II multi-agent challenge (SMAC).

Accepted by IEEE Transactions on Neural Networks, 2022, DOI: 10.1109/TNNLS.2022.3230701

Related Organizations

NANJING UNIVERSITY
China (People's Republic of)
Hebei University
China (People's Republic of)
UNSW Sydney
Australia
Nanjing University
China (People's Republic of)
Nanjing University
China (People's Republic of)

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems, Machine Learning (cs.LG), Multiagent Systems (cs.MA)

1ndq

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Top 10%

Average

Green

Fields of Science (4) View all

natural sciences

Fields of Science

natural sciences

View all

Funded by

ARC| ARC Future Fellowships - Grant ID: FT220100656