MOEA with adaptive operator based on reinforcement learning for weapon target assignment

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2024Publisher:American Institute of Mathematical Sciences (AIMS)Journal:Electronic Research Archive, volume 32, pages 1,498-1,532 (issn: 2688-1594,

Authors: Shiqi Zou; Xiaoping Shi; Shenmin Song;

doi: 10.3934/era.2024069

MOEA with adaptive operator based on reinforcement learning for weapon target assignment

- Summary
- Subjects
- Metrics

Abstract

<abstract><p>Weapon target assignment (WTA) is a typical problem in the command and control of modern warfare. Despite the significance of the problem, traditional algorithms still have shortcomings in terms of efficiency, solution quality, and generalization. This paper presents a novel multi-objective evolutionary optimization algorithm (MOEA) that integrates a deep Q-network (DQN)-based adaptive mutation operator and a greedy-based crossover operator, designed to enhance the solution quality for the multi-objective WTA (MO-WTA). Our approach (NSGA-DRL) evolves NSGA-II by embedding these operators to strike a balance between exploration and exploitation. The DQN-based adaptive mutation operator is developed for predicting high-quality solutions, thereby improving the exploration process and maintaining diversity within the population. In parallel, the greedy-based crossover operator employs domain knowledge to minimize ineffective searches, focusing on exploitation and expediting convergence. Ablation studies revealed that our proposed operators significantly boost the algorithm performance. In particular, the DQN mutation operator shows its predictive effectiveness in identifying candidate solutions. The proposed NSGA-DRL outperforms state-and-art MOEAs in solving MO-WTA problems by generating high-quality solutions.</p></abstract>

Related Organizations

Harbin Institute of Technology
China (People's Republic of)

Keywords

reinforcement learning, T57-57.97, Applied mathematics. Quantitative methods, deep q-network, weapon target assignment, multi-objective evolutionary algorithm, QA1-939, exploration and exploration, Mathematics

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Top 10%

Average

gold

Fields of Science (3) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all

Related to Research communities

UArctic