Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch

descriptionPublicationkeyboard_double_arrow_right Article 01 Nov 2012Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), volume 42, pages 1,742-1,751 (issn: 1094-6977, eissn: 1558-2442,

Copyright policy )

Authors: Yinliang Xu; Wei Zhang 0111; Wenxin Liu 0001; Frank T. Ferrese;

doi: 10.1109/tsmcc.2012.2218596

Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch

- Summary
- Metrics

Abstract

This paper proposes a fully distributed multiagent-based reinforcement learning method for optimal reactive power dispatch. According to the method, two agents communicate with each other only if their corresponding buses are electrically coupled. The global rewards that are required for learning are obtained with a consensus-based global information discovery algorithm, which has been demonstrated to be efficient and reliable. Based on the discovered global rewards, a distributed Q-learning algorithm is implemented to minimize the active power loss while satisfying operational constraints. The proposed method does not require accurate system model and can learn from scratch. Simulation studies with power systems of different sizes show that the method is very computationally efficient and able to provide near-optimal solutions. It can be observed that prior knowledge can significantly speed up the learning process and decrease the occurrences of undesirable disturbances. The proposed method has good potential for online implementation.

Related Organizations

Naval Sea Systems Command
United States
Naval Surface Warfare Center
United States
New Mexico State University
United States

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	118
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%