Powered by OpenAIRE graph
Found an issue? Give us feedback
addClaim

Reinforcement learning for adaptive traffic signal control

Authors: Cheung, Wai Lun;

Reinforcement learning for adaptive traffic signal control

Abstract

Traffic signal control is very important to urban development. Most of the governments in developed countries continue to improve the safety as well as efficiency of their traffic signal control systems. In a well-designed traffic control system, fewer traffic accidents will occur. Moreover, the vehicle waiting times can also be shortened. In this thesis, we attempt to apply adaptive learning techniques to traffic signal control. In particular, emphasis is put on three important aspects: (a) Adaptation: The controller should adapt its signal time plan to varying traffic patterns that are characteristic of non-stationary stochastic environments. (b) Memorization: The controller should be capable of remembering previously encountered traffic patterns so that there is no need to re-learn the patterns when these situations are encountered again. (c) Cooperation: Controllers for different traffic junctions should interact and cooperate with each other to optimize global performance. Several control strategies are studied in the thesis. First, the regular time method is a non-adaptive method that serves as a baseline for comparison with other methods. An adaptive extension of the regular time method, called the adaptive regular time method, improves performance through its adaptiveness to traffic changes. This adaptive method is then further generalised by formulating the adaptive traffic signal control problem as a reinforcement learning problem. Two reinforcement learning methods, called LRP and Q-learning respectively, are studied. It is found that reinforcement learning is very effective for implementing adaptive traffic signal control systems that can satisfy the three criteria above.

Country
China (People's Republic of)
Keywords

000, Traffic signs and signals, Electronic traffic controls, Adaptive control systems, Traffic flow

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!