Back to Basics: Deep Reinforcement Learning in Traffic Signal Control

Sierk Kanis, Laurens Samson, Daan Bloembergen, Tim Bakker

November 2021

Deep RL for traffic signal control

Abstract

In this paper we revisit some of the fundamental premises for a reinforcement learning (RL) approach to self-learning traffic lights. We propose RLight, a combination of design choices that offers robust performance and good generalization to unseen traffic flows. Our main contributions are threefold. First, our lightweight and cluster-aware state representation leads to improved performance. Second, we reformulate the Markov Decision Process (MDP) so that it skips redundant yellow-light timesteps, speeding up learning by 30%. Third, we investigate the action space and provide insight into the difference in performance between acyclic and cyclic phase transitions. Additionally, we provide insights into how well the methods generalize to unseen traffic. Evaluations using the real-world Hangzhou traffic dataset show that RLight outperforms state-of-the-art rule-based and deep reinforcement learning algorithms, demonstrating the potential of RL-based methods to improve urban traffic flows.
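The idea of skipping yellow-light timesteps in the MDP can be illustrated with a minimal sketch. It is not the RLight implementation: the toy intersection, the names `ToyIntersection`, `agent_step` and `YELLOW_STEPS`, the negative-queue reward, and the choice to sum rewards over the skipped ticks are all assumptions made for illustration. The point is only that the simulator still runs through the yellow ticks, while the agent sees a single transition per decision.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

YELLOW_STEPS = 3  # assumed yellow duration in simulator ticks (hypothetical value)


@dataclass
class ToyIntersection:
    """Hypothetical two-phase intersection; queues[i] is served by phase i."""
    queues: List[int] = field(default_factory=lambda: [4, 2])
    phase: int = 0
    ticks: int = 0

    def tick(self, green: bool) -> int:
        """Advance one simulator tick; return the reward (negative total queue)."""
        if green and self.queues[self.phase] > 0:
            self.queues[self.phase] -= 1  # one vehicle departs on green
        self.ticks += 1
        return -sum(self.queues)


def agent_step(env: ToyIntersection, action: int) -> Tuple[List[int], int, int]:
    """One agent-level transition.

    If the chosen phase differs from the current one, the yellow ticks are
    simulated internally and their rewards accumulated (an assumption), so
    the agent never has to act on, or learn from, a yellow timestep.
    """
    reward = 0
    elapsed = 0
    if action != env.phase:
        for _ in range(YELLOW_STEPS):
            reward += env.tick(green=False)  # nobody departs during yellow
            elapsed += 1
        env.phase = action
    reward += env.tick(green=True)
    elapsed += 1
    return list(env.queues), reward, elapsed
```

In this sketch, keeping the current phase costs one simulator tick per agent step, while switching phase costs `YELLOW_STEPS + 1` ticks but still yields exactly one transition for the learner.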

Type

Conference paper

Publication

In The 10th International Workshop on Urban Computing (UrbComp, 2021)

Deep RL, Urban computing, Self-organising traffic lights

Tim Bakker

PhD researcher in Machine Learning