Scopus Harvesting Series

TransformerLight: A Novel Sequence Modeling Based Traffic Signaling Mechanism via Gated Transformer

Qiang Wu, University of Electronic Science and Technology of China
Mingyuan Li, Beijing University of Posts and Telecommunications
Jun Shen, University of Wollongong
Linyuan Lü, University of Science and Technology of China
Bo Du, University of Wollongong
Ke Zhang, Beijing University of Posts and Telecommunications

Publication Name

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Abstract

Traffic signal control (TSC) is still one of the most significant and challenging research problems in the transportation field. Reinforcement learning (RL) has achieved great success in TSC but suffers from critically high learning costs in practical applications due to the excessive trial-and-error learning process. Offline RL is a promising method to reduce learning costs whereas the data distribution shift issue is still up in the air. To this end, in this paper, we formulate TSC as a sequence modeling problem with a sequence of Markov decision process described by states, actions, and rewards from the traffic environment. A novel framework, namely TransformerLight, is introduced, which does not aim to fit into value functions by averaging all possible returns, but produces the best possible actions using a gated Transformer. Additionally, the learning process of TransformerLight is much more stable by replacing the residual connections with gated transformer blocks due to a dynamic system perspective. Through numerical experiments on offline datasets, we demonstrate that the TransformerLight model: (1) can build a high-performance adaptive TSC model without dynamic programming; (2) achieves a new state-of-the-art compared to most published offline RL methods so far; and (3) shows a more stable learning process than offline RL and recent Transformer-based methods. The relevant dataset and code are available at Github.

Open Access Status

This publication may be available as open access

First Page

2639

Last Page

2647

Funding Number

11622538

Funding Sponsor

National Natural Science Foundation of China

Link to Full Text

COinS

Link to publisher version (DOI)

http://dx.doi.org/10.1145/3580305.3599530

Scopus Harvesting Series

TransformerLight: A Novel Sequence Modeling Based Traffic Signaling Mechanism via Gated Transformer

Publication Name

Abstract

Open Access Status

First Page

Last Page

Funding Number

Funding Sponsor

Link to publisher version (DOI)

Search

Browse

Links

Scopus Harvesting Series

TransformerLight: A Novel Sequence Modeling Based Traffic Signaling Mechanism via Gated Transformer

Authors

Publication Name

Abstract

Open Access Status

First Page

Last Page

Funding Number

Funding Sponsor

Share

Link to publisher version (DOI)

Search

Browse

Links