The University of Southampton
University of Southampton Institutional Repository

Deep reinforcement learning aided platoon control relying on V2X information

Deep reinforcement learning aided platoon control relying on V2X information
Deep reinforcement learning aided platoon control relying on V2X information
The impact of Vehicle-to-Everything (V2X) communications on platoon control performance is investigated. Platoon control is essentially a sequential stochastic decision problem (SSDP), which can be solved by Deep Reinforcement Learning (DRL) to deal with both the control constraints and uncertainty in the platoon leading vehicle’s behavior. In this context, the value of V2X communications for DRL-based platoon controllers is studied with an emphasis on the tradeoff between the gain of including exogenous information in the system state for reducing uncertainty and the performance erosion due to the curse-of-dimensionality. Our objective is to find the specific set of information that should be shared among the vehicles for the construction of the most appropriate state space. SSDP models are conceived for platoon control under different information topologies (IFT) by taking into account ‘just sufficient’ information. Furthermore, theorems are established for comparing the performance of their optimal policies. In order to determine whether a piece of information should or should not be transmitted for improving the DRL-based control policy, we quantify its value by deriving the conditional KL divergence of the transition models. More meritorious information is given higher priority in transmission, since including it in the state space has a higher probability in offsetting the negative effect of having higher state dimensions. Finally, simulation results are provided to illustrate the theoretical analysis.
0018-9545
Lei, Lei
72192a6f-a439-4fd4-8010-d389b5ae3bf8
Liu, Tong
17f1a70b-449d-4078-af64-957a5b374698
Zheng, Kan
1141004c-e359-4b26-a49b-2a821d76edf0
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
Lei, Lei
72192a6f-a439-4fd4-8010-d389b5ae3bf8
Liu, Tong
17f1a70b-449d-4078-af64-957a5b374698
Zheng, Kan
1141004c-e359-4b26-a49b-2a821d76edf0
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1

Lei, Lei, Liu, Tong, Zheng, Kan and Hanzo, Lajos (2022) Deep reinforcement learning aided platoon control relying on V2X information. IEEE Transactions on Vehicular Technology. (In Press)

Record type: Article

Abstract

The impact of Vehicle-to-Everything (V2X) communications on platoon control performance is investigated. Platoon control is essentially a sequential stochastic decision problem (SSDP), which can be solved by Deep Reinforcement Learning (DRL) to deal with both the control constraints and uncertainty in the platoon leading vehicle’s behavior. In this context, the value of V2X communications for DRL-based platoon controllers is studied with an emphasis on the tradeoff between the gain of including exogenous information in the system state for reducing uncertainty and the performance erosion due to the curse-of-dimensionality. Our objective is to find the specific set of information that should be shared among the vehicles for the construction of the most appropriate state space. SSDP models are conceived for platoon control under different information topologies (IFT) by taking into account ‘just sufficient’ information. Furthermore, theorems are established for comparing the performance of their optimal policies. In order to determine whether a piece of information should or should not be transmitted for improving the DRL-based control policy, we quantify its value by deriving the conditional KL divergence of the transition models. More meritorious information is given higher priority in transmission, since including it in the state space has a higher probability in offsetting the negative effect of having higher state dimensions. Finally, simulation results are provided to illustrate the theoretical analysis.

Text
platoon_final - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (1MB)

More information

Accepted/In Press date: 20 March 2022

Identifiers

Local EPrints ID: 456132
URI: http://eprints.soton.ac.uk/id/eprint/456132
ISSN: 0018-9545
PURE UUID: 4ed497e9-e4a3-490a-84b8-09856b0b48bb
ORCID for Lajos Hanzo: ORCID iD orcid.org/0000-0002-2636-5214

Catalogue record

Date deposited: 26 Apr 2022 14:59
Last modified: 17 Mar 2024 02:35

Export record

Contributors

Author: Lei Lei
Author: Tong Liu
Author: Kan Zheng
Author: Lajos Hanzo ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×