The University of Southampton
University of Southampton Institutional Repository

Multi-agent deep reinforcement learning based trajectory planning for multi-UAV assisted mobile edge computing

Multi-agent deep reinforcement learning based trajectory planning for multi-UAV assisted mobile edge computing
Multi-agent deep reinforcement learning based trajectory planning for multi-UAV assisted mobile edge computing
An unmanned aerial vehicle (UAV)-aided mobile edge computing (MEC) framework is proposed, where several UAVs having different trajectories fly over the target area and support the user equipments (UEs) on the ground. We aim to jointly optimize the geographical fairness among all the UEs, the fairness of each UAV’ UE-load and the overall energy consumption of UEs. The above optimization problem includes both integer and continues variables and it is challenging to solve. To address the above problem, a multi-agent deep reinforcement learning based trajectory control algorithm is proposed for managing the trajectory of each UAV independently, where the popular Multi-Agent Deep Deterministic Policy Gradient
(MADDPG) method is applied. Given the UAVs’ trajectories, a low-complexity approach is introduced for optimizing the offloading decisions of UEs. We show that our proposed solution has considerable performance over other traditional algorithms, both in terms of the fairness for serving UEs, fairness of UE-load at each UAV and energy consumption for all the UEs.
MADDPG, mobile edge computing, Multi-agent deep reinforcement learning, trajectory control, UAV
2332-7731
73-84
Wang, Liang
09b9a7f4-7732-43cb-aec0-ab671d309f93
Wang, Kezhi
338ae80c-5b3d-4c9b-b536-f8def4b858b5
Pan, Cunhua
9ee3d968-c5c2-42ba-8041-d54667205d5b
Xu, Wei
d012c621-8510-4ac3-bd83-9da7515b98d2
Aslam, Nauman
54d2a8cd-4ef5-454b-8df6-7b11f8a8b473
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
Wang, Liang
09b9a7f4-7732-43cb-aec0-ab671d309f93
Wang, Kezhi
338ae80c-5b3d-4c9b-b536-f8def4b858b5
Pan, Cunhua
9ee3d968-c5c2-42ba-8041-d54667205d5b
Xu, Wei
d012c621-8510-4ac3-bd83-9da7515b98d2
Aslam, Nauman
54d2a8cd-4ef5-454b-8df6-7b11f8a8b473
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1

Wang, Liang, Wang, Kezhi, Pan, Cunhua, Xu, Wei, Aslam, Nauman and Hanzo, Lajos (2020) Multi-agent deep reinforcement learning based trajectory planning for multi-UAV assisted mobile edge computing. IEEE Transactions on Cognitive Communications and Networking, 7 (1), 73-84, [9209079]. (doi:10.1109/TCCN.2020.3027695).

Record type: Article

Abstract

An unmanned aerial vehicle (UAV)-aided mobile edge computing (MEC) framework is proposed, where several UAVs having different trajectories fly over the target area and support the user equipments (UEs) on the ground. We aim to jointly optimize the geographical fairness among all the UEs, the fairness of each UAV’ UE-load and the overall energy consumption of UEs. The above optimization problem includes both integer and continues variables and it is challenging to solve. To address the above problem, a multi-agent deep reinforcement learning based trajectory control algorithm is proposed for managing the trajectory of each UAV independently, where the popular Multi-Agent Deep Deterministic Policy Gradient
(MADDPG) method is applied. Given the UAVs’ trajectories, a low-complexity approach is introduced for optimizing the offloading decisions of UEs. We show that our proposed solution has considerable performance over other traditional algorithms, both in terms of the fairness for serving UEs, fairness of UE-load at each UAV and energy consumption for all the UEs.

Text
Main_jrl - Accepted Manuscript
Download (747kB)

More information

Accepted/In Press date: 21 September 2020
Published date: 29 September 2020
Additional Information: Funding Information: The work of W. Xu was supported in part by the NSFC under grants 62022026 and 61871109. L. Hanzo would like to acknowledge the financial support of the Engineering and Physical Sciences Research Council projects EP/N004558/1, EP/P034284/1, EP/P034284/1, EP/P003990/1 (COALESCE), of the Royal Society?s Global Challenges Research Fund Grant as well as of the European Research Council?s Advanced Fellow Grant QuantCom. Publisher Copyright: © 2015 IEEE. Copyright: Copyright 2021 Elsevier B.V., All rights reserved.
Keywords: MADDPG, mobile edge computing, Multi-agent deep reinforcement learning, trajectory control, UAV

Identifiers

Local EPrints ID: 444183
URI: http://eprints.soton.ac.uk/id/eprint/444183
ISSN: 2332-7731
PURE UUID: ced5d94e-e90d-458a-a06e-8c76a7c8bba0
ORCID for Lajos Hanzo: ORCID iD orcid.org/0000-0002-2636-5214

Catalogue record

Date deposited: 30 Sep 2020 16:31
Last modified: 18 Mar 2024 02:36

Export record

Altmetrics

Contributors

Author: Liang Wang
Author: Kezhi Wang
Author: Cunhua Pan
Author: Wei Xu
Author: Nauman Aslam
Author: Lajos Hanzo ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×