3D UAV trajectory and data collection optimisation via deep reinforcement learning
3D UAV trajectory and data collection optimisation via deep reinforcement learning
Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on-board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV assisted IoT system relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.
2358 - 2371
Nguyen, Khoi Khac
f6b4b72c-d404-4cb0-a472-95d78d3b90f5
Duong, Trung Q.
406d80a2-b57f-4955-85aa-c5bc5a236b04
Do-Duy, Tan
cde17472-d115-4685-9a42-95d4eb19083e
Claussen, Holger
de6f8584-39a9-428c-aea3-41ffd2937512
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
Nguyen, Khoi Khac
f6b4b72c-d404-4cb0-a472-95d78d3b90f5
Duong, Trung Q.
406d80a2-b57f-4955-85aa-c5bc5a236b04
Do-Duy, Tan
cde17472-d115-4685-9a42-95d4eb19083e
Claussen, Holger
de6f8584-39a9-428c-aea3-41ffd2937512
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
Nguyen, Khoi Khac, Duong, Trung Q., Do-Duy, Tan, Claussen, Holger and Hanzo, Lajos
(2022)
3D UAV trajectory and data collection optimisation via deep reinforcement learning.
IEEE Transactions on Communications, 70 (4), .
(doi:10.1109/TCOMM.2022.3148364).
Abstract
Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on-board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV assisted IoT system relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.
Text
version9
- Accepted Manuscript
More information
Accepted/In Press date: 29 January 2022
e-pub ahead of print date: 1 February 2022
Identifiers
Local EPrints ID: 454675
URI: http://eprints.soton.ac.uk/id/eprint/454675
ISSN: 0090-6778
PURE UUID: 3cd47ce3-e332-49cd-b0a2-bc5baae974ea
Catalogue record
Date deposited: 21 Feb 2022 17:31
Last modified: 18 Mar 2024 05:15
Export record
Altmetrics
Contributors
Author:
Khoi Khac Nguyen
Author:
Trung Q. Duong
Author:
Tan Do-Duy
Author:
Holger Claussen
Author:
Lajos Hanzo
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics