3D UAV trajectory and data collection optimisation via deep reinforcement learning

Unmanned aerial vehicles (UAVs) are beginning to be deployed to enhance network performance and coverage in wireless communication. However, because their on-board power and flight time are limited, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV-assisted IoT system that relies on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. A deep reinforcement learning-based technique is then conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV can autonomously collect all the data from user nodes with a significant improvement in total sum-rate while minimising the resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, the trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.

10.1109/TCOMM.2022.3148364

0090-6778

2358 - 2371

Nguyen, Khoi Khac

f6b4b72c-d404-4cb0-a472-95d78d3b90f5

Duong, Trung Q.

406d80a2-b57f-4955-85aa-c5bc5a236b04

Do-Duy, Tan

cde17472-d115-4685-9a42-95d4eb19083e

Claussen, Holger

de6f8584-39a9-428c-aea3-41ffd2937512

Hanzo, Lajos

66e7266f-3066-4fc0-8391-e000acce71a1