Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems
Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems
The integration of low earth orbit (LEO) satellites with terrestrial communication networks holds the promise of seamless global connectivity. The efficiency of this connection,however, depends on the availability of reliable channel state information (CSI). Due to the large space-ground propagation delays, the estimated CSI is outdated. In this paper we consider the downlink of a satellite operating as a base station in support of multiple mobile users. The estimated outdated CSI is used at the satellite side to design a transmit precoding (TPC) matrix for the downlink. We propose a deep reinforcement learning(DRL)-based approach to optimize the TPC matrices, with the goal of maximizing the achievable data rate. We utilize the deep deterministic policy gradient (DDPG) algorithm to handle the continuous action space, and we employ state augmentation techniques to deal with the delayed observations and rewards.We show that the DRL agent is capable of exploiting the time-domain correlations of the channels for constructing accurate TPC matrices. This is because the proposed method is capable of compensating for the effects of delayed CSI in different frequency bands. Furthermore, we study the effect of handovers in the system, and show that the DRL agent is capable of promptly adapting to the environment when a handover occurs.
6G, Reinforcement Learning, LEO Satellite Communication, Non-terrestrial Networks, Delayed CSI, Machine Learning
Omid, Yasaman
d316c976-2288-4771-a4db-62a6755a63ef
Aristodemou, Marios
b6187b3e-15fd-4ef2-a788-ac253750126d
Lambotharan, Sangarapillai
9839317e-0bf4-4d7c-8722-87d3ec9086de
Derakhshani, Mahsa
24b67912-a4bd-4b16-bc36-14051f17987f
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
13 May 2025
Omid, Yasaman
d316c976-2288-4771-a4db-62a6755a63ef
Aristodemou, Marios
b6187b3e-15fd-4ef2-a788-ac253750126d
Lambotharan, Sangarapillai
9839317e-0bf4-4d7c-8722-87d3ec9086de
Derakhshani, Mahsa
24b67912-a4bd-4b16-bc36-14051f17987f
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
Omid, Yasaman, Aristodemou, Marios, Lambotharan, Sangarapillai, Derakhshani, Mahsa and Hanzo, Lajos
(2025)
Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems.
IEEE Transactions on Communications.
(doi:10.1109/TCOMM.2025.3569690).
Abstract
The integration of low earth orbit (LEO) satellites with terrestrial communication networks holds the promise of seamless global connectivity. The efficiency of this connection,however, depends on the availability of reliable channel state information (CSI). Due to the large space-ground propagation delays, the estimated CSI is outdated. In this paper we consider the downlink of a satellite operating as a base station in support of multiple mobile users. The estimated outdated CSI is used at the satellite side to design a transmit precoding (TPC) matrix for the downlink. We propose a deep reinforcement learning(DRL)-based approach to optimize the TPC matrices, with the goal of maximizing the achievable data rate. We utilize the deep deterministic policy gradient (DDPG) algorithm to handle the continuous action space, and we employ state augmentation techniques to deal with the delayed observations and rewards.We show that the DRL agent is capable of exploiting the time-domain correlations of the channels for constructing accurate TPC matrices. This is because the proposed method is capable of compensating for the effects of delayed CSI in different frequency bands. Furthermore, we study the effect of handovers in the system, and show that the DRL agent is capable of promptly adapting to the environment when a handover occurs.
Text
Final PDF
- Accepted Manuscript
More information
Accepted/In Press date: 30 April 2025
e-pub ahead of print date: 13 May 2025
Published date: 13 May 2025
Keywords:
6G, Reinforcement Learning, LEO Satellite Communication, Non-terrestrial Networks, Delayed CSI, Machine Learning
Identifiers
Local EPrints ID: 501966
URI: http://eprints.soton.ac.uk/id/eprint/501966
ISSN: 0090-6778
PURE UUID: bed7d590-67e3-4f2d-b186-30a6bf399d29
Catalogue record
Date deposited: 12 Jun 2025 17:09
Last modified: 04 Sep 2025 01:58
Export record
Altmetrics
Contributors
Author:
Yasaman Omid
Author:
Marios Aristodemou
Author:
Sangarapillai Lambotharan
Author:
Mahsa Derakhshani
Author:
Lajos Hanzo
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics