Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems

The integration of low earth orbit (LEO) satellites with terrestrial communication networks holds the promise of seamless global connectivity. The efficiency of this connection,however, depends on the availability of reliable channel state information (CSI). Due to the large space-ground propagation delays, the estimated CSI is outdated. In this paper we consider the downlink of a satellite operating as a base station in support of multiple mobile users. The estimated outdated CSI is used at the satellite side to design a transmit precoding (TPC) matrix for the downlink. We propose a deep reinforcement learning(DRL)-based approach to optimize the TPC matrices, with the goal of maximizing the achievable data rate. We utilize the deep deterministic policy gradient (DDPG) algorithm to handle the continuous action space, and we employ state augmentation techniques to deal with the delayed observations and rewards.We show that the DRL agent is capable of exploiting the time-domain correlations of the channels for constructing accurate TPC matrices. This is because the proposed method is capable of compensating for the effects of delayed CSI in different frequency bands. Furthermore, we study the effect of handovers in the system, and show that the DRL agent is capable of promptly adapting to the environment when a handover occurs.

6G, Reinforcement Learning, LEO Satellite Communication, Non-terrestrial Networks, Delayed CSI, Machine Learning

10.1109/TCOMM.2025.3569690

0090-6778

Omid, Yasaman

d316c976-2288-4771-a4db-62a6755a63ef

Aristodemou, Marios

b6187b3e-15fd-4ef2-a788-ac253750126d

Lambotharan, Sangarapillai

9839317e-0bf4-4d7c-8722-87d3ec9086de

Derakhshani, Mahsa

24b67912-a4bd-4b16-bc36-14051f17987f

Hanzo, Lajos

66e7266f-3066-4fc0-8391-e000acce71a1

13 May 2025

Omid, Yasaman

d316c976-2288-4771-a4db-62a6755a63ef

Aristodemou, Marios

b6187b3e-15fd-4ef2-a788-ac253750126d

Lambotharan, Sangarapillai

9839317e-0bf4-4d7c-8722-87d3ec9086de

Derakhshani, Mahsa

24b67912-a4bd-4b16-bc36-14051f17987f

Hanzo, Lajos

66e7266f-3066-4fc0-8391-e000acce71a1

Omid, Yasaman, Aristodemou, Marios, Lambotharan, Sangarapillai, Derakhshani, Mahsa and Hanzo, Lajos (2025) Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems. IEEE Transactions on Communications. (doi:10.1109/TCOMM.2025.3569690).

Record type: Article

Abstract

Text

Final PDF - Accepted Manuscript

Available under License Creative Commons Attribution.

Download (780kB)

More information

Accepted/In Press date: 30 April 2025

e-pub ahead of print date: 13 May 2025

Published date: 13 May 2025

Keywords: 6G, Reinforcement Learning, LEO Satellite Communication, Non-terrestrial Networks, Delayed CSI, Machine Learning

Learn more about Next Generation Wireless research

Identifiers

Local EPrints ID: 501966

URI: http://eprints.soton.ac.uk/id/eprint/501966

DOI: doi:10.1109/TCOMM.2025.3569690

ISSN: 0090-6778

PURE UUID: bed7d590-67e3-4f2d-b186-30a6bf399d29

ORCID for Lajos Hanzo:

orcid.org/0000-0002-2636-5214

Catalogue record

Date deposited: 12 Jun 2025 17:09

Last modified: 04 Sep 2025 01:58

Export record

Altmetrics

Share this record

Share this on Facebook Share this on Twitter Share this on Weibo

Contributors

Author: Yasaman Omid

Author: Marios Aristodemou

Author: Sangarapillai Lambotharan

Author: Mahsa Derakhshani

Author: Lajos Hanzo

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Library staff additional information