The University of Southampton
University of Southampton Institutional Repository

Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems

Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems
Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems
The integration of low earth orbit (LEO) satellites with terrestrial communication networks holds the promise of seamless global connectivity. The efficiency of this connection,however, depends on the availability of reliable channel state information (CSI). Due to the large space-ground propagation delays, the estimated CSI is outdated. In this paper we consider the downlink of a satellite operating as a base station in support of multiple mobile users. The estimated outdated CSI is used at the satellite side to design a transmit precoding (TPC) matrix for the downlink. We propose a deep reinforcement learning(DRL)-based approach to optimize the TPC matrices, with the goal of maximizing the achievable data rate. We utilize the deep deterministic policy gradient (DDPG) algorithm to handle the continuous action space, and we employ state augmentation techniques to deal with the delayed observations and rewards.We show that the DRL agent is capable of exploiting the time-domain correlations of the channels for constructing accurate TPC matrices. This is because the proposed method is capable of compensating for the effects of delayed CSI in different frequency bands. Furthermore, we study the effect of handovers in the system, and show that the DRL agent is capable of promptly adapting to the environment when a handover occurs.
6G, Reinforcement Learning, LEO Satellite Communication, Non-terrestrial Networks, Delayed CSI, Machine Learning
0090-6778
Omid, Yasaman
d316c976-2288-4771-a4db-62a6755a63ef
Aristodemou, Marios
b6187b3e-15fd-4ef2-a788-ac253750126d
Lambotharan, Sangarapillai
9839317e-0bf4-4d7c-8722-87d3ec9086de
Derakhshani, Mahsa
24b67912-a4bd-4b16-bc36-14051f17987f
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1
Omid, Yasaman
d316c976-2288-4771-a4db-62a6755a63ef
Aristodemou, Marios
b6187b3e-15fd-4ef2-a788-ac253750126d
Lambotharan, Sangarapillai
9839317e-0bf4-4d7c-8722-87d3ec9086de
Derakhshani, Mahsa
24b67912-a4bd-4b16-bc36-14051f17987f
Hanzo, Lajos
66e7266f-3066-4fc0-8391-e000acce71a1

Omid, Yasaman, Aristodemou, Marios, Lambotharan, Sangarapillai, Derakhshani, Mahsa and Hanzo, Lajos (2025) Reinforcement learning-based downlink transmit precoding for mitigating the impact of delayed CSI in satellite systems. IEEE Transactions on Communications. (doi:10.1109/TCOMM.2025.3569690).

Record type: Article

Abstract

The integration of low earth orbit (LEO) satellites with terrestrial communication networks holds the promise of seamless global connectivity. The efficiency of this connection,however, depends on the availability of reliable channel state information (CSI). Due to the large space-ground propagation delays, the estimated CSI is outdated. In this paper we consider the downlink of a satellite operating as a base station in support of multiple mobile users. The estimated outdated CSI is used at the satellite side to design a transmit precoding (TPC) matrix for the downlink. We propose a deep reinforcement learning(DRL)-based approach to optimize the TPC matrices, with the goal of maximizing the achievable data rate. We utilize the deep deterministic policy gradient (DDPG) algorithm to handle the continuous action space, and we employ state augmentation techniques to deal with the delayed observations and rewards.We show that the DRL agent is capable of exploiting the time-domain correlations of the channels for constructing accurate TPC matrices. This is because the proposed method is capable of compensating for the effects of delayed CSI in different frequency bands. Furthermore, we study the effect of handovers in the system, and show that the DRL agent is capable of promptly adapting to the environment when a handover occurs.

Text
Final PDF - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (780kB)

More information

Accepted/In Press date: 30 April 2025
e-pub ahead of print date: 13 May 2025
Published date: 13 May 2025
Keywords: 6G, Reinforcement Learning, LEO Satellite Communication, Non-terrestrial Networks, Delayed CSI, Machine Learning

Identifiers

Local EPrints ID: 501966
URI: http://eprints.soton.ac.uk/id/eprint/501966
ISSN: 0090-6778
PURE UUID: bed7d590-67e3-4f2d-b186-30a6bf399d29
ORCID for Lajos Hanzo: ORCID iD orcid.org/0000-0002-2636-5214

Catalogue record

Date deposited: 12 Jun 2025 17:09
Last modified: 04 Sep 2025 01:58

Export record

Altmetrics

Contributors

Author: Yasaman Omid
Author: Marios Aristodemou
Author: Sangarapillai Lambotharan
Author: Mahsa Derakhshani
Author: Lajos Hanzo ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×