The University of Southampton
University of Southampton Institutional Repository

Multi-agent DRL for RIS partitioning, beam selection, and power control in MIMO-NOMA system

Multi-agent DRL for RIS partitioning, beam selection, and power control in MIMO-NOMA system
Multi-agent DRL for RIS partitioning, beam selection, and power control in MIMO-NOMA system

Reconfigurable intelligent surface (RIS) partitioning offers a strategic solution for serving multiple users equipment (UEs) simultaneously in blockage-prone wireless environments, leveraging multiple-input multiple-output (MIMO) and non-orthogonal multiple access (NOMA) technologies within the millimeter-wave (mmWave) spectrum. However, deploying RIS partitioning (RISP) in MIMO-NOMA is hindered by the exacerbated computational complexity involved in acquiring accurate channel state information (CSI). This paper proposes a novel learning framework based on multi-agent deep reinforcement learning (MA-DRL) that maximizes the sum rate of the RISP-aided MIMO-NOMA system without requiring UEs’ CSI. The proposed framework jointly optimizes RIS phase shifts, number of RIS partitions, UEs’ beamformers, and NOMA power allocation (PA), transforming the challenge into a high-dimensional combinatorial optimization problem. The MA-DRL algorithm integrates double deep Q networks (DDQN) and deep deterministic policy gradient (DDPG) agents, where each UE acts as a DDQN agent optimizing its beamformer, while the RIS serves as a DDPG agent handling partitioning and power control. An experimental testbed is developed to gather real-world data for training and evaluation. Results show that the MA-DRL algorithm closely approaches optimal performance, trailing the exhaustive search by only 8%, while reducing complexity by 95% and improving the sum rate by an average of 18% compared to traditional full RIS setups.

beam selection, Deep reinforcement learning (DRL), multiple-input multiple-output (MIMO), non-orthogonal multiple access (NOMA), reconfigurable intelligent surface (RIS), RIS partitioning (RISP)
2644-125X
9073-9089
Nasser, Ahmed
ea30427f-fe1a-4c76-bcab-e0bd945a3cc5
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72
Nasser, Ahmed
ea30427f-fe1a-4c76-bcab-e0bd945a3cc5
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72

Nasser, Ahmed, Celik, Abdulkadir and Eltawil, Ahmed M. (2025) Multi-agent DRL for RIS partitioning, beam selection, and power control in MIMO-NOMA system. IEEE Open Journal of the Communications Society, 6, 9073-9089. (doi:10.1109/OJCOMS.2025.3624260).

Record type: Article

Abstract

Reconfigurable intelligent surface (RIS) partitioning offers a strategic solution for serving multiple users equipment (UEs) simultaneously in blockage-prone wireless environments, leveraging multiple-input multiple-output (MIMO) and non-orthogonal multiple access (NOMA) technologies within the millimeter-wave (mmWave) spectrum. However, deploying RIS partitioning (RISP) in MIMO-NOMA is hindered by the exacerbated computational complexity involved in acquiring accurate channel state information (CSI). This paper proposes a novel learning framework based on multi-agent deep reinforcement learning (MA-DRL) that maximizes the sum rate of the RISP-aided MIMO-NOMA system without requiring UEs’ CSI. The proposed framework jointly optimizes RIS phase shifts, number of RIS partitions, UEs’ beamformers, and NOMA power allocation (PA), transforming the challenge into a high-dimensional combinatorial optimization problem. The MA-DRL algorithm integrates double deep Q networks (DDQN) and deep deterministic policy gradient (DDPG) agents, where each UE acts as a DDQN agent optimizing its beamformer, while the RIS serves as a DDPG agent handling partitioning and power control. An experimental testbed is developed to gather real-world data for training and evaluation. Results show that the MA-DRL algorithm closely approaches optimal performance, trailing the exhaustive search by only 8%, while reducing complexity by 95% and improving the sum rate by an average of 18% compared to traditional full RIS setups.

This record has no associated files available for download.

More information

Accepted/In Press date: 18 October 2025
e-pub ahead of print date: 27 October 2025
Keywords: beam selection, Deep reinforcement learning (DRL), multiple-input multiple-output (MIMO), non-orthogonal multiple access (NOMA), reconfigurable intelligent surface (RIS), RIS partitioning (RISP)

Identifiers

Local EPrints ID: 507375
URI: http://eprints.soton.ac.uk/id/eprint/507375
ISSN: 2644-125X
PURE UUID: e2de16d2-fdd9-4cef-8e7a-b824ba2d3466
ORCID for Abdulkadir Celik: ORCID iD orcid.org/0000-0001-9007-9979

Catalogue record

Date deposited: 08 Dec 2025 17:34
Last modified: 09 Dec 2025 03:20

Export record

Altmetrics

Contributors

Author: Ahmed Nasser
Author: Abdulkadir Celik ORCID iD
Author: Ahmed M. Eltawil

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×