The University of Southampton
University of Southampton Institutional Repository

Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems

Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems
Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems

The IMT-2030 vision emphasizes two key 6G directions: integrated sensing and communication (ISAC) alongside artificial intelligence (AI)-native frameworks, where multimodal sensory data inputs enhance situational awareness and adaptive decision-making of communication systems. Accordingly, this paper introduces a deep reinforcement learning (DRL)-based beam selection framework for downlink multi-user reconfigurable intelligent surface (RIS)-assisted millimeter-wave (mmWave) massive MIMO (mMIMO) systems. Targeting maximized sum rates under quality of service (QoS) and fairness constraints, the framework employs two primary sensing modalities: a stereo camera mounted on the RIS for user equipment (UE) detection and inertial measurement units (IMUs) on UEs to obtain 3D Cartesian coordinates, thus eliminating the need for channel state information (CSI) acquisition. The DRL framework combines two algorithms - double deep Q-network (DDQN) and proximal policy optimization (PPO) - to jointly optimize RIS phase shifts and UE receive beamformers through predefined codebooks and adaptive beam selection. A testbed was developed to validate the system, leveraging real-world data to train the DRL algorithms. Experimental results demonstrate that both agents achieve nearoptimal sum rates across diverse base station (BS) transmit power levels and QoS thresholds while reducing computational complexity by 95%, illustrating the framework's potential for efficient and scalable beam alignment for AI-native wireless systems.

2166-9570
IEEE
Kanaan, Khalid
c6b34d0c-f606-4554-bc07-0a075c1900c3
Nasser, Ahmed
ea30427f-fe1a-4c76-bcab-e0bd945a3cc5
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Wang, Ruiqi
b9394ebb-87fe-4770-b007-8851427bef69
Shamim, Atif
aa0feb68-da4b-48f4-b140-7b18975a5760
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72
Kanaan, Khalid
c6b34d0c-f606-4554-bc07-0a075c1900c3
Nasser, Ahmed
ea30427f-fe1a-4c76-bcab-e0bd945a3cc5
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Wang, Ruiqi
b9394ebb-87fe-4770-b007-8851427bef69
Shamim, Atif
aa0feb68-da4b-48f4-b140-7b18975a5760
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72

Kanaan, Khalid, Nasser, Ahmed, Celik, Abdulkadir, Wang, Ruiqi, Shamim, Atif and Eltawil, Ahmed M. (2025) Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems. In 2025 IEEE 36th International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2025. IEEE. 6 pp . (doi:10.1109/PIMRC62392.2025.11274583).

Record type: Conference or Workshop Item (Paper)

Abstract

The IMT-2030 vision emphasizes two key 6G directions: integrated sensing and communication (ISAC) alongside artificial intelligence (AI)-native frameworks, where multimodal sensory data inputs enhance situational awareness and adaptive decision-making of communication systems. Accordingly, this paper introduces a deep reinforcement learning (DRL)-based beam selection framework for downlink multi-user reconfigurable intelligent surface (RIS)-assisted millimeter-wave (mmWave) massive MIMO (mMIMO) systems. Targeting maximized sum rates under quality of service (QoS) and fairness constraints, the framework employs two primary sensing modalities: a stereo camera mounted on the RIS for user equipment (UE) detection and inertial measurement units (IMUs) on UEs to obtain 3D Cartesian coordinates, thus eliminating the need for channel state information (CSI) acquisition. The DRL framework combines two algorithms - double deep Q-network (DDQN) and proximal policy optimization (PPO) - to jointly optimize RIS phase shifts and UE receive beamformers through predefined codebooks and adaptive beam selection. A testbed was developed to validate the system, leveraging real-world data to train the DRL algorithms. Experimental results demonstrate that both agents achieve nearoptimal sum rates across diverse base station (BS) transmit power levels and QoS thresholds while reducing computational complexity by 95%, illustrating the framework's potential for efficient and scalable beam alignment for AI-native wireless systems.

This record has no associated files available for download.

More information

Published date: 12 December 2025
Venue - Dates: 36th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2025, , Istanbul, Turkey, 2025-09-01 - 2025-09-04

Identifiers

Local EPrints ID: 510541
URI: http://eprints.soton.ac.uk/id/eprint/510541
ISSN: 2166-9570
PURE UUID: 1179b98d-66bf-432e-8641-e605a0eea07f
ORCID for Abdulkadir Celik: ORCID iD orcid.org/0000-0001-9007-9979

Catalogue record

Date deposited: 13 Apr 2026 16:41
Last modified: 14 Apr 2026 02:17

Export record

Altmetrics

Contributors

Author: Khalid Kanaan
Author: Ahmed Nasser
Author: Abdulkadir Celik ORCID iD
Author: Ruiqi Wang
Author: Atif Shamim
Author: Ahmed M. Eltawil

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×