Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems
Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems
The IMT-2030 vision emphasizes two key 6G directions: integrated sensing and communication (ISAC) alongside artificial intelligence (AI)-native frameworks, where multimodal sensory data inputs enhance situational awareness and adaptive decision-making of communication systems. Accordingly, this paper introduces a deep reinforcement learning (DRL)-based beam selection framework for downlink multi-user reconfigurable intelligent surface (RIS)-assisted millimeter-wave (mmWave) massive MIMO (mMIMO) systems. Targeting maximized sum rates under quality of service (QoS) and fairness constraints, the framework employs two primary sensing modalities: a stereo camera mounted on the RIS for user equipment (UE) detection and inertial measurement units (IMUs) on UEs to obtain 3D Cartesian coordinates, thus eliminating the need for channel state information (CSI) acquisition. The DRL framework combines two algorithms - double deep Q-network (DDQN) and proximal policy optimization (PPO) - to jointly optimize RIS phase shifts and UE receive beamformers through predefined codebooks and adaptive beam selection. A testbed was developed to validate the system, leveraging real-world data to train the DRL algorithms. Experimental results demonstrate that both agents achieve nearoptimal sum rates across diverse base station (BS) transmit power levels and QoS thresholds while reducing computational complexity by 95%, illustrating the framework's potential for efficient and scalable beam alignment for AI-native wireless systems.
Kanaan, Khalid
c6b34d0c-f606-4554-bc07-0a075c1900c3
Nasser, Ahmed
ea30427f-fe1a-4c76-bcab-e0bd945a3cc5
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Wang, Ruiqi
b9394ebb-87fe-4770-b007-8851427bef69
Shamim, Atif
aa0feb68-da4b-48f4-b140-7b18975a5760
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72
12 December 2025
Kanaan, Khalid
c6b34d0c-f606-4554-bc07-0a075c1900c3
Nasser, Ahmed
ea30427f-fe1a-4c76-bcab-e0bd945a3cc5
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Wang, Ruiqi
b9394ebb-87fe-4770-b007-8851427bef69
Shamim, Atif
aa0feb68-da4b-48f4-b140-7b18975a5760
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72
Kanaan, Khalid, Nasser, Ahmed, Celik, Abdulkadir, Wang, Ruiqi, Shamim, Atif and Eltawil, Ahmed M.
(2025)
Multimodal sensing and DRL-driven beam selection in RIS-aided mmWave mMIMO systems.
In 2025 IEEE 36th International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2025.
IEEE.
6 pp
.
(doi:10.1109/PIMRC62392.2025.11274583).
Record type:
Conference or Workshop Item
(Paper)
Abstract
The IMT-2030 vision emphasizes two key 6G directions: integrated sensing and communication (ISAC) alongside artificial intelligence (AI)-native frameworks, where multimodal sensory data inputs enhance situational awareness and adaptive decision-making of communication systems. Accordingly, this paper introduces a deep reinforcement learning (DRL)-based beam selection framework for downlink multi-user reconfigurable intelligent surface (RIS)-assisted millimeter-wave (mmWave) massive MIMO (mMIMO) systems. Targeting maximized sum rates under quality of service (QoS) and fairness constraints, the framework employs two primary sensing modalities: a stereo camera mounted on the RIS for user equipment (UE) detection and inertial measurement units (IMUs) on UEs to obtain 3D Cartesian coordinates, thus eliminating the need for channel state information (CSI) acquisition. The DRL framework combines two algorithms - double deep Q-network (DDQN) and proximal policy optimization (PPO) - to jointly optimize RIS phase shifts and UE receive beamformers through predefined codebooks and adaptive beam selection. A testbed was developed to validate the system, leveraging real-world data to train the DRL algorithms. Experimental results demonstrate that both agents achieve nearoptimal sum rates across diverse base station (BS) transmit power levels and QoS thresholds while reducing computational complexity by 95%, illustrating the framework's potential for efficient and scalable beam alignment for AI-native wireless systems.
This record has no associated files available for download.
More information
Published date: 12 December 2025
Venue - Dates:
36th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2025, , Istanbul, Turkey, 2025-09-01 - 2025-09-04
Identifiers
Local EPrints ID: 510541
URI: http://eprints.soton.ac.uk/id/eprint/510541
ISSN: 2166-9570
PURE UUID: 1179b98d-66bf-432e-8641-e605a0eea07f
Catalogue record
Date deposited: 13 Apr 2026 16:41
Last modified: 14 Apr 2026 02:17
Export record
Altmetrics
Contributors
Author:
Khalid Kanaan
Author:
Ahmed Nasser
Author:
Abdulkadir Celik
Author:
Ruiqi Wang
Author:
Atif Shamim
Author:
Ahmed M. Eltawil
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics