Multi-modal sensing and communication for V2V beam tracking via camera and GPS fusion
Multi-modal sensing and communication for V2V beam tracking via camera and GPS fusion
Beam alignment is a critical challenge in vehicle-to-vehicle (V2V) communications, particularly at millimeter-wave (mmWave) frequencies, where massive multi-input multi-output (mMIMO) systems rely on high directivity to mitigate propagation losses. The highly dynamic nature of V2V scenarios, characterized by short channel coherence times, complicates channel state information (CSI) acquisition and increases the overhead associated with traditional beam training processes. To address these challenges, we propose a transformer-based beam tracking framework that leverages sensory data, including GPS and camera inputs, to enable efficient and proactive beam alignment without extensive training overhead. GPS data is transformed into relative coordinates, while features including relative velocity and orientation are extracted from this data. Moreover, camera images provide complementary contextual information about the surrounding environment, enhancing the accuracy of beam predictions. The model predicts beam states up to 500 ms ahead using predefined beamforming codebooks, facilitating both precise beam selection and effective blockage mitigation. Trained on the DeepSense V2V dataset, our method demonstrates a reduction of up to 7 dB in power loss and a 26 % improvement in top-5 accuracy compared to a linear interpolation baseline, highlighting the potential of GPS and camera fusion for advancing beam alignment in dynamic V2V mmWave networks.
Beam Tracking, mmWave, Multi-Modal, Transformer, V2V
Fabiani, Mattia
425cc952-cb48-4ce7-b5fd-67962c0ab2ad
Silva, Diego
e0263400-7d89-4c41-9507-d19c2de9eb9f
Abdallah, Asmaa
86b80268-48be-4bc8-9577-c989e496e459
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72
4 April 2024
Fabiani, Mattia
425cc952-cb48-4ce7-b5fd-67962c0ab2ad
Silva, Diego
e0263400-7d89-4c41-9507-d19c2de9eb9f
Abdallah, Asmaa
86b80268-48be-4bc8-9577-c989e496e459
Celik, Abdulkadir
f8e72266-763c-4849-b38e-2ea2f50a69d0
Eltawil, Ahmed M.
5eb9e965-5ec8-4da1-baee-c3cab0fb2a72
Fabiani, Mattia, Silva, Diego, Abdallah, Asmaa, Celik, Abdulkadir and Eltawil, Ahmed M.
(2024)
Multi-modal sensing and communication for V2V beam tracking via camera and GPS fusion.
Matthews, Michael B.
(ed.)
In 2024 58th Asilomar Conference on Signals, Systems, and Computers.
IEEE Computer Society.
6 pp
.
(doi:10.1109/IEEECONF60004.2024.10942842).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Beam alignment is a critical challenge in vehicle-to-vehicle (V2V) communications, particularly at millimeter-wave (mmWave) frequencies, where massive multi-input multi-output (mMIMO) systems rely on high directivity to mitigate propagation losses. The highly dynamic nature of V2V scenarios, characterized by short channel coherence times, complicates channel state information (CSI) acquisition and increases the overhead associated with traditional beam training processes. To address these challenges, we propose a transformer-based beam tracking framework that leverages sensory data, including GPS and camera inputs, to enable efficient and proactive beam alignment without extensive training overhead. GPS data is transformed into relative coordinates, while features including relative velocity and orientation are extracted from this data. Moreover, camera images provide complementary contextual information about the surrounding environment, enhancing the accuracy of beam predictions. The model predicts beam states up to 500 ms ahead using predefined beamforming codebooks, facilitating both precise beam selection and effective blockage mitigation. Trained on the DeepSense V2V dataset, our method demonstrates a reduction of up to 7 dB in power loss and a 26 % improvement in top-5 accuracy compared to a linear interpolation baseline, highlighting the potential of GPS and camera fusion for advancing beam alignment in dynamic V2V mmWave networks.
This record has no associated files available for download.
More information
Published date: 4 April 2024
Venue - Dates:
58th Asilomar Conference on Signals, Systems and Computers, ACSSC 2024, , Hybrid, Pacific Grove, United States, 2024-10-27 - 2024-10-30
Keywords:
Beam Tracking, mmWave, Multi-Modal, Transformer, V2V
Identifiers
Local EPrints ID: 505745
URI: http://eprints.soton.ac.uk/id/eprint/505745
ISSN: 1058-6393
PURE UUID: b2b1a56d-5616-4fe5-91e9-00c9143da5af
Catalogue record
Date deposited: 17 Oct 2025 16:35
Last modified: 18 Oct 2025 02:18
Export record
Altmetrics
Contributors
Author:
Mattia Fabiani
Author:
Diego Silva
Author:
Asmaa Abdallah
Author:
Abdulkadir Celik
Author:
Ahmed M. Eltawil
Editor:
Michael B. Matthews
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics