The University of Southampton
University of Southampton Institutional Repository

In-memory computing based ultra-efficient massive MIMO precoding: memristor crossbar circuits, conductance mapping strategies, and programming latency estimation

In-memory computing based ultra-efficient massive MIMO precoding: memristor crossbar circuits, conductance mapping strategies, and programming latency estimation
In-memory computing based ultra-efficient massive MIMO precoding: memristor crossbar circuits, conductance mapping strategies, and programming latency estimation
Matrix inversion in massive multiple-input multiple-output (MIMO) precoding imposes significant burden on the energy efficiency and processing latency of baseband units. In this paper, we first propose a memristor crossbar based in-memory computing circuit capable of supporting both zero-forcing (ZF) and minimum mean square error (MMSE) precoding. The circuit features a reduced matrix size and enables faster one-step computation without the need for timing control. Secondly, to address the computational inaccuracy caused by the limited conductance range of memristors, we develop an optimized matrix-to-conductance mapping scheme that jointly considers device physical constraints and matrix statistics, achieving over 60% reduction in relative computation error compared with baseline scheme. An associated lightweight circuit enhancement ensures compatibility with practical crossbar architectures, without incurring significant hardware overhead. Thirdly, we establish a memristor programming time model grounded in device-level potentiation and depression dynamics. The analysis yields closed-form expressions for the expected programming time and its upper bound, and is further validated through Monte Carlo simulations, enabling accurate estimation of the system throughput. Simulation results demonstrate that the proposed circuit achieves a bit error rate comparable to that of 64-bit floating-point precoding, while delivering over 100× improvement in both energy and area efficiency compared with the NVIDIA RTX A2000 graphics processing unit (GPU).
2156-3357
Zhang, Yu-Xin
59da9ab3-3939-4990-9680-e80d1935f2b4
Yang, Shaoshi
23650ec4-bcc8-4a2c-b1e7-a30893f52e52
Ren, Yi-Hang
14d08868-c48d-4a71-a413-121fdb52b9b3
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
Zhang, Ping
2def4374-679d-41d1-bf3a-483028a73275
Zhang, Yu-Xin
59da9ab3-3939-4990-9680-e80d1935f2b4
Yang, Shaoshi
23650ec4-bcc8-4a2c-b1e7-a30893f52e52
Ren, Yi-Hang
14d08868-c48d-4a71-a413-121fdb52b9b3
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
Zhang, Ping
2def4374-679d-41d1-bf3a-483028a73275

Zhang, Yu-Xin, Yang, Shaoshi, Ren, Yi-Hang, Chen, Sheng and Zhang, Ping (2025) In-memory computing based ultra-efficient massive MIMO precoding: memristor crossbar circuits, conductance mapping strategies, and programming latency estimation. IEEE Journal on Emerging and Selected Topics in Circuits and Systems. (doi:10.1109/JETCAS.2025.3648808).

Record type: Article

Abstract

Matrix inversion in massive multiple-input multiple-output (MIMO) precoding imposes significant burden on the energy efficiency and processing latency of baseband units. In this paper, we first propose a memristor crossbar based in-memory computing circuit capable of supporting both zero-forcing (ZF) and minimum mean square error (MMSE) precoding. The circuit features a reduced matrix size and enables faster one-step computation without the need for timing control. Secondly, to address the computational inaccuracy caused by the limited conductance range of memristors, we develop an optimized matrix-to-conductance mapping scheme that jointly considers device physical constraints and matrix statistics, achieving over 60% reduction in relative computation error compared with baseline scheme. An associated lightweight circuit enhancement ensures compatibility with practical crossbar architectures, without incurring significant hardware overhead. Thirdly, we establish a memristor programming time model grounded in device-level potentiation and depression dynamics. The analysis yields closed-form expressions for the expected programming time and its upper bound, and is further validated through Monte Carlo simulations, enabling accurate estimation of the system throughput. Simulation results demonstrate that the proposed circuit achieves a bit error rate comparable to that of 64-bit floating-point precoding, while delivering over 100× improvement in both energy and area efficiency compared with the NVIDIA RTX A2000 graphics processing unit (GPU).

Text
precoding_zhangyuxin-1 - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (36MB)

More information

e-pub ahead of print date: 29 December 2025

Identifiers

Local EPrints ID: 508982
URI: http://eprints.soton.ac.uk/id/eprint/508982
ISSN: 2156-3357
PURE UUID: cd065a6a-f117-4419-b459-7859a5983cd4

Catalogue record

Date deposited: 09 Feb 2026 17:46
Last modified: 09 Feb 2026 17:46

Export record

Altmetrics

Contributors

Author: Yu-Xin Zhang
Author: Shaoshi Yang
Author: Yi-Hang Ren
Author: Sheng Chen
Author: Ping Zhang

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×