Adaptive market making using reinforcement learning in a multi-agent market simulation
Adaptive market making using reinforcement learning in a multi-agent market simulation
Financial markets are amongst the most complex and well-studied multi-agent systems. Over the past several decades, researchers have devoted considerable effort to understanding their structural and behavioural properties. However, building a simulated market environment that can replicate realistic interactive behaviour remains an ongoing challenge. This limitation restricts the scope and validity of experiments aimed at testing new trading strategies or hypotheses, particularly when simplifying assumptions, such as negligible market impact or the absence of market feedback, fail to hold.
This thesis addresses this challenge by building on the Agent-Based Interactive Discrete Event Simulation (ABIDES) simulation platform to create an empirically grounded and reproducible environment for studying cryptocurrency markets. First, we develop a methodology for tuning simulation parameters to replicate the stylised facts observed in Binance following the 2017 cryptocurrency boom. Second, we introduce Price-Reverting Impact Model of a cryptocurrency Exchange (PRIME), a novel configuration of the ABIDES simulator designed to produce realistic market responses to agent actions whilst retaining the ability to track an external price series. Third, we use the PRIME framework to conduct the first controlled comparison of Reinforcement Learning (RL) based market makers across various architectures in a realistic, interactive environment, establishing a benchmark for future research.
Our findings demonstrate that a carefully calibrated agent population comprising Zero Intelligence, Momentum, and Mean-Reversion agents can reproduce key statistical properties observed in the Bitcoin market. Furthermore, the PRIME framework successfully captures market impact behaviours described in existing and novel empirical observations, allowing for controlled but realistic experimentation of market agents in this environment. Finally, our reinforcement learning market-makers deployed on this platform reveal important insights into the trade-offs involved in state-action space design and the learning dynamics of various RL architectures. This setup also allows for a meaningful comparative evaluation of market-makers with previous work. Collectively, these contributions provide a reproducible foundation for advancing market simulation frameworks and for developing and evaluating novel reinforcement learning approaches to the market-making problem.
Market Simulation, market microstructure, market making, multi-agentsystems, reinforcement learning
University of Southampton
Cho, Christopher Jaehoon
de0dcf2d-d858-485c-b62b-0234fbac47a2
11 May 2026
Cho, Christopher Jaehoon
de0dcf2d-d858-485c-b62b-0234fbac47a2
Norman, Tim
663e522f-807c-4569-9201-dc141c8eb50d
Nunes, Manuel
af597793-a85a-463c-9d12-0ae4be7e0a69
Cho, Christopher Jaehoon
(2026)
Adaptive market making using reinforcement learning in a multi-agent market simulation.
University of Southampton, Doctoral Thesis, 196pp.
Record type:
Thesis
(Doctoral)
Abstract
Financial markets are amongst the most complex and well-studied multi-agent systems. Over the past several decades, researchers have devoted considerable effort to understanding their structural and behavioural properties. However, building a simulated market environment that can replicate realistic interactive behaviour remains an ongoing challenge. This limitation restricts the scope and validity of experiments aimed at testing new trading strategies or hypotheses, particularly when simplifying assumptions, such as negligible market impact or the absence of market feedback, fail to hold.
This thesis addresses this challenge by building on the Agent-Based Interactive Discrete Event Simulation (ABIDES) simulation platform to create an empirically grounded and reproducible environment for studying cryptocurrency markets. First, we develop a methodology for tuning simulation parameters to replicate the stylised facts observed in Binance following the 2017 cryptocurrency boom. Second, we introduce Price-Reverting Impact Model of a cryptocurrency Exchange (PRIME), a novel configuration of the ABIDES simulator designed to produce realistic market responses to agent actions whilst retaining the ability to track an external price series. Third, we use the PRIME framework to conduct the first controlled comparison of Reinforcement Learning (RL) based market makers across various architectures in a realistic, interactive environment, establishing a benchmark for future research.
Our findings demonstrate that a carefully calibrated agent population comprising Zero Intelligence, Momentum, and Mean-Reversion agents can reproduce key statistical properties observed in the Bitcoin market. Furthermore, the PRIME framework successfully captures market impact behaviours described in existing and novel empirical observations, allowing for controlled but realistic experimentation of market agents in this environment. Finally, our reinforcement learning market-makers deployed on this platform reveal important insights into the trade-offs involved in state-action space design and the learning dynamics of various RL architectures. This setup also allows for a meaningful comparative evaluation of market-makers with previous work. Collectively, these contributions provide a reproducible foundation for advancing market simulation frameworks and for developing and evaluating novel reinforcement learning approaches to the market-making problem.
Text
Chris_Cho_Doctoral_Thesis
- Version of Record
Text
Final-thesis-submission-Examination-Mr-Christopher-Cho
Restricted to Repository staff only
More information
Published date: 11 May 2026
Keywords:
Market Simulation, market microstructure, market making, multi-agentsystems, reinforcement learning
Identifiers
Local EPrints ID: 511754
URI: http://eprints.soton.ac.uk/id/eprint/511754
PURE UUID: c46b5fbb-d238-4710-9e9f-7cf0984a1b18
Catalogue record
Date deposited: 01 Jun 2026 16:50
Last modified: 02 Jun 2026 02:04
Export record
Altmetrics
Contributors
Author:
Christopher Jaehoon Cho
Thesis advisor:
Manuel Nunes
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics