The University of Southampton
University of Southampton Institutional Repository

Adaptive market making using reinforcement learning in a multi-agent market simulation

Adaptive market making using reinforcement learning in a multi-agent market simulation
Adaptive market making using reinforcement learning in a multi-agent market simulation
Financial markets are amongst the most complex and well-studied multi-agent systems. Over the past several decades, researchers have devoted considerable effort to understanding their structural and behavioural properties. However, building a simulated market environment that can replicate realistic interactive behaviour remains an ongoing challenge. This limitation restricts the scope and validity of experiments aimed at testing new trading strategies or hypotheses, particularly when simplifying assumptions, such as negligible market impact or the absence of market feedback, fail to hold.

This thesis addresses this challenge by building on the Agent-Based Interactive Discrete Event Simulation (ABIDES) simulation platform to create an empirically grounded and reproducible environment for studying cryptocurrency markets. First, we develop a methodology for tuning simulation parameters to replicate the stylised facts observed in Binance following the 2017 cryptocurrency boom. Second, we introduce Price-Reverting Impact Model of a cryptocurrency Exchange (PRIME), a novel configuration of the ABIDES simulator designed to produce realistic market responses to agent actions whilst retaining the ability to track an external price series. Third, we use the PRIME framework to conduct the first controlled comparison of Reinforcement Learning (RL) based market makers across various architectures in a realistic, interactive environment, establishing a benchmark for future research.

Our findings demonstrate that a carefully calibrated agent population comprising Zero Intelligence, Momentum, and Mean-Reversion agents can reproduce key statistical properties observed in the Bitcoin market. Furthermore, the PRIME framework successfully captures market impact behaviours described in existing and novel empirical observations, allowing for controlled but realistic experimentation of market agents in this environment. Finally, our reinforcement learning market-makers deployed on this platform reveal important insights into the trade-offs involved in state-action space design and the learning dynamics of various RL architectures. This setup also allows for a meaningful comparative evaluation of market-makers with previous work. Collectively, these contributions provide a reproducible foundation for advancing market simulation frameworks and for developing and evaluating novel reinforcement learning approaches to the market-making problem.
Market Simulation, market microstructure, market making, multi-agentsystems, reinforcement learning
University of Southampton
Cho, Christopher Jaehoon
de0dcf2d-d858-485c-b62b-0234fbac47a2
Cho, Christopher Jaehoon
de0dcf2d-d858-485c-b62b-0234fbac47a2
Norman, Tim
663e522f-807c-4569-9201-dc141c8eb50d
Nunes, Manuel
af597793-a85a-463c-9d12-0ae4be7e0a69

Cho, Christopher Jaehoon (2026) Adaptive market making using reinforcement learning in a multi-agent market simulation. University of Southampton, Doctoral Thesis, 196pp.

Record type: Thesis (Doctoral)

Abstract

Financial markets are amongst the most complex and well-studied multi-agent systems. Over the past several decades, researchers have devoted considerable effort to understanding their structural and behavioural properties. However, building a simulated market environment that can replicate realistic interactive behaviour remains an ongoing challenge. This limitation restricts the scope and validity of experiments aimed at testing new trading strategies or hypotheses, particularly when simplifying assumptions, such as negligible market impact or the absence of market feedback, fail to hold.

This thesis addresses this challenge by building on the Agent-Based Interactive Discrete Event Simulation (ABIDES) simulation platform to create an empirically grounded and reproducible environment for studying cryptocurrency markets. First, we develop a methodology for tuning simulation parameters to replicate the stylised facts observed in Binance following the 2017 cryptocurrency boom. Second, we introduce Price-Reverting Impact Model of a cryptocurrency Exchange (PRIME), a novel configuration of the ABIDES simulator designed to produce realistic market responses to agent actions whilst retaining the ability to track an external price series. Third, we use the PRIME framework to conduct the first controlled comparison of Reinforcement Learning (RL) based market makers across various architectures in a realistic, interactive environment, establishing a benchmark for future research.

Our findings demonstrate that a carefully calibrated agent population comprising Zero Intelligence, Momentum, and Mean-Reversion agents can reproduce key statistical properties observed in the Bitcoin market. Furthermore, the PRIME framework successfully captures market impact behaviours described in existing and novel empirical observations, allowing for controlled but realistic experimentation of market agents in this environment. Finally, our reinforcement learning market-makers deployed on this platform reveal important insights into the trade-offs involved in state-action space design and the learning dynamics of various RL architectures. This setup also allows for a meaningful comparative evaluation of market-makers with previous work. Collectively, these contributions provide a reproducible foundation for advancing market simulation frameworks and for developing and evaluating novel reinforcement learning approaches to the market-making problem.

Text
Chris_Cho_Doctoral_Thesis - Version of Record
Available under License University of Southampton Thesis Licence.
Download (6MB)
Text
Final-thesis-submission-Examination-Mr-Christopher-Cho
Restricted to Repository staff only

More information

Published date: 11 May 2026
Keywords: Market Simulation, market microstructure, market making, multi-agentsystems, reinforcement learning

Identifiers

Local EPrints ID: 511754
URI: http://eprints.soton.ac.uk/id/eprint/511754
PURE UUID: c46b5fbb-d238-4710-9e9f-7cf0984a1b18
ORCID for Christopher Jaehoon Cho: ORCID iD orcid.org/0000-0003-2805-0352
ORCID for Tim Norman: ORCID iD orcid.org/0000-0002-6387-4034
ORCID for Manuel Nunes: ORCID iD orcid.org/0000-0002-7116-5502

Catalogue record

Date deposited: 01 Jun 2026 16:50
Last modified: 02 Jun 2026 02:04

Export record

Altmetrics

Contributors

Author: Christopher Jaehoon Cho ORCID iD
Thesis advisor: Tim Norman ORCID iD
Thesis advisor: Manuel Nunes ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×