The University of Southampton
University of Southampton Institutional Repository

High-dimensional multi-period portfolio allocation using deep reinforcement learning

High-dimensional multi-period portfolio allocation using deep reinforcement learning
High-dimensional multi-period portfolio allocation using deep reinforcement learning
This paper proposes a novel investment strategy based on deep reinforcement learning (DRL) for long-term portfolio allocation in the presence of transaction costs and risk aversion. We design an advanced portfolio policy framework to model the dynamics of asset prices using convolutional neural networks (CNN), capture group-wise asset dependence using novel WaveNet methods, and solve the optimal asset allocation problem using DRL. These methods are embedded within a multi-period Bellman equation framework. Portfolio performance is tested empirically over different holding periods, risk aversion levels, transaction cost rates, and financial indices. Importantly, the results demonstrate the effectiveness and superiority of the proposed long-term portfolio allocation strategy compared to several competitors based on machine learning methods and under traditional optimization techniques.
Deep reinforcement learning, High-dimensional portfolios, Multi-period portfolio selection, Portfolio constraints, Risk aversion
1059-0560
Jiang, Yifu
cff1ccf8-1299-45de-95ec-f449f30fa0b8
Olmo, Jose
706f68c8-f991-4959-8245-6657a591056e
Atwi, Majed
a713c2fd-6b12-412d-9065-8a72ae788ad7
Jiang, Yifu
cff1ccf8-1299-45de-95ec-f449f30fa0b8
Olmo, Jose
706f68c8-f991-4959-8245-6657a591056e
Atwi, Majed
a713c2fd-6b12-412d-9065-8a72ae788ad7

Jiang, Yifu, Olmo, Jose and Atwi, Majed (2025) High-dimensional multi-period portfolio allocation using deep reinforcement learning. International Review of Economics and Finance, 98, [103996]. (doi:10.1016/j.iref.2025.103996).

Record type: Article

Abstract

This paper proposes a novel investment strategy based on deep reinforcement learning (DRL) for long-term portfolio allocation in the presence of transaction costs and risk aversion. We design an advanced portfolio policy framework to model the dynamics of asset prices using convolutional neural networks (CNN), capture group-wise asset dependence using novel WaveNet methods, and solve the optimal asset allocation problem using DRL. These methods are embedded within a multi-period Bellman equation framework. Portfolio performance is tested empirically over different holding periods, risk aversion levels, transaction cost rates, and financial indices. Importantly, the results demonstrate the effectiveness and superiority of the proposed long-term portfolio allocation strategy compared to several competitors based on machine learning methods and under traditional optimization techniques.

Text
Final manuscript Accepted - Preprint - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (4MB)
Text
1-s2.0-S1059056025001595-main - Version of Record
Available under License Creative Commons Attribution.
Download (8MB)

More information

Accepted/In Press date: 26 February 2025
e-pub ahead of print date: 27 February 2025
Published date: 2 March 2025
Keywords: Deep reinforcement learning, High-dimensional portfolios, Multi-period portfolio selection, Portfolio constraints, Risk aversion

Identifiers

Local EPrints ID: 502201
URI: http://eprints.soton.ac.uk/id/eprint/502201
ISSN: 1059-0560
PURE UUID: e50fb176-a0c5-48b2-b267-1777e3740192
ORCID for Jose Olmo: ORCID iD orcid.org/0000-0002-0437-7812

Catalogue record

Date deposited: 18 Jun 2025 16:36
Last modified: 22 Aug 2025 02:08

Export record

Altmetrics

Contributors

Author: Yifu Jiang
Author: Jose Olmo ORCID iD
Author: Majed Atwi

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×