The University of Southampton
University of Southampton Institutional Repository

On quantifying the role of exogenous macro-economic information in machine learning for modelling financial data

On quantifying the role of exogenous macro-economic information in machine learning for modelling financial data
On quantifying the role of exogenous macro-economic information in machine learning for modelling financial data
Data from the financial markets are a source of challenging inference problems. Machine learning tools are increasingly used for the analysis of financial data. They are observed to provide more accurate models than classical analytical models that depend on specific assumptions. In this work, we ask if the inclusion of external (exogenous) macro-economic information into a model fitting procedure may be useful to improve the quality of analysis and predictions of financial time series. This dissertation explores this case by addressing several problems in empirical finance which are tackled by using a range of machine learning methods with exogenous macro-economic data.

First, we study a non-parametric approach to mapping the price of traded option contracts to the value of the underlying asset and the time to maturity. We explore if additional information would be helpful in improving this mapping. We show that this is the case, and further we show that there is a relationship between volume traded and volatility of an asset that is not apparent in the raw data, but it is seen through their influence on the prices of options. Then, we consider the non-negative matrix factorization (NMF) method and extend it with eXogenous information to specify a new model (XNMF). We present a learning algorithm for it and illustrate its better performance than NMF using equity prices and underlying macroeconomic variables. We show how residual signals arising in time series analysis can be explained by a sparse regression taken over related macroeconomic variables (the Kalman LagLasso model) to help in financial analysis. A comparison between stock index values and Bitcoin using this model illustrates clear underlying differences between them.

Finally, we study a powerful representation learning framework popular in machine learning (VAE) and extend it with inductive exogenous variable. Thus, we created a probabilistic XNMF (PAE-XNMF) that is able to generate financial data, with lower reconstruction error than a probabilistic NMF; and Recurrent Neural Networks, specifically, the Long Short Term Memory model (LSTM). We show that LSTM captures time series dynamics. Then we combined LSTM with attention mechanism to gain more interpretability of the influence of macro-economic data on predicting financial time series.
University of Southampton
Montesdeoca Bermudez, Luis Jairo
15ab4b1d-2c0a-41ff-8893-f06e4a1cac94
Montesdeoca Bermudez, Luis Jairo
15ab4b1d-2c0a-41ff-8893-f06e4a1cac94
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f

Montesdeoca Bermudez, Luis Jairo (2020) On quantifying the role of exogenous macro-economic information in machine learning for modelling financial data. University of Southampton, Doctoral Thesis, 132pp.

Record type: Thesis (Doctoral)

Abstract

Data from the financial markets are a source of challenging inference problems. Machine learning tools are increasingly used for the analysis of financial data. They are observed to provide more accurate models than classical analytical models that depend on specific assumptions. In this work, we ask if the inclusion of external (exogenous) macro-economic information into a model fitting procedure may be useful to improve the quality of analysis and predictions of financial time series. This dissertation explores this case by addressing several problems in empirical finance which are tackled by using a range of machine learning methods with exogenous macro-economic data.

First, we study a non-parametric approach to mapping the price of traded option contracts to the value of the underlying asset and the time to maturity. We explore if additional information would be helpful in improving this mapping. We show that this is the case, and further we show that there is a relationship between volume traded and volatility of an asset that is not apparent in the raw data, but it is seen through their influence on the prices of options. Then, we consider the non-negative matrix factorization (NMF) method and extend it with eXogenous information to specify a new model (XNMF). We present a learning algorithm for it and illustrate its better performance than NMF using equity prices and underlying macroeconomic variables. We show how residual signals arising in time series analysis can be explained by a sparse regression taken over related macroeconomic variables (the Kalman LagLasso model) to help in financial analysis. A comparison between stock index values and Bitcoin using this model illustrates clear underlying differences between them.

Finally, we study a powerful representation learning framework popular in machine learning (VAE) and extend it with inductive exogenous variable. Thus, we created a probabilistic XNMF (PAE-XNMF) that is able to generate financial data, with lower reconstruction error than a probabilistic NMF; and Recurrent Neural Networks, specifically, the Long Short Term Memory model (LSTM). We show that LSTM captures time series dynamics. Then we combined LSTM with attention mechanism to gain more interpretability of the influence of macro-economic data on predicting financial time series.

Text
On Quantifying the Role of Exogenous Macro-Economic Information in Machine Learning for Modelling Financial Data - Version of Record
Available under License University of Southampton Thesis Licence.
Download (8MB)

More information

Published date: February 2020

Identifiers

Local EPrints ID: 442093
URI: http://eprints.soton.ac.uk/id/eprint/442093
PURE UUID: 5097c7e4-7551-47e6-9ea9-194b609eb14d
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 07 Jul 2020 16:49
Last modified: 17 Mar 2024 03:11

Export record

Contributors

Author: Luis Jairo Montesdeoca Bermudez
Thesis advisor: Mahesan Niranjan ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×