A data-driven study of variability in X-ray binaries using machine learning
A data-driven study of variability in X-ray binaries using machine learning
The field of time series data mining has become increasingly important in light of the impending era of “Big Data”. With new astronomical sky surveys set to generate data at an unprecedented rate, the need for automated methods of data analysis is becoming increasingly urgent. This thesis makes a contribution to the toolbox of automated characterisation methods for astronomical time series, specifically light curves. Two end-to-end pipelines drawing on concepts from the fields of signal processing, data science, machine learning, and deep learning are presented, with examples showing how they can streamline the process of light curve analysis. The first pipeline is designed to extract and aggregate features from light curve segments, using a neural network and clustering algorithms to combine them into feature vectors of fixed length. These “light curve fingerprints” can be used as input for downstream machine learning algorithms, providing a way to tackle the problem of constructing a standard representation for light curves of variable length. I demonstrate the proposed method on the X-ray data of the Galactic low-mass black hole X-ray binary system GRS 1915+105, showcasing how it can be used to study the variability of the source on time scales ranging from seconds to hours, and to quantify the similarity of different light curves. The outcomes highlight the problem of the popular classification system of GRS 1915+105 observations, which does not account for intermediate class behaviour. I also explore the extension to another source, and outline the direction of further development. The fourth chapter of the thesis presents a periodicity detection and characterisation pipeline, which utilises the wavelet transform for timing-frequency analysis. The pipeline is applied to a sample of photometric light curves of Small Magellanic Cloud Be X-ray binaries, demonstrating its ability to facilitate analysis of the sources’ behaviour over time scales ranging from days to decades. Using this methodology, I detect several periodic signals for the first time, and provide the interpretation of their physical origin. The pipeline is a useful tool in the analysis of the binary orbital behaviour of the sources, the evolution of Be star non-radial pulsations, and their possible link with the formation mechanism of the circumstellar decretion discs.
University of Southampton
Orwat-Kapola, Jakub Kacper
6c9db0de-ab29-414b-9e08-44c43e658f7d
February 2024
Orwat-Kapola, Jakub Kacper
6c9db0de-ab29-414b-9e08-44c43e658f7d
Bird, Tony
045ee141-4720-46fd-a412-5aa848a91b32
Hill, Adam
b1007941-b5b1-47cd-8476-7c6b9c57f347
Altamirano, Diego
d5ccdb09-0b71-4303-9538-05b467be075b
Orwat-Kapola, Jakub Kacper
(2024)
A data-driven study of variability in X-ray binaries using machine learning.
University of Southampton, Doctoral Thesis, 215pp.
Record type:
Thesis
(Doctoral)
Abstract
The field of time series data mining has become increasingly important in light of the impending era of “Big Data”. With new astronomical sky surveys set to generate data at an unprecedented rate, the need for automated methods of data analysis is becoming increasingly urgent. This thesis makes a contribution to the toolbox of automated characterisation methods for astronomical time series, specifically light curves. Two end-to-end pipelines drawing on concepts from the fields of signal processing, data science, machine learning, and deep learning are presented, with examples showing how they can streamline the process of light curve analysis. The first pipeline is designed to extract and aggregate features from light curve segments, using a neural network and clustering algorithms to combine them into feature vectors of fixed length. These “light curve fingerprints” can be used as input for downstream machine learning algorithms, providing a way to tackle the problem of constructing a standard representation for light curves of variable length. I demonstrate the proposed method on the X-ray data of the Galactic low-mass black hole X-ray binary system GRS 1915+105, showcasing how it can be used to study the variability of the source on time scales ranging from seconds to hours, and to quantify the similarity of different light curves. The outcomes highlight the problem of the popular classification system of GRS 1915+105 observations, which does not account for intermediate class behaviour. I also explore the extension to another source, and outline the direction of further development. The fourth chapter of the thesis presents a periodicity detection and characterisation pipeline, which utilises the wavelet transform for timing-frequency analysis. The pipeline is applied to a sample of photometric light curves of Small Magellanic Cloud Be X-ray binaries, demonstrating its ability to facilitate analysis of the sources’ behaviour over time scales ranging from days to decades. Using this methodology, I detect several periodic signals for the first time, and provide the interpretation of their physical origin. The pipeline is a useful tool in the analysis of the binary orbital behaviour of the sources, the evolution of Be star non-radial pulsations, and their possible link with the formation mechanism of the circumstellar decretion discs.
Text
Thesis_Jakub_Orwat-Kapola
- Version of Record
Text
Final-thesis-submission-Examination-Mr-Jakub-Orwat-Kapola
Restricted to Repository staff only
More information
Submitted date: January 2024
Published date: February 2024
Identifiers
Local EPrints ID: 486973
URI: http://eprints.soton.ac.uk/id/eprint/486973
PURE UUID: 06d2002f-0ef1-4029-bec1-a8e91579fdb8
Catalogue record
Date deposited: 09 Feb 2024 17:31
Last modified: 17 Apr 2024 01:54
Export record
Contributors
Author:
Jakub Kacper Orwat-Kapola
Thesis advisor:
Adam Hill
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics