Automating the detection of marine sound sources from passive acoustic monitoring (PAM) data
Automating the detection of marine sound sources from passive acoustic monitoring (PAM) data
Passively listening to the marine environment allows us to characterise the combination of acoustic signals present over space and time, collectively known as the soundscape. A regional soundscape provides a reliable indicator of habitat quality, providing an observation of the distribution of vocalising species and monitors human activities which may influence the ecological health of an environment. In this thesis I use the COMPASS array, a suite of ten acoustic monitoring locations in the coastal region of western Scotland, to identify the complexities within the shallow water soundscape over small spatial scales. I describe how the cacophony of heterogenous acoustic sources present shape the temporal, spatial, and spectral patterns of each specific environment across the array, with variability in reported sound level metrics across space and time. To automate the detection and classification of spatial and temporal variations in ecosystem health and species presence from passive acoustic monitoring (PAM) data, algorithms must be capable of discrimination between marine species and the presence of anthropogenic and environmental noise, and be robust to fluctuating ambient noise levels. Within this thesis, I present a ‘small-scale’ convolutional neural network (CNN), trained to detect biological and anthropogenic signals from broadband acoustic data (0 – 48 kHz). I demonstrate that a computationally low-cost deep learning model can be used to mine extensive marine acoustic datasets for more than a single signal type, providing information on temporal scales relevant to the management of marine protected areas and the conservation of vulnerable species. For the CNN to be of use to the wider bioacoustic community, it must be robust within ambient noise fields, which vary to that of the original training set. The adaptability of the CNN to acoustic environments is demonstrated by fine-tuning the model with training sets of differing sizes curated from data collected in the Gulf of Mexico. The network reports an increase in accuracy of 30% after fine-tuning on a small amount of site-specific data (2.5 minutes), increasing by a further 5% with 25 minutes of data. Before CNNs can become commonplace in the bioacoustic toolbox, they must demonstrate performance benefits above that of currently used automated approaches to species monitoring. The final element of this thesis empirically evaluates the performance of the CNN, as a dolphin monitoring tool, in comparison to that of a widely popular data logger, the C-POD. Acoustic recordings (2136 hours) collected at two locations within the COMPASS array were analysed by both the CNN, the C-POD, and also manually to ground-truthed the detection positive hours (dph). The CNN achieves an overall accuracy of 0.84, and a false-positive rate of 0.05, outperforming the C-POD for all analysis periods. The CNN accurately describes seasonal and diel patterns in dolphin presence around each of the moorings, even when sensitivity is hindered by adverse weather periods. This thesis presents to the research community a multi sound source CNN for monitoring marine signals in diverse acoustic environments, which can provide information to management and stakeholders on an appropriate timeframe for conserving vulnerable species and habitats.
University of Southampton
White, Ellen Louise
1f019923-787e-4d89-9069-2f0b1ecf3506
May 2024
White, Ellen Louise
1f019923-787e-4d89-9069-2f0b1ecf3506
White, Paul
2dd2477b-5aa9-42e2-9d19-0806d994eaba
Bull, Jonathan
974037fd-544b-458f-98cc-ce8eca89e3c8
White, Ellen Louise
(2024)
Automating the detection of marine sound sources from passive acoustic monitoring (PAM) data.
University of Southampton, Doctoral Thesis, 184pp.
Record type:
Thesis
(Doctoral)
Abstract
Passively listening to the marine environment allows us to characterise the combination of acoustic signals present over space and time, collectively known as the soundscape. A regional soundscape provides a reliable indicator of habitat quality, providing an observation of the distribution of vocalising species and monitors human activities which may influence the ecological health of an environment. In this thesis I use the COMPASS array, a suite of ten acoustic monitoring locations in the coastal region of western Scotland, to identify the complexities within the shallow water soundscape over small spatial scales. I describe how the cacophony of heterogenous acoustic sources present shape the temporal, spatial, and spectral patterns of each specific environment across the array, with variability in reported sound level metrics across space and time. To automate the detection and classification of spatial and temporal variations in ecosystem health and species presence from passive acoustic monitoring (PAM) data, algorithms must be capable of discrimination between marine species and the presence of anthropogenic and environmental noise, and be robust to fluctuating ambient noise levels. Within this thesis, I present a ‘small-scale’ convolutional neural network (CNN), trained to detect biological and anthropogenic signals from broadband acoustic data (0 – 48 kHz). I demonstrate that a computationally low-cost deep learning model can be used to mine extensive marine acoustic datasets for more than a single signal type, providing information on temporal scales relevant to the management of marine protected areas and the conservation of vulnerable species. For the CNN to be of use to the wider bioacoustic community, it must be robust within ambient noise fields, which vary to that of the original training set. The adaptability of the CNN to acoustic environments is demonstrated by fine-tuning the model with training sets of differing sizes curated from data collected in the Gulf of Mexico. The network reports an increase in accuracy of 30% after fine-tuning on a small amount of site-specific data (2.5 minutes), increasing by a further 5% with 25 minutes of data. Before CNNs can become commonplace in the bioacoustic toolbox, they must demonstrate performance benefits above that of currently used automated approaches to species monitoring. The final element of this thesis empirically evaluates the performance of the CNN, as a dolphin monitoring tool, in comparison to that of a widely popular data logger, the C-POD. Acoustic recordings (2136 hours) collected at two locations within the COMPASS array were analysed by both the CNN, the C-POD, and also manually to ground-truthed the detection positive hours (dph). The CNN achieves an overall accuracy of 0.84, and a false-positive rate of 0.05, outperforming the C-POD for all analysis periods. The CNN accurately describes seasonal and diel patterns in dolphin presence around each of the moorings, even when sensitivity is hindered by adverse weather periods. This thesis presents to the research community a multi sound source CNN for monitoring marine signals in diverse acoustic environments, which can provide information to management and stakeholders on an appropriate timeframe for conserving vulnerable species and habitats.
Text
ELW_Automating_The_Detection_Of_Marine_Sound_Sources_From_PAM_Data
- Version of Record
Text
Final-thesis-submission-Examination-Miss-Ellen-White
Restricted to Repository staff only
More information
Submitted date: 6 December 2023
Published date: May 2024
Identifiers
Local EPrints ID: 490652
URI: http://eprints.soton.ac.uk/id/eprint/490652
PURE UUID: c7628a6b-1407-4f06-9680-67aaf0f0a690
Catalogue record
Date deposited: 31 May 2024 17:04
Last modified: 21 Sep 2024 01:34
Export record
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics