The University of Southampton
University of Southampton Institutional Repository

One size fits all? Adaptation of trained CNNs to new marine acoustic environments

One size fits all? Adaptation of trained CNNs to new marine acoustic environments
One size fits all? Adaptation of trained CNNs to new marine acoustic environments
Convolutional neural networks (CNNs) have the potential to enable a revolution in bioacoustics, allowing robust detection and classification of marine sound sources. As global Passive Acoustic Monitoring (PAM) datasets continue to expand it is critical we improve our confidence in the performance of models across different marine environments, if we are to exploit the full ecological value of information within the data. This work demonstrates the transferability of developed CNN models to new acoustic environments by using a pre-trained model developed for one location (West of Scotland, UK) and deploying it in a distinctly different soundscape (Gulf of Mexico, USA). In this work transfer learning is used to fine-tune an existing open-source ‘small-scale’ CNN, which detects odontocete tonal and broadband call types and vessel noise (operating between 0 and 48 kHz). The CNN is fine-tuned on training sets of differing sizes, from the unseen site, to understand the adaptability of a network to new marine acoustic environments. Fine-tuning with a small sample of site-specific data significantly improves the performance of the CNN in the new environment, across all classes. We demonstrate an improved performance in area-under-curve (AUC) score of 0.30, across four classes by fine-training with only 50 spectrograms per class, with a 5% improvement in accuracy between 50 frames and 500 frames. This work shows that only a small amount of site-specific data is needed to retrain a CNN, enabling researchers to harness the power of existing pre-trained models for their own datasets. The marine bioacoustic domain will benefit from a larger pool of global data for training large deep learning models, but we illustrate in this work that domain adaptation can be improved with limited site-specific exemplars.
Bioacoustics, Deep learning, Domain adaptation, Marine Mammal Detection, Marine acoustics, Soundscapes, Marine mammal detection
1574-9541
White, Ellen L.
1f019923-787e-4d89-9069-2f0b1ecf3506
Klinck, Holger
34ec2259-8bec-4264-bfcc-f5ea96126d2f
Bull, Jonathan M.
974037fd-544b-458f-98cc-ce8eca89e3c8
White, Paul R.
2dd2477b-5aa9-42e2-9d19-0806d994eaba
Risch, Denise
5b687ac5-831b-4640-bb88-5f03a27a3a27
White, Ellen L.
1f019923-787e-4d89-9069-2f0b1ecf3506
Klinck, Holger
34ec2259-8bec-4264-bfcc-f5ea96126d2f
Bull, Jonathan M.
974037fd-544b-458f-98cc-ce8eca89e3c8
White, Paul R.
2dd2477b-5aa9-42e2-9d19-0806d994eaba
Risch, Denise
5b687ac5-831b-4640-bb88-5f03a27a3a27

White, Ellen L., Klinck, Holger, Bull, Jonathan M., White, Paul R. and Risch, Denise (2023) One size fits all? Adaptation of trained CNNs to new marine acoustic environments. Ecological Informatics, 78, [102363]. (doi:10.1016/j.ecoinf.2023.102363).

Record type: Article

Abstract

Convolutional neural networks (CNNs) have the potential to enable a revolution in bioacoustics, allowing robust detection and classification of marine sound sources. As global Passive Acoustic Monitoring (PAM) datasets continue to expand it is critical we improve our confidence in the performance of models across different marine environments, if we are to exploit the full ecological value of information within the data. This work demonstrates the transferability of developed CNN models to new acoustic environments by using a pre-trained model developed for one location (West of Scotland, UK) and deploying it in a distinctly different soundscape (Gulf of Mexico, USA). In this work transfer learning is used to fine-tune an existing open-source ‘small-scale’ CNN, which detects odontocete tonal and broadband call types and vessel noise (operating between 0 and 48 kHz). The CNN is fine-tuned on training sets of differing sizes, from the unseen site, to understand the adaptability of a network to new marine acoustic environments. Fine-tuning with a small sample of site-specific data significantly improves the performance of the CNN in the new environment, across all classes. We demonstrate an improved performance in area-under-curve (AUC) score of 0.30, across four classes by fine-training with only 50 spectrograms per class, with a 5% improvement in accuracy between 50 frames and 500 frames. This work shows that only a small amount of site-specific data is needed to retrain a CNN, enabling researchers to harness the power of existing pre-trained models for their own datasets. The marine bioacoustic domain will benefit from a larger pool of global data for training large deep learning models, but we illustrate in this work that domain adaptation can be improved with limited site-specific exemplars.

Text
OneSizeFitsAll_CNN_EcologicalInformatics - Version of Record
Available under License Creative Commons Attribution.
Download (4MB)

More information

Accepted/In Press date: 1 November 2023
e-pub ahead of print date: 7 November 2023
Published date: December 2023
Additional Information: Funding Information: This work was supported by the Natural Environmental Research Council [grant number NE/S007210/1 ]. The COMPASS project has been supported by the EU's INTERREG VA Programme , managed by the Special EU Programmes Body . The views and opinions expressed in this document do not necessarily reflect those of the European Commission or the Special EU Programmes Body (SEUPB) . Publisher Copyright: © 2023 The Authors
Keywords: Bioacoustics, Deep learning, Domain adaptation, Marine Mammal Detection, Marine acoustics, Soundscapes, Marine mammal detection

Identifiers

Local EPrints ID: 484457
URI: http://eprints.soton.ac.uk/id/eprint/484457
ISSN: 1574-9541
PURE UUID: 3b210aef-e55d-45f4-babf-962f82a1f066
ORCID for Jonathan M. Bull: ORCID iD orcid.org/0000-0003-3373-5807
ORCID for Paul R. White: ORCID iD orcid.org/0000-0002-4787-8713

Catalogue record

Date deposited: 16 Nov 2023 12:14
Last modified: 18 Mar 2024 02:39

Export record

Altmetrics

Contributors

Author: Ellen L. White
Author: Holger Klinck
Author: Paul R. White ORCID iD
Author: Denise Risch

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×