The University of Southampton
University of Southampton Institutional Repository

Image segmentation in marine environments using convolutional LSTM for temporal context

Image segmentation in marine environments using convolutional LSTM for temporal context
Image segmentation in marine environments using convolutional LSTM for temporal context
Unmanned surface vehicles (USVs) carry a wealth of possible applications, many of which are limited by the vehicle's level of autonomy. The development of efficient and robust computer vision algorithms is a key factor in improving this, as they permit autonomous detection and thereby avoidance of obstacles. Recent developments in convolutional neural networks (CNNs), and the collection of increasingly diverse datasets, present opportunities for improved computer vision algorithms requiring less data and computational power. One area of potential improvement is the utilisation of temporal context from USV camera feeds in the form of sequential video frames to consistently identify obstacles in diverse marine environments under challenging conditions. This paper documents the implementation of this through long short-term memory (LSTM) cells in existing CNN structures and the exploration of parameters affecting their efficacy. It is found that LSTM cells are promising for achieving improved performance; however, there are weaknesses associated with network training procedures and datasets. Several novel network architectures are presented and compared using a state-of-the-art benchmarking method. It is shown that LSTM cells allow for better model performance with fewer training iterations, but that this advantage diminishes with additional training.
0141-1187
Hansen, Kasper Foss
43a662c3-e796-44ae-a840-b92a06883c38
Yao, Linghong
ae3dfc15-d28f-4ac6-92f3-e3fae7f5e039
Ren, Kang
d579a21f-df53-4646-b697-5314e79d82e0
Wang, Sen
003d3e09-ec33-4b78-852a-626e8532f5cb
Liu, Wenwen
d2e4c2d1-2c67-4816-bf69-fa64643f6c05
Liu, Yuanchang
84c50c16-30a4-4d84-ae42-56c2abdbf179
Hansen, Kasper Foss
43a662c3-e796-44ae-a840-b92a06883c38
Yao, Linghong
ae3dfc15-d28f-4ac6-92f3-e3fae7f5e039
Ren, Kang
d579a21f-df53-4646-b697-5314e79d82e0
Wang, Sen
003d3e09-ec33-4b78-852a-626e8532f5cb
Liu, Wenwen
d2e4c2d1-2c67-4816-bf69-fa64643f6c05
Liu, Yuanchang
84c50c16-30a4-4d84-ae42-56c2abdbf179

Hansen, Kasper Foss, Yao, Linghong, Ren, Kang, Wang, Sen, Liu, Wenwen and Liu, Yuanchang (2023) Image segmentation in marine environments using convolutional LSTM for temporal context. Applied Ocean Research, 139, [103709]. (doi:10.1016/j.apor.2023.103709).

Record type: Article

Abstract

Unmanned surface vehicles (USVs) carry a wealth of possible applications, many of which are limited by the vehicle's level of autonomy. The development of efficient and robust computer vision algorithms is a key factor in improving this, as they permit autonomous detection and thereby avoidance of obstacles. Recent developments in convolutional neural networks (CNNs), and the collection of increasingly diverse datasets, present opportunities for improved computer vision algorithms requiring less data and computational power. One area of potential improvement is the utilisation of temporal context from USV camera feeds in the form of sequential video frames to consistently identify obstacles in diverse marine environments under challenging conditions. This paper documents the implementation of this through long short-term memory (LSTM) cells in existing CNN structures and the exploration of parameters affecting their efficacy. It is found that LSTM cells are promising for achieving improved performance; however, there are weaknesses associated with network training procedures and datasets. Several novel network architectures are presented and compared using a state-of-the-art benchmarking method. It is shown that LSTM cells allow for better model performance with fewer training iterations, but that this advantage diminishes with additional training.

Text
1-s2.0-S014111872300250X-main - Version of Record
Available under License Creative Commons Attribution.
Download (17MB)

More information

Accepted/In Press date: 18 August 2023
e-pub ahead of print date: 26 August 2023
Published date: 26 August 2023

Identifiers

Local EPrints ID: 492948
URI: http://eprints.soton.ac.uk/id/eprint/492948
ISSN: 0141-1187
PURE UUID: 57c8d987-545b-4588-9862-0e294e9b1af1
ORCID for Kang Ren: ORCID iD orcid.org/0000-0002-9640-0521

Catalogue record

Date deposited: 21 Aug 2024 17:02
Last modified: 22 Aug 2024 02:11

Export record

Altmetrics

Contributors

Author: Kasper Foss Hansen
Author: Linghong Yao
Author: Kang Ren ORCID iD
Author: Sen Wang
Author: Wenwen Liu
Author: Yuanchang Liu

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×