Image segmentation in marine environments using convolutional LSTM for temporal context
Image segmentation in marine environments using convolutional LSTM for temporal context
Unmanned surface vehicles (USVs) carry a wealth of possible applications, many of which are limited by the vehicle's level of autonomy. The development of efficient and robust computer vision algorithms is a key factor in improving this, as they permit autonomous detection and thereby avoidance of obstacles. Recent developments in convolutional neural networks (CNNs), and the collection of increasingly diverse datasets, present opportunities for improved computer vision algorithms requiring less data and computational power. One area of potential improvement is the utilisation of temporal context from USV camera feeds in the form of sequential video frames to consistently identify obstacles in diverse marine environments under challenging conditions. This paper documents the implementation of this through long short-term memory (LSTM) cells in existing CNN structures and the exploration of parameters affecting their efficacy. It is found that LSTM cells are promising for achieving improved performance; however, there are weaknesses associated with network training procedures and datasets. Several novel network architectures are presented and compared using a state-of-the-art benchmarking method. It is shown that LSTM cells allow for better model performance with fewer training iterations, but that this advantage diminishes with additional training.
Hansen, Kasper Foss
43a662c3-e796-44ae-a840-b92a06883c38
Yao, Linghong
ae3dfc15-d28f-4ac6-92f3-e3fae7f5e039
Ren, Kang
d579a21f-df53-4646-b697-5314e79d82e0
Wang, Sen
003d3e09-ec33-4b78-852a-626e8532f5cb
Liu, Wenwen
d2e4c2d1-2c67-4816-bf69-fa64643f6c05
Liu, Yuanchang
84c50c16-30a4-4d84-ae42-56c2abdbf179
26 August 2023
Hansen, Kasper Foss
43a662c3-e796-44ae-a840-b92a06883c38
Yao, Linghong
ae3dfc15-d28f-4ac6-92f3-e3fae7f5e039
Ren, Kang
d579a21f-df53-4646-b697-5314e79d82e0
Wang, Sen
003d3e09-ec33-4b78-852a-626e8532f5cb
Liu, Wenwen
d2e4c2d1-2c67-4816-bf69-fa64643f6c05
Liu, Yuanchang
84c50c16-30a4-4d84-ae42-56c2abdbf179
Hansen, Kasper Foss, Yao, Linghong, Ren, Kang, Wang, Sen, Liu, Wenwen and Liu, Yuanchang
(2023)
Image segmentation in marine environments using convolutional LSTM for temporal context.
Applied Ocean Research, 139, [103709].
(doi:10.1016/j.apor.2023.103709).
Abstract
Unmanned surface vehicles (USVs) carry a wealth of possible applications, many of which are limited by the vehicle's level of autonomy. The development of efficient and robust computer vision algorithms is a key factor in improving this, as they permit autonomous detection and thereby avoidance of obstacles. Recent developments in convolutional neural networks (CNNs), and the collection of increasingly diverse datasets, present opportunities for improved computer vision algorithms requiring less data and computational power. One area of potential improvement is the utilisation of temporal context from USV camera feeds in the form of sequential video frames to consistently identify obstacles in diverse marine environments under challenging conditions. This paper documents the implementation of this through long short-term memory (LSTM) cells in existing CNN structures and the exploration of parameters affecting their efficacy. It is found that LSTM cells are promising for achieving improved performance; however, there are weaknesses associated with network training procedures and datasets. Several novel network architectures are presented and compared using a state-of-the-art benchmarking method. It is shown that LSTM cells allow for better model performance with fewer training iterations, but that this advantage diminishes with additional training.
Text
1-s2.0-S014111872300250X-main
- Version of Record
More information
Accepted/In Press date: 18 August 2023
e-pub ahead of print date: 26 August 2023
Published date: 26 August 2023
Identifiers
Local EPrints ID: 492948
URI: http://eprints.soton.ac.uk/id/eprint/492948
ISSN: 0141-1187
PURE UUID: 57c8d987-545b-4588-9862-0e294e9b1af1
Catalogue record
Date deposited: 21 Aug 2024 17:02
Last modified: 22 Aug 2024 02:11
Export record
Altmetrics
Contributors
Author:
Kasper Foss Hansen
Author:
Linghong Yao
Author:
Kang Ren
Author:
Sen Wang
Author:
Wenwen Liu
Author:
Yuanchang Liu
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics