A hybrid sparse-semantic image classification framework to support marine and coastal monitoring and management
A hybrid sparse-semantic image classification framework to support marine and coastal monitoring and management
Effective ocean and coastal management increasingly relies on timely, reliable, and interpretable information derived from large volumes of visual data collected by satellites, Unmanned Aerial Vehicles (UAVs), and shipborne sensing systems. Such data underpin a wide range of management and governance functions, including environmental monitoring, maritime surveillance, port operations, and coastal planning. However, the operational use of automated image classification in marine and coastal contexts remains constrained by complex visual conditions and the persistent scarcity of labelled data, limiting its integration into routine decision-support workflows. This study develops a hybrid sparse-semantic image classification framework designed to enhance the robustness and applicability of marine and coastal image understanding in management-oriented settings. The framework integrates structured local representations, which capture geometric and spatial characteristics of marine scenes through a sparsity-driven encoding strategy with locality, non-negativity, and semantic consistency constraints, with global semantic features learned via self-supervised Vision Transformers (ViTs). Specifically, semantic information extracted from the Classification (CLS) token of a pretrained self-Distillation with No Labels (DINO) model is fused with sparse local descriptors to form interpretable and discriminative image representations that remain effective under limited labelled data conditions. Experiments conducted on four public benchmark datasets and three maritime image datasets demonstrate consistent performance improvements over representative state-of-the-art approaches. Beyond classification accuracy, the proposed framework provides a transferable and expandable analytical tool to support marine environmental monitoring, maritime traffic surveillance, and coastal management practices by enhancing upstream image interpretation within monitoring and decision-support workflows in data-constrained marine and coastal contexts.
Decision support, Image classification, Marine and coastal management, Maritime surveillance, Self-supervised learning
Shi, Ying
c1c5c1d8-b6bb-4ab8-acc6-c17989eadeba
Song, Lan
865f8a4a-da88-49b4-bc27-11fa5a229f62
Wu, Luocheng
9ca477a4-4e0f-455c-b36a-ba4c9a217ea9
Li, Huanhuan
5e806b21-10a7-465c-9db3-32e466ae42f1
1 June 2026
Shi, Ying
c1c5c1d8-b6bb-4ab8-acc6-c17989eadeba
Song, Lan
865f8a4a-da88-49b4-bc27-11fa5a229f62
Wu, Luocheng
9ca477a4-4e0f-455c-b36a-ba4c9a217ea9
Li, Huanhuan
5e806b21-10a7-465c-9db3-32e466ae42f1
Shi, Ying, Song, Lan, Wu, Luocheng and Li, Huanhuan
(2026)
A hybrid sparse-semantic image classification framework to support marine and coastal monitoring and management.
Ocean & Coastal Management, 277, [108207].
(doi:10.1016/j.ocecoaman.2026.108207).
Abstract
Effective ocean and coastal management increasingly relies on timely, reliable, and interpretable information derived from large volumes of visual data collected by satellites, Unmanned Aerial Vehicles (UAVs), and shipborne sensing systems. Such data underpin a wide range of management and governance functions, including environmental monitoring, maritime surveillance, port operations, and coastal planning. However, the operational use of automated image classification in marine and coastal contexts remains constrained by complex visual conditions and the persistent scarcity of labelled data, limiting its integration into routine decision-support workflows. This study develops a hybrid sparse-semantic image classification framework designed to enhance the robustness and applicability of marine and coastal image understanding in management-oriented settings. The framework integrates structured local representations, which capture geometric and spatial characteristics of marine scenes through a sparsity-driven encoding strategy with locality, non-negativity, and semantic consistency constraints, with global semantic features learned via self-supervised Vision Transformers (ViTs). Specifically, semantic information extracted from the Classification (CLS) token of a pretrained self-Distillation with No Labels (DINO) model is fused with sparse local descriptors to form interpretable and discriminative image representations that remain effective under limited labelled data conditions. Experiments conducted on four public benchmark datasets and three maritime image datasets demonstrate consistent performance improvements over representative state-of-the-art approaches. Beyond classification accuracy, the proposed framework provides a transferable and expandable analytical tool to support marine environmental monitoring, maritime traffic surveillance, and coastal management practices by enhancing upstream image interpretation within monitoring and decision-support workflows in data-constrained marine and coastal contexts.
This record has no associated files available for download.
More information
Accepted/In Press date: 6 April 2026
e-pub ahead of print date: 10 April 2026
Published date: 1 June 2026
Additional Information:
Publisher Copyright:
© 2026 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
Keywords:
Decision support, Image classification, Marine and coastal management, Maritime surveillance, Self-supervised learning
Identifiers
Local EPrints ID: 511427
URI: http://eprints.soton.ac.uk/id/eprint/511427
ISSN: 0964-5691
PURE UUID: b87a3bf0-358a-48ad-a91c-f61f07f3aeba
Catalogue record
Date deposited: 14 May 2026 16:37
Last modified: 15 May 2026 02:13
Export record
Altmetrics
Contributors
Author:
Ying Shi
Author:
Lan Song
Author:
Luocheng Wu
Author:
Huanhuan Li
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics