The University of Southampton
University of Southampton Institutional Repository

Data augmentation in classification and segmentation: a survey and new strategies

Alomar, Khaled, Aysel, Halil Ibrahim and Cai, Xiaohao (2023) Data augmentation in classification and segmentation: a survey and new strategies. Journal of Imaging, 9 (2), [46]. (doi:10.3390/jimaging9020046).

Record type: Article

Abstract

In the past decade, deep neural networks, particularly convolutional neural networks, have revolutionised computer vision. However, deep learning models often require large amounts of data to achieve satisfactory results. Unfortunately, sufficient data are not always available for real-world problems, and it is well recognised that a paucity of data easily results in overfitting. This issue may be addressed through several approaches, one of which is data augmentation. In this paper, we survey the existing data augmentation techniques in computer vision tasks, including segmentation and classification, and suggest new strategies. In particular, we introduce a way of implementing data augmentation by using local information in images. We propose a parameter-free and easy-to-implement strategy, the random local rotation strategy, which involves randomly selecting the location and size of circular regions in the image and rotating them by random angles. It can be used as an alternative to the traditional rotation strategy, which generally suffers from irregular image boundaries. It can also complement other data augmentation techniques. Extensive experimental results and comparisons demonstrated that the new strategy consistently outperformed its traditional counterparts in, for example, image classification.
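
The random local rotation idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (rotate_disc, random_local_rotation), the use of a single circular region per call, the uniform sampling of radius, centre and angle, and the nearest-neighbour resampling are all assumptions made for illustration. Because only pixels inside the circle are moved, the image keeps its rectangular shape and no empty corners appear, unlike a rotation of the whole image.

```python
import numpy as np


def rotate_disc(image, cy, cx, r, theta):
    """Rotate the disc of radius r centred at (cy, cx) by theta radians.

    Hypothetical helper: nearest-neighbour inverse mapping, assumed here
    for simplicity. The disc must lie fully inside the image.
    """
    out = image.copy()
    # Pixel coordinates of the square that bounds the disc.
    ys, xs = np.mgrid[cy - r:cy + r + 1, cx - r:cx + r + 1]
    dy, dx = ys - cy, xs - cx
    inside = dy ** 2 + dx ** 2 <= r ** 2
    # For every destination pixel, look up the source pixel obtained by
    # rotating its offset from the centre. A rotation preserves distance
    # to the centre, so the source always stays inside the disc.
    c, s = np.cos(theta), np.sin(theta)
    src_y = np.rint(cy + c * dy - s * dx).astype(int)
    src_x = np.rint(cx + s * dy + c * dx).astype(int)
    out[ys[inside], xs[inside]] = image[src_y[inside], src_x[inside]]
    return out


def random_local_rotation(image, rng=None):
    """Rotate one randomly placed, randomly sized disc by a random angle.

    Assumption: radius, centre and angle are drawn uniformly; the source
    record does not state the sampling distributions.
    """
    rng = np.random.default_rng() if rng is None else rng
    h, w = image.shape[:2]
    max_r = (min(h, w) - 1) // 2
    if max_r < 1:
        raise ValueError("image must be at least 3x3 pixels")
    r = int(rng.integers(1, max_r + 1))   # radius in [1, max_r]
    cy = int(rng.integers(r, h - r))      # centre keeps the disc in bounds
    cx = int(rng.integers(r, w - r))
    theta = rng.uniform(0.0, 2.0 * np.pi)
    return rotate_disc(image, cy, cx, r, theta)
```

Calling random_local_rotation several times on the same image would rotate several regions, which is one possible way to combine it with other augmentations; whether this matches the paper's procedure is not stated in this record.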

Text
jimaging-09-00046-v2 - Version of Record
Available under License Creative Commons Attribution.

More information

Accepted/In Press date: 10 February 2023
Published date: 17 February 2023
Funding Information: K.A. and H.I.A. are thankful for the support from The Ministry of Education in Saudi Arabia and the Republic of Turkey Ministry of National Education, respectively.
Keywords: classification, convolutional neural networks, data augmentation, deep learning, image processing, segmentation

Identifiers

Local EPrints ID: 481570
URI: http://eprints.soton.ac.uk/id/eprint/481570
ISSN: 2313-433X
PURE UUID: 0deaf7df-71b8-4eae-abf2-95e3cd65c87a
ORCID for Khaled Alomar: orcid.org/0000-0002-8303-3240
ORCID for Halil Ibrahim Aysel: orcid.org/0000-0002-4981-0827
ORCID for Xiaohao Cai: orcid.org/0000-0003-0924-2834

Catalogue record

Date deposited: 04 Sep 2023 16:34
Last modified: 18 Mar 2024 04:00

Contributors

Author: Khaled Alomar
Author: Halil Ibrahim Aysel
Author: Xiaohao Cai
