Data augmentation in classification and segmentation: a survey and new strategies
Data augmentation in classification and segmentation: a survey and new strategies
In the past decade, deep neural networks, particularly convolutional neural networks, have revolutionised computer vision. However, all deep learning models may require a large amount of data so as to achieve satisfying results. Unfortunately, the availability of sufficient amounts of data for real-world problems is not always possible, and it is well recognised that a paucity of data easily results in overfitting. This issue may be addressed through several approaches, one of which is data augmentation. In this paper, we survey the existing data augmentation techniques in computer vision tasks, including segmentation and classification, and suggest new strategies. In particular, we introduce a way of implementing data augmentation by using local information in images. We propose a parameter-free and easy to implement strategy, the random local rotation strategy, which involves randomly selecting the location and size of circular regions in the image and rotating them with random angles. It can be used as an alternative to the traditional rotation strategy, which generally suffers from irregular image boundaries. It can also complement other techniques in data augmentation. Extensive experimental results and comparisons demonstrated that the new strategy consistently outperformed its traditional counterparts in, for example, image classification.
classification, convolutional neural networks, data augmentation, deep learning, image processing, segmentation
Alomar, Khaled
ff1cdb20-40a5-42e3-82db-935881354868
Aysel, Halil Ibrahim
9db69eca-47c7-4443-86a1-33504e172d60
Cai, Xiaohao
de483445-45e9-4b21-a4e8-b0427fc72cee
17 February 2023
Alomar, Khaled
ff1cdb20-40a5-42e3-82db-935881354868
Aysel, Halil Ibrahim
9db69eca-47c7-4443-86a1-33504e172d60
Cai, Xiaohao
de483445-45e9-4b21-a4e8-b0427fc72cee
Alomar, Khaled, Aysel, Halil Ibrahim and Cai, Xiaohao
(2023)
Data augmentation in classification and segmentation: a survey and new strategies.
Journal of imaging, 9 (2), [46].
(doi:10.3390/jimaging9020046).
Abstract
In the past decade, deep neural networks, particularly convolutional neural networks, have revolutionised computer vision. However, all deep learning models may require a large amount of data so as to achieve satisfying results. Unfortunately, the availability of sufficient amounts of data for real-world problems is not always possible, and it is well recognised that a paucity of data easily results in overfitting. This issue may be addressed through several approaches, one of which is data augmentation. In this paper, we survey the existing data augmentation techniques in computer vision tasks, including segmentation and classification, and suggest new strategies. In particular, we introduce a way of implementing data augmentation by using local information in images. We propose a parameter-free and easy to implement strategy, the random local rotation strategy, which involves randomly selecting the location and size of circular regions in the image and rotating them with random angles. It can be used as an alternative to the traditional rotation strategy, which generally suffers from irregular image boundaries. It can also complement other techniques in data augmentation. Extensive experimental results and comparisons demonstrated that the new strategy consistently outperformed its traditional counterparts in, for example, image classification.
Text
jimaging-09-00046-v2
- Version of Record
More information
Accepted/In Press date: 10 February 2023
Published date: 17 February 2023
Additional Information:
Funding Information:
K.A. and H.I.A. are thankful for the support from The Ministry of Education in Saudi Arabia and the Republic of Turkey Ministry of National Education, respectively.
Keywords:
classification, convolutional neural networks, data augmentation, deep learning, image processing, segmentation
Identifiers
Local EPrints ID: 481570
URI: http://eprints.soton.ac.uk/id/eprint/481570
ISSN: 2313-433X
PURE UUID: 0deaf7df-71b8-4eae-abf2-95e3cd65c87a
Catalogue record
Date deposited: 04 Sep 2023 16:34
Last modified: 18 Mar 2024 04:00
Export record
Altmetrics
Contributors
Author:
Khaled Alomar
Author:
Halil Ibrahim Aysel
Author:
Xiaohao Cai
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics