The University of Southampton
University of Southampton Institutional Repository

A benchmark for multi-class object counting and size estimation using deep convolutional neural networks

A benchmark for multi-class object counting and size estimation using deep convolutional neural networks
A benchmark for multi-class object counting and size estimation using deep convolutional neural networks
Automatic object counting and object size estimation in digital images can be very useful in many real-world applications such as surveillance, smart farming, intelligent traffic systems, etc. However, most existing research mainly focus on scenarios where only one type of object is considered due to the lack of proper datasets. Furthermore, they use the traditional detection algorithms for size estimation and can only do segmenting tasks but cannot identify different types of objects and return corresponding individual size information. To fill these gaps, we create a synthetic dataset and propose a benchmark for multi-class object counting and size estimation (MOCSE) within a unified framework. We create the dataset MOCSE13 by using Unity to generate synthetic images for 13 different objects (fruits and vegetables). Besides, we propose a deep architecture approach for multi-class object counting and object size estimation. Our proposed models with different backbones are evaluated on the synthetic dataset. The experimental results provide a benchmark for multi-class object counting and size estimation and the synthetic dataset can be served as a proper testbed for future studies.
Convolutional neural networks, Crowd counting, Multi-class object counting, Object size estimation, Synthetic dataset
0952-1976
Liu, Zixu
1b07df56-a07b-4e78-bebc-38657ca80f76
Wang, Qian
f7d5674e-9334-4631-a6da-627603c02fa8
Meng, Fanlin
3c9359f7-2fea-4477-a84c-3a283d3701dc
Liu, Zixu
1b07df56-a07b-4e78-bebc-38657ca80f76
Wang, Qian
f7d5674e-9334-4631-a6da-627603c02fa8
Meng, Fanlin
3c9359f7-2fea-4477-a84c-3a283d3701dc

Liu, Zixu, Wang, Qian and Meng, Fanlin (2022) A benchmark for multi-class object counting and size estimation using deep convolutional neural networks. Engineering Applications of Artificial Intelligence, 116, [105449]. (doi:10.1016/j.engappai.2022.105449).

Record type: Article

Abstract

Automatic object counting and object size estimation in digital images can be very useful in many real-world applications such as surveillance, smart farming, intelligent traffic systems, etc. However, most existing research mainly focus on scenarios where only one type of object is considered due to the lack of proper datasets. Furthermore, they use the traditional detection algorithms for size estimation and can only do segmenting tasks but cannot identify different types of objects and return corresponding individual size information. To fill these gaps, we create a synthetic dataset and propose a benchmark for multi-class object counting and size estimation (MOCSE) within a unified framework. We create the dataset MOCSE13 by using Unity to generate synthetic images for 13 different objects (fruits and vegetables). Besides, we propose a deep architecture approach for multi-class object counting and object size estimation. Our proposed models with different backbones are evaluated on the synthetic dataset. The experimental results provide a benchmark for multi-class object counting and size estimation and the synthetic dataset can be served as a proper testbed for future studies.

Text
Accepted version - Accepted Manuscript
Download (2MB)
Text
1-s2.0-S0952197622004390-main - Version of Record
Restricted to Repository staff only
Request a copy

More information

Accepted/In Press date: 9 September 2022
e-pub ahead of print date: 1 October 2022
Published date: November 2022
Additional Information: Publisher Copyright: © 2022 Elsevier Ltd
Keywords: Convolutional neural networks, Crowd counting, Multi-class object counting, Object size estimation, Synthetic dataset

Identifiers

Local EPrints ID: 470981
URI: http://eprints.soton.ac.uk/id/eprint/470981
ISSN: 0952-1976
PURE UUID: 5bd7dcdb-51ae-4c5a-b052-15062af1cdd5
ORCID for Zixu Liu: ORCID iD orcid.org/0000-0002-4806-5482

Catalogue record

Date deposited: 21 Oct 2022 16:46
Last modified: 17 Mar 2024 07:32

Export record

Altmetrics

Contributors

Author: Zixu Liu ORCID iD
Author: Qian Wang
Author: Fanlin Meng

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×