The University of Southampton
University of Southampton Institutional Repository

Classification of continuous sky brightness data using random forest

Classification of continuous sky brightness data using random forest
Classification of continuous sky brightness data using random forest
Sky brightness measuring and monitoring are required to mitigate the negative effect of light pollution as a byproduct of modern civilization. Good handling of a pile of sky brightness data includes evaluation and classification of the data according to its quality and characteristics such that further analysis and inference can be conducted properly. This study aims to develop a classification model based on Random Forest algorithm and to evaluate its performance. Using sky brightness data from 1250 nights with minute temporal resolution acquired at eight different stations in Indonesia, datasets consisting of 15 features were created to train and test the model. Those features were extracted from the observation time, the global statistics of nightly sky brightness, or the light curve characteristics. Among those features, 10 are considered to be the most important for the classification task. The model was trained to classify the data into six classes (1: peculiar data, 2: overcast, 3: cloudy, 4: clear, 5: moonlit-cloudy, and 6: moonlit-clear) and then tested to achieve high accuracy (92%) and scores (F-score = 84% and G-mean = 84%). Some misclassifications exist, but the classification results are considerably good as indicated by posterior distributions of the sky brightness as a function of classes. Data classified as class-4 have sharp distribution with typical full width at half maximum of 1.5 mag/arcsec2, while distributions of class-2 and -3 are left skewed with the latter having lighter tail. Due to the moonlight, distributions of class-5 and -6 data are more smeared or have larger spread. These results demonstrate that the established classification model is reasonably good and consistent.
1687-7969
Priyatikanto, Rhorom
c250c3ca-958c-46a2-969a-3ad689b8630b
Mayangsari, Lidia
3630b0ea-f59f-4fae-8bdf-653380dec45d
Prihandoko, Rudi A.
411c33a4-a71c-4dd3-9c80-8f84f5804547
Admiranto, Agustinus G.
8b3cb7a3-3882-46fa-a632-1ea72ddfa943
Priyatikanto, Rhorom
c250c3ca-958c-46a2-969a-3ad689b8630b
Mayangsari, Lidia
3630b0ea-f59f-4fae-8bdf-653380dec45d
Prihandoko, Rudi A.
411c33a4-a71c-4dd3-9c80-8f84f5804547
Admiranto, Agustinus G.
8b3cb7a3-3882-46fa-a632-1ea72ddfa943

Priyatikanto, Rhorom, Mayangsari, Lidia, Prihandoko, Rudi A. and Admiranto, Agustinus G. (2020) Classification of continuous sky brightness data using random forest. Advances in Astronomy, 2020, [5102065]. (doi:10.1155/2020/5102065).

Record type: Article

Abstract

Sky brightness measuring and monitoring are required to mitigate the negative effect of light pollution as a byproduct of modern civilization. Good handling of a pile of sky brightness data includes evaluation and classification of the data according to its quality and characteristics such that further analysis and inference can be conducted properly. This study aims to develop a classification model based on Random Forest algorithm and to evaluate its performance. Using sky brightness data from 1250 nights with minute temporal resolution acquired at eight different stations in Indonesia, datasets consisting of 15 features were created to train and test the model. Those features were extracted from the observation time, the global statistics of nightly sky brightness, or the light curve characteristics. Among those features, 10 are considered to be the most important for the classification task. The model was trained to classify the data into six classes (1: peculiar data, 2: overcast, 3: cloudy, 4: clear, 5: moonlit-cloudy, and 6: moonlit-clear) and then tested to achieve high accuracy (92%) and scores (F-score = 84% and G-mean = 84%). Some misclassifications exist, but the classification results are considerably good as indicated by posterior distributions of the sky brightness as a function of classes. Data classified as class-4 have sharp distribution with typical full width at half maximum of 1.5 mag/arcsec2, while distributions of class-2 and -3 are left skewed with the latter having lighter tail. Due to the moonlight, distributions of class-5 and -6 data are more smeared or have larger spread. These results demonstrate that the established classification model is reasonably good and consistent.

This record has no associated files available for download.

More information

Accepted/In Press date: 27 February 2020
Published date: 1 April 2020
Additional Information: Funding Information: This study was funded by the National Institute of Aeronautics and Space through an in-house program. LM was supported by Riset Pengembangan Kompetensi Institut Teknologi Bandung Publisher Copyright: © 2020 Rhorom Priyatikanto et al.

Identifiers

Local EPrints ID: 475418
URI: http://eprints.soton.ac.uk/id/eprint/475418
ISSN: 1687-7969
PURE UUID: cd5dc5de-8eb8-4e94-9d28-c4d60176244d
ORCID for Rhorom Priyatikanto: ORCID iD orcid.org/0000-0003-1203-2651
ORCID for Lidia Mayangsari: ORCID iD orcid.org/0000-0001-9373-5775

Catalogue record

Date deposited: 17 Mar 2023 17:37
Last modified: 18 Mar 2024 04:04

Export record

Altmetrics

Contributors

Author: Rhorom Priyatikanto ORCID iD
Author: Lidia Mayangsari ORCID iD
Author: Rudi A. Prihandoko
Author: Agustinus G. Admiranto

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×