The University of Southampton
University of Southampton Institutional Repository

An assessment of the effectiveness of a random forest classifier for land-cover classification

An assessment of the effectiveness of a random forest classifier for land-cover classification
An assessment of the effectiveness of a random forest classifier for land-cover classification
Land cover monitoring using remotely sensed data requires robust classification methods which allow for the accurate mapping of complex land cover and land use categories. Random forest (RF) is a powerful machine learning classifier that is relatively unknown in land remote sensing and has not been evaluated thoroughly by the remote sensing community compared to more conventional pattern recognition techniques. Key advantages of RF include: their non-parametric nature; high classification accuracy; and capability to determine variable importance. However, the split rules for classification are unknown, therefore RF can be considered to be black box type classifier. RF provides an algorithm for estimating missing values; and flexibility to perform several types of data analysis, including regression, classification, survival analysis, and unsupervised learning.

In this paper, the performance of the RF classifier for land cover classification of a complex area is explored. Evaluation was based on several criteria: mapping accuracy, sensitivity to data set size and noise. Landsat-5 Thematic Mapper data captured in European spring and summer were used with auxiliary variables derived from a digital terrain model to classify 14 different land categories in the south of Spain. Results show that the RF algorithm yields accurate land cover classifications, with 92% overall accuracy and a Kappa index of 0.92. RF is robust to training data reduction and noise because significant differences in kappa values were only observed for data reduction and noise addition values greater than 50 and 20%, respectively. Additionally, variables that RF identified as most important for classifying land cover coincided with expectations. A McNemar test indicates an overall better performance of the random forest model over a single decision tree at the 0.00001 significance level.
0924-2716
93-104
Rodriguez-Galiano, V.F.
1eb6a1dd-f73d-4e90-a9cf-a51f20712c3c
Ghimire, B.
d2ab9f23-815a-485e-8347-15f28ef8ce77
Rogan, J.
f7579c71-3a48-4faf-ab7a-fe29acef2748
Chica-Olmo, M.
c7291c15-3b53-45d7-942c-06985f77d6f6
Rigol-Sanchez, J.P.
815a8e29-a1b2-44f4-a661-801fbee91f4d
Rodriguez-Galiano, V.F.
1eb6a1dd-f73d-4e90-a9cf-a51f20712c3c
Ghimire, B.
d2ab9f23-815a-485e-8347-15f28ef8ce77
Rogan, J.
f7579c71-3a48-4faf-ab7a-fe29acef2748
Chica-Olmo, M.
c7291c15-3b53-45d7-942c-06985f77d6f6
Rigol-Sanchez, J.P.
815a8e29-a1b2-44f4-a661-801fbee91f4d

Rodriguez-Galiano, V.F., Ghimire, B., Rogan, J., Chica-Olmo, M. and Rigol-Sanchez, J.P. (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS Journal of Photogrammetry and Remote Sensing, 67, 93-104. (doi:10.1016/j.isprsjprs.2011.11.002).

Record type: Article

Abstract

Land cover monitoring using remotely sensed data requires robust classification methods which allow for the accurate mapping of complex land cover and land use categories. Random forest (RF) is a powerful machine learning classifier that is relatively unknown in land remote sensing and has not been evaluated thoroughly by the remote sensing community compared to more conventional pattern recognition techniques. Key advantages of RF include: their non-parametric nature; high classification accuracy; and capability to determine variable importance. However, the split rules for classification are unknown, therefore RF can be considered to be black box type classifier. RF provides an algorithm for estimating missing values; and flexibility to perform several types of data analysis, including regression, classification, survival analysis, and unsupervised learning.

In this paper, the performance of the RF classifier for land cover classification of a complex area is explored. Evaluation was based on several criteria: mapping accuracy, sensitivity to data set size and noise. Landsat-5 Thematic Mapper data captured in European spring and summer were used with auxiliary variables derived from a digital terrain model to classify 14 different land categories in the south of Spain. Results show that the RF algorithm yields accurate land cover classifications, with 92% overall accuracy and a Kappa index of 0.92. RF is robust to training data reduction and noise because significant differences in kappa values were only observed for data reduction and noise addition values greater than 50 and 20%, respectively. Additionally, variables that RF identified as most important for classifying land cover coincided with expectations. A McNemar test indicates an overall better performance of the random forest model over a single decision tree at the 0.00001 significance level.

This record has no associated files available for download.

More information

Published date: January 2012
Organisations: Global Env Change & Earth Observation

Identifiers

Local EPrints ID: 360079
URI: http://eprints.soton.ac.uk/id/eprint/360079
ISSN: 0924-2716
PURE UUID: 22ce85a5-964a-4374-bab7-1e4919071e2b

Catalogue record

Date deposited: 25 Nov 2013 13:22
Last modified: 14 Mar 2024 15:33

Export record

Altmetrics

Contributors

Author: V.F. Rodriguez-Galiano
Author: B. Ghimire
Author: J. Rogan
Author: M. Chica-Olmo
Author: J.P. Rigol-Sanchez

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×