Exploring the use of Sentinel-2 datasets and environmental variables to model wheat crop yield in smallholder arid and semi-arid farming systems
Exploring the use of Sentinel-2 datasets and environmental variables to model wheat crop yield in smallholder arid and semi-arid farming systems
Low levels of agricultural productivity are associated with the persistence of food insecurity, poverty, and other socio-economic stresses. Mapping and monitoring agricultural dynamics and production in real-time at high spatial resolution are essential for ensuring food security and shaping policy interventions. However, an accurate yield estimation might be challenging in some arid and semi-arid regions since input datasets are generally scarce, and access is restricted due to security challenges. This work examines how well Sentinel-2 satellite sensor-derived data, topographic and climatic variables, can be used as covariates to accurately model and predict wheat crop yield at the farm level using statistical models in low data settings of arid and semi-arid regions, using Sulaimani governorate in Iraq as an example. We developed a covariate selection procedure that assessed the correlations between the covariates and their relationships with wheat crop yield. Potential non-linear relationships were investigated in the latter case using regression splines. In the absence of substantial non-linear relationships between the covariates and crop yield, and residual spatial autocorrelation, we fitted a Bayesian multiple linear regression model to model and predict crop yield at 10 m resolution. Out of the covariates tested, our results showed significant relationships between crop yield and mean cumulative NDVI during the growing season, mean elevation, mean end of the season, mean maximum temperature and mean the start of the season at the farm level. For in-sample prediction, we estimated an R2 value of 51 % for the model, whereas for out-of-sample prediction, this was 41 %, both of which indicate reasonable predictive performance. The calculated root-mean-square error for out-of-sample prediction was 69.80, which is less than the standard deviation of 89.23 for crop yield, further showing that the model performed well by reducing prediction variability. Besides crop yield estimates, the model produced uncertainty metrics at 10 m resolution. Overall, this study showed that Sentinel-2 data can be valuable for upscaling field measurement of crop yield in arid and semi-arid regions. In addition, the environmental covariates can strengthen the model predictive power. The method may be applicable in other areas with similar environments, particularly in conflict zones, to increase the availability of agricultural statistics.
Qader, Sarchil Hama
e8e721d4-9706-4b5e-94ee-262042a268ed
Utazi, Chigozie Edson
e69ca81e-fb23-4bc1-99a5-25c9e0f4d6f9
Priyatikanto, Rhorom
c250c3ca-958c-46a2-969a-3ad689b8630b
Najmaddin, Peshawa
99e703b3-3133-482b-aa38-4fa32e0bb5d6
Hama-Ali, Emad Omer
bef0e001-4cc9-4a13-b709-edb74c9c5223
Khwarahm, Nabaz R.
2e1dea22-1f7f-41d6-b007-ed5bcc95f6ec
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
Dash, Jadu
69e5776c-02f9-4485-894e-c5551d95eabb
15 April 2023
Qader, Sarchil Hama
e8e721d4-9706-4b5e-94ee-262042a268ed
Utazi, Chigozie Edson
e69ca81e-fb23-4bc1-99a5-25c9e0f4d6f9
Priyatikanto, Rhorom
c250c3ca-958c-46a2-969a-3ad689b8630b
Najmaddin, Peshawa
99e703b3-3133-482b-aa38-4fa32e0bb5d6
Hama-Ali, Emad Omer
bef0e001-4cc9-4a13-b709-edb74c9c5223
Khwarahm, Nabaz R.
2e1dea22-1f7f-41d6-b007-ed5bcc95f6ec
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
Dash, Jadu
69e5776c-02f9-4485-894e-c5551d95eabb
Qader, Sarchil Hama, Utazi, Chigozie Edson, Priyatikanto, Rhorom, Najmaddin, Peshawa, Hama-Ali, Emad Omer, Khwarahm, Nabaz R., Tatem, Andrew J. and Dash, Jadu
(2023)
Exploring the use of Sentinel-2 datasets and environmental variables to model wheat crop yield in smallholder arid and semi-arid farming systems.
Science of the Total Environment, 869, [161716].
(doi:10.1016/j.scitotenv.2023.161716).
Abstract
Low levels of agricultural productivity are associated with the persistence of food insecurity, poverty, and other socio-economic stresses. Mapping and monitoring agricultural dynamics and production in real-time at high spatial resolution are essential for ensuring food security and shaping policy interventions. However, an accurate yield estimation might be challenging in some arid and semi-arid regions since input datasets are generally scarce, and access is restricted due to security challenges. This work examines how well Sentinel-2 satellite sensor-derived data, topographic and climatic variables, can be used as covariates to accurately model and predict wheat crop yield at the farm level using statistical models in low data settings of arid and semi-arid regions, using Sulaimani governorate in Iraq as an example. We developed a covariate selection procedure that assessed the correlations between the covariates and their relationships with wheat crop yield. Potential non-linear relationships were investigated in the latter case using regression splines. In the absence of substantial non-linear relationships between the covariates and crop yield, and residual spatial autocorrelation, we fitted a Bayesian multiple linear regression model to model and predict crop yield at 10 m resolution. Out of the covariates tested, our results showed significant relationships between crop yield and mean cumulative NDVI during the growing season, mean elevation, mean end of the season, mean maximum temperature and mean the start of the season at the farm level. For in-sample prediction, we estimated an R2 value of 51 % for the model, whereas for out-of-sample prediction, this was 41 %, both of which indicate reasonable predictive performance. The calculated root-mean-square error for out-of-sample prediction was 69.80, which is less than the standard deviation of 89.23 for crop yield, further showing that the model performed well by reducing prediction variability. Besides crop yield estimates, the model produced uncertainty metrics at 10 m resolution. Overall, this study showed that Sentinel-2 data can be valuable for upscaling field measurement of crop yield in arid and semi-arid regions. In addition, the environmental covariates can strengthen the model predictive power. The method may be applicable in other areas with similar environments, particularly in conflict zones, to increase the availability of agricultural statistics.
Text
1-s2.0-S0048969723003315-main
- Version of Record
More information
Accepted/In Press date: 15 January 2023
e-pub ahead of print date: 20 January 2023
Published date: 15 April 2023
Additional Information:
Funding Information:
This research was funded by UK Research and Innovation GCRF 323036/ARCP011217 .
Identifiers
Local EPrints ID: 474993
URI: http://eprints.soton.ac.uk/id/eprint/474993
ISSN: 0048-9697
PURE UUID: 99f833fe-9d7d-4637-9d21-a3c5553c9236
Catalogue record
Date deposited: 08 Mar 2023 17:50
Last modified: 17 Mar 2024 04:09
Export record
Altmetrics
Contributors
Author:
Sarchil Hama Qader
Author:
Rhorom Priyatikanto
Author:
Peshawa Najmaddin
Author:
Emad Omer Hama-Ali
Author:
Nabaz R. Khwarahm
Author:
Jadu Dash
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics