Predicting Banana Yield at the Field Scale by Combining Sentinel-2 Time Series Data and Regression Models

Applied Engineering in Agriculture(2023)

Cited 0|Views4
No score
Abstract
Highlights A dataset expansion method based on random sampling could improve the robustness of yield estimation models. CIRE was more suitable for banana yield estimation. XGBoost-based banana yield estimation method showed good prediction ability of banana yield. Abstract. Banana yield prediction at the field level offers significant benefits to growers, packinghouses, crop insurance companies, and researchers. This study explored a remote sensing-based approach for forecasting banana yield at the field scale by using Sentinel-2 (S2) image time series and regression models. First, S2 images of critical phenological periods for bananas were acquired from the Google Earth Engine platform, and these images were treated with cloud and cloud shadow removal. Second, the dataset was expanded by randomly selecting pixels for each field to improve the accuracy of yield prediction. Third, nine vegetation indices (VIs) with high correlation with crop yield were compared and analyzed. Chlorophyll Index Red Edge was selected with a particularly high predictive ability in banana yield prediction. Finally, six regression models, namely, least absolute shrinkage and selection operator (LASSO), support vector regression (SVR), k-nearest neighbors (k-NN), random forest (RF), gradient boosted regression trees (GBRT), and extreme gradient boost (XGBoost), were employed, and their performances were compared. Results showed that the best prediction of banana yield was when 70 pixels were selected for each banana field. Out of nine VIs, comparing different regression models, the XGBoost model emerged as the best learner (the average of R2 for 100 runs in 2019 and 2020 were 0.84 and 0.79, respectively). It was followed by the GBRT model with almost the same performance, which explained 82% and 79% of the banana yield variability for 2019 and 2020, respectively. The LASSO model exhibited the lowest performance of all, but it performed best in terms of stability. The proposed framework applied to satellite image time series can achieve reliable banana yield prediction across years at the field scale. Keywords: Banana yield prediction, Extreme gradient boost, Sentinel-2, Vegetation index.
More
Translated text
Key words
banana yield,predicting,regression
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined