Classification of sugarcane areas in Landsat images using machine learning algorithms

Main Article Content

Ana Clara Arantes Villas Bôas de Barros
Marcelo Andrade Silva
https://orcid.org/0000-0001-9079-8981
Rodrigo Domiciano Marques
https://orcid.org/0000-0002-5933-9776
Cristian Villegas
https://orcid.org/0000-0003-3176-5236
Ana Cláudia dos Santos Luciano
https://orcid.org/0000-0003-4862-9863

Abstract

Monitoring sugarcane areas through remote sensing is essential for planning and management of the national sugarcane industry. The use of machine learning algorithms has provided many benefits to remote sensing. This article aims to compare the prediction quality of three important machine learning methods in identifying sugarcane areas using Landsat images: Logistic Regression (LR), Decision Tree (DT) and RandomForest (RF). LR was applied in three versions: LR without penalization, LR with Ridge penalization (LR-R) and LR with Lasso penalization (LR-L). Data obtained in this study refer to a region of approximately 306,000 ha located in the state of São Paulo, Brazil, which was segmented into approximately 46,000 polygons (observations). Six spectral bands and vegetation indices observed along 17 months resulted in 102 covariates, which were reduced via Principal Component Analysis (PCA). In total, 19 Principal Components were chosen to account for 94.61% of the cumulative explained variance ratio and were used in machine learning methods to classify each polygon as sugarcane or other land covers. The method with the highest accuracy considering testing sample of 20% of data was RF (78.51%), followed by DT (72.30%), LR-L (69.64%), LR-R (69.64%), and LR (69.52%).

Article Details

How to Cite
Barros, A. C. A. V. B. de, Silva, M. A., Marques, R. D., Villegas, C., & Luciano, A. C. dos S. (2024). Classification of sugarcane areas in Landsat images using machine learning algorithms. Brazilian Journal of Biometrics, 42(2), 147–157. https://doi.org/10.28951/bjb.v42i2.693
Section
Articles

References

Breiman, L., Friedman, J. H., Olshen, R. A. & Stone, C. J. Classification and Regression Trees. (Wadsworth, 1984).

De Souza Dias, M. O., Maciel Filho, R., Mantelatto, P. E., Cavalett, O., Rossell, C. E. V., Bonomi, A. & Leal, M. R. L. V. Sugarcane processing for ethanol and sugar in Brazil. Environmental Development 15, 35–51. ISSN: 2211-4645. https://www.sciencedirect.com/science/ article/pii/S2211464515000147 (2015).

Hastie, T., Tibshirani, R. & Friedman, J. The Elements of Statistical Learning (Springer New York Inc., 2001).

Huete, A., Liu, H. & van Leeuwen, W. The use of vegetation indices in forested regions: issues of linearity and saturation in IGARSS’97. 1997 IEEE International Geoscience and Remote Sensing Symposium Proceedings. Remote Sensing - A Scientific Vision for Sustainable Development 4 (1997), 1966–1968 vol.4.

Izbicki, R. & dos Santos, T. M. Aprendizado de máquina: uma abordagem estatística 268 pp. (2020).

James, G., Witten, D., Hastie, T. & Tibshirani, R. An Introduction to Statistical Learning: with Applications in R (Springer, 2013).

Johnson, B. A., Scheyvens, H. & Shivakoti, B. R. An ensemble pansharpening approach for finer-scale mapping of sugarcane with Landsat 8 imagery. International Journal of Applied Earth Observation and Geoinformation 33, 218–225. ISSN: 1569-8432. https://www.sciencedirect.com/ science/article/pii/S0303243414001366 (2014).

Johnson, R. & Wichern, D. Applied multivariate statistical analysis (Prentice hall Upper Saddle River, NJ, 2002).

Karp, S. G., Medina, J. D. C., Letti, L. A. J.,Woiciechowski, A. L., de Carvalho, J. C., Schmitt, C. C., de Oliveira Penha, R., Kumlehn, G. S. & Soccol, C. R. Bioeconomy and biofuels: the case of sugarcane ethanol in Brazil. Biofuels, Bioproducts and Biorefining 15, 899–912. eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/bbb.2195. https://onlinelibrary.wiley.com/ doi/abs/10.1002/bbb.2195 (2021).

Lantz, B. Machine learning with R: expert techniques for predictive modeling (Packt publishing ltd, 2019).

Lu, M., Bi, Y., Xue, B., Hu, Q., Zhang, M.,Wei, Y., Yang, P. &Wu, W. Genetic Programming for High-Level Feature Learning in Crop Classification. Remote Sensing 14, 16 (2022).

Luciano, A., Picoli, M., Rocha, J., Franco, H., Sanches, G., Leal, M. & leMaire, G. Generalized space-time classifiers for monitoring sugarcane areas in Brazil. Remote Sensing of Environment 215 (July 2018).

Luciano, A. C. d. S., Campagnuci, B. C. G. & le Maire, G. Mapping 33 years of sugarcane evolution in São Paulo state, Brazil, using landsat imagery and generalized space-time classifiers. Remote Sensing Applications: Society and Environment 26, 100749. ISSN: 2352-9385. https://www.sciencedirect.com/science/article/pii/S235293852200057X (2022).

Luciano, A. C. d. S., Picoli, M. C. A., Rocha, J. V., Duft, D. G., Lamparelli, R. A. C., Leal, M. R. L. V. & Le Maire, G. A generalized space-time OBIA classification scheme to map sugarcane areas at regional scale, using Landsat images time-series and the random forest algorithm. International Journal of Applied Earth Observation and Geoinformation 80, 127–136. ISSN: 1569-8432. https://www.sciencedirect.com/science/article/pii/S0303243418311917 (2019).

McCullagh, P. & Nelder, J. Generalized Linear Models, Second Edition (Chapman & Hall, 1989).

McFEETERS, S. K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. International Journal of Remote Sensing 17, 1425–1432. https://doi.org/10.1080/01431169608948714 (1996).

Morettin, P. A. & Singer, J. d. M. Estatística e ciência de dados 454 pp. (2022).

Rouse, J., Hass, R., Schell, J. & Deering, D. Monitoring vegetation systems in the great plains with ERTS. Third Earth Resour. Technol. Satell. Symp. 1, 309–317. https://cir.nii.ac.jp/crid/1573105975083785728 (1973).

Rudorff, B. F. T., Aguiar, D. A., Silva, W. F., Sugawara, L. M., Adami, M. & Moreira, M. A. Studies on the Rapid Expansion of Sugarcane for Ethanol Production in São Paulo State (Brazil) Using Landsat Data. Remote Sensing 2, 1057–1076. ISSN: 2072-4292. https://www.mdpi.com/2072-4292/2/4/1057 (2010).

Silva, C. B., de Moraes, M. A. F. D. & Molin, J. P. Adoption and use of precision agriculture technologies in the sugarcane industry of São Paulo state, Brazil. Precision Agriculture 12, 67–81 (2011).

Wang, M., Liu, Z., Ali Baig, M. H., Wang, Y., Li, Y. & Chen, Y. Mapping sugarcane in complex landscapes by integrating multi-temporal Sentinel-2 images and machine learning algorithms. Land Use Policy 88, 104190. ISSN: 0264-8377. https://www.sciencedirect.com/science/article/pii/S0264837719307185 (2019).

Wilson, E. H. & Sader, S. A. Detection of forest harvest type using multiple dates of Landsat TM imagery. Remote Sensing of Environment 80, 385–396. ISSN: 0034-4257. https://www.sciencedirect.com/science/article/pii/S0034425701003182 (2002).

Zhou, Z., Huang, J.,Wang, J., Zhang, K., Kuang, Z., Zhong, S. & Song, X. Object-oriented classification of sugarcane using time-series middle-resolution remote sensing data based on AdaBoost. PloS one 10, e0142069 (2015).