Journal of Researches in Mechanics of Agricultural Machinery

Journal of Researches in Mechanics of Agricultural Machinery

Comparative Analysis of Machine Learning Algorithms for Agricultural Land Classification in Masal Using Sentinel-2

Document Type : Original Article

Authors
Department of Biosystems Engineering, Faculty of Agricultural Sciences, University of Guilan, Rasht, Iran
Abstract
Abstract
Changes in agricultural land use driven by human or natural factors and their monitoring constitute one of the most important challenges in the northern provinces of the country. Different machine learning algorithms exhibit varying performance in regions with specific spatial characteristics. The aim of the present study was to use Sentinel-2 satellite imagery to classify land use in Masal County and to compare the maximum likelihood, minimum distance, Mahalanobis distance, and support vector machine classifiers with linear, polynomial, sigmoid, and radial basis function kernels.
To provide training and testing data for the algorithms, in addition to ground truth points collected using a global positioning system (GPS) and Google Earth maps, spectral indices—including the normalized difference vegetation index (NDVI), the normalized difference built-up index (NDBI), and the normalized difference water index (NDWI)—were used to more accurately determine ground truth information. Model validation was performed using a confusion matrix and by calculating the kappa coefficient and overall accuracy. The results showed that the maximum likelihood algorithm achieved the most accurate classification performance, with an overall accuracy of 97.29% and a kappa coefficient of 0.96, while the support vector machine with a linear kernel ranked second, with kappa coefficient and overall accuracy values of 0.89 and 92.46%, respectively. The weakest performance was observed for the minimum distance algorithm, with an overall accuracy of 72.97% and a kappa coefficient of 0.66. Based on the results of this study, the maximum likelihood classifier provided superior performance in discriminating land use classes in Masal County. The findings of this research are of significant importance for policymakers and provincial managers in increasing awareness of changes in agricultural and forest land use in the region.

EXTENDED ABSTRACT
Introduction 
Land use and land cover change, driven by either anthropogenic activities or natural dynamics, is a critical environmental challenge in many regions of the world, particularly in ecologically sensitive and agriculturally significant areas such as the northern provinces of Iran. Monitoring these changes is essential for sustainable resource management, environmental protection, and regional planning. Masal County, with its diverse landscape including forested areas, agricultural lands, and urban settlements, represents a case where accurate and timely land use classification is vital for informed decision-making. Remote sensing technologies, particularly multispectral satellite data like Sentinel-2, have emerged as powerful tools for capturing spatial and temporal variations in land use. The application of machine learning algorithms has further enhanced the potential for precise classification; however, the performance of these algorithms can vary significantly depending on the spatial characteristics of the study area. Thus, selecting an appropriate classification method is crucial for reliable land cover assessment. This study aims to evaluate and compare the performance of four common classification algorithms, Maximum Likelihood, Minimum Distance, Mahalanobis Distance, and Support Vector Machine (SVM) with various kernel functions (linear, polynomial, sigmoid, and radial basis function), for classifying Land use and land cover in Masal County using Sentinel-2 imagery. 
Material and Methods 
The methodology of this study involved the acquisition and processing of Sentinel-2 satellite imagery covering Masal County. Ground truth data were established through a combination of GPS-based field surveys and interpretation of high-resolution Google Earth imagery. To enhance the accuracy of land use classification, several spectral indices were calculated, including the Normalized Difference Vegetation Index (NDVI), Normalized Difference Built-up Index (NDBI), and Modified Normalized Difference Water Index (MNDWI). These indices helped in distinguishing between vegetation, built-up areas, and water bodies, respectively. Four classification algorithms were tested: Maximum Likelihood, Minimum Distance, Mahalanobis Distance, and SVM with linear, polynomial, sigmoid, and radial basis function (RBF) kernels. Model validation was conducted using confusion matrices, with overall accuracy and Kappa coefficient serving as the primary evaluation metrics.
Results and Discussion 
The comparative evaluation of the four classification algorithms Maximum Likelihood, Minimum Distance, Mahalanobis Distance, and Support Vector Machine (SVM) with various kernels revealed substantial differences in their ability to accurately classify land use types in Masal County using Sentinel-2 imagery and ancillary spectral indices. Among the tested methods, the Maximum Likelihood classifier demonstrated the highest performance, achieving an overall accuracy of 97.29% and a Kappa coefficient of 0.96. This high accuracy indicates the algorithm's strong capability in modeling the statistical distribution of pixels for each land use class, particularly when sufficient and representative training data are available. The confusion matrix for this classifier revealed minimal misclassification between major land cover types, including agricultural lands, forests, built-up areas, and water bodies. The high values for both overall accuracy and Kappa coefficient suggest that the Maximum Likelihood approach is robust for the heterogeneous landscape of Masal County. The SVM classifiers, especially with a linear kernel, also performed well, with an overall accuracy of 92.46% and a Kappa coefficient of 0.89. This suggests that the land cover classes in the study area are largely linearly separable in the feature space constructed from the spectral bands and indices. The SVM with other kernels polynomial, sigmoid, and radial basis function (RBF) showed slightly lower accuracies, indicating that increased model complexity did not necessarily translate to better discrimination in this particular dataset. The linear kernel's strong performance highlights its suitability for remote sensing applications where class boundaries are relatively straightforward. The Mahalanobis Distance classifier achieved moderate results, with overall accuracy and Kappa values lower than those of Maximum Likelihood and SVM, but higher than those of Minimum Distance. This method benefited from considering the covariance among spectral features, which helped in distinguishing classes with overlapping spectral signatures. However, its performance was limited in areas where class distributions were not well separated. The Minimum Distance classifier exhibited the weakest performance, with an overall accuracy of 72.97% and a Kappa coefficient of 0.66. Its reliance on simple Euclidean distances made it less effective in capturing the spectral complexity of the region, leading to higher rates of misclassification, particularly between agricultural and urban classes. Incorporating spectral indices such as NDVI, NDBI, and MNDWI significantly improved the classification results for all algorithms. These indices enhanced the separability of vegetation, built-up, and water classes, reducing confusion and increasing the reliability of the classified maps. 
Conclusions 
The findings of this study underscore the crucial role of algorithm selection in remote sensing-based land-use classification. Among the tested methods, the Maximum Likelihood classifier consistently delivered the highest accuracy and reliability for distinguishing land use types in Masal County, followed closely by the SVM with a linear kernel. The use of spectral indices further enhanced the classification results, emphasizing the value of integrating ancillary data for improved land cover discrimination. These outcomes offer practical guidance for land managers, policymakers, and researchers seeking to monitor and manage land-use changes in northern Iran and similar regions. Accurate land use maps are essential for tracking agricultural expansion, forest degradation, and urban growth, all of which have significant implications for environmental sustainability and regional planning. Future research should explore the integration of additional data sources and advanced machine learning techniques to refine classification accuracy further and support dynamic land use monitoring.
Author Contributions
Momeni Masouleh Seyed Abdollah: Data collection, Software, Visualization, Original Draft
Rahimi-Ajdadi Fatemeh: Conceptualization, Methodology, Validation, Writing - Review & Editing, Supervision, Project administration
Data Availability Statement
This section details where supporting data can be found, typically including links to publicly archived datasets. If no data is reported, "Not applicable" should be stated.
Acknowledgements
The authors would like to appreciate the support of the University of Guilan.
Ethical Considerations
This section states ethical approval details (e.g., Ethics Committee, ethical code) and confirms adherence to ethical standards, including avoidance of data fabrication, falsification, plagiarism, and misconduct.
 
 
Keywords

Subjects


Akbarinasab, M., Abbasi, F., & Talebpour, N. (2023). Detection of Babolroud River plume using Sentinel-2 satellite image case study: flood in 2018. Journal of Hydrogeomorphology, 10(35), 15-11.  https://doi.org/10.22034/hyd.2023.54406.1664
Amin, G., Imtiaz, I., Haroon, E., Saqib, N. u., Shahzad, M. I., and Nazeer, M. (2024). Assessment of machine learning algorithms for land cover classification in a complex mountainous landscape. Journal of Geovisualization and Spatial Analysis 8, 34, 1-19. https://doi.org/10.1007/s41651-024-00195-z
Asghari, S., jalilyan, R., pirozineghad, N., madadi, A. & yadeghari, M. (2020). Evaluation of water extraction indices using Landsat satellite images (Case Study: Gamasiab River of Kermanshah). Geographical Sciences Journal, 20(58), 53-70. https://doi.org/10.29252/jgs.20.58.53
Asadi, F., Mirakhorlo, K. & Mostafa, M. (2021). Estimation of poplar cultivation area in Mazandaran province using Santinel 2 satellite images. Ecology of Iranian Forest, 9(18), 54-62. https://doi.org/10.52547/ifej.9.18.54
Atzberger, C. 2013. Advances in remote sensing of agriculture: Context description, existing operational monitoring systems and major information needs. Remote sensing, 5, 949-981. https://doi.org/10.3390/rs5020949
Bruzzone, L., & Demir, B. (2014). A review of modern approaches to classification of remote sensing data. Land Use and Land Cover Mapping in Europe: Practices & Trends, 127-143. https://doi.org/10.1007/978-94-007-7969-3_9
Chen, Y. & Gong, P. (2013). Clustering based on eigenspace transformation–CBEST for efficient classification. ISPRS journal of photogrammetry and remote sensing, 83, 64-80. https://doi.org/10.1016/j.isprsjprs.2013.06.003
Davodpour, R., & Toranjzar, H. (2020). Preparation of land use map using ETM+ Landsat (a case study in Hendodar Watershed). Journal of Environmental Science and Technology, 22, 379-389. (In Persian).
Feyzollah Pour, M. (2024). Detection of changes in the water area of Miqan Lagoon using spectral indices NDWI, MNDWI, AWEI, and supervised SVM models during the period from 1994 to 2022. Geographical Studies of Arid Regions, 14(54), 104-119. (In Persian). https://doi.org/10.22034/jargs.2023.404501.1045
Ghorbani, A., Aslami, F., Ahmadabadi, S., & Ghaffari, S. (2016). Land use mapping of Kaftareh Watershed of Ardabil using visual and digital processing of ETM+ image. Natural Ecosystems of Iran, 6(4), 27-43. (In Persian).
Ghosh, M. K., Kumar, L., & Roy, C. (2015). Monitoring the coastline change of Hatiya Island in Bangladesh using remote sensing techniques. ISPRS Journal of Photogrammetry and Remote Sensing, 101, 137-144.
Golabkesh, F., Nazarpour, A., Ghanavati, N. & Babaeinejad, T. (2022). Application of supervised classification algorithms in long-term survey of surface soil salinity by using remote sensing: a case study of Atabiyeh Plain, Khuzestan Province. JWSS-Isfahan University of Technology, 26(2), 203-221. https://doi.org/10.47176/jwss.26.2.42982
Gong, W., Yuan, L., Fan, W., Wang, X., & Stott, P. (2016). Comparison to supervised classification modelling in land use cover using Landsat 8 OLI data: an example in Miyun county of North China. Nature Environment and Pollution Technology, 15(1), 243.
Lalehzari E, Esmaeily A, Homayouni S. (2018). Development and evaluation of a noise reduction algorithm for improvement of hypserspectral image classification. JGST, 8(1), 195-207. (In Persian).
Mesri, A., Rahimi-Ajdadi, F., & Bagheri, I. (2024). Zoning land surface roughness for wind turbine installation using satellite remote sensing: a case study of kiashahr county. Iranian Journal of Biosystem Engineering, 55 (1), 93-112. (In Persian). https://doi.org/10.22059/ijbse.2024.380477.665560
Mesri, A., Rahimi-Ajdadi, F., & Bagheri, I. (2022). Monitoring Rice Land Changes Using GIS and RS. Journal of Agricultural Machinery, 12 (4): 515-527. (In Persian).  https://doi.org/10.22067/jam.2021.69174.1025
Mola Aghajanzadeh, S., Soleymani, K., Habibnejad, M., Kavyan, A., & Rahmani, M. (2021). Application of remote sensing in assessing land use changes in Haraz Watershed. Geographical Researches, 36(3), 275-284. (In Persian). https://doi.org/10.29252/geores.36.3.275
Mahvash Mohammadi, N., & Hezarkhani, A. (2020). Separating alteration units in the Takht-e-Gonbad district using via comparing two classification methods of Support vector machine and maximum likelihood. Iranian Journal of Geology, 53(53), 31-43. (In Persian).
Rahimi-Ajdadi, F., & Khani, M. (2022). Multi-temporal detection of agricultural land losses using remote sensing and gis techniques, Shanderman, Iran. Acta Technologica Agriculturae, 25(2), 67-72.  https://doi.org/10.2478/ata-2022-0011
Rahnama, S., Maharlooei, M., Rostami, M. & Maghsoudi, H. (2019). Determining the best classification algorithm in order to estimate the area under date palm cultivation using LANDSAT 8 satellite imagery. Journal of Agricultural Machinery, 9(2), 321-335. (In Persian). https://doi.org/10.22067/jam.v9i2.67310
Rouse, J. W., Haas, R. H., Schell, J. A., & Deering, D. W. (1974). Monitoring vegetation systems in the Great Plains with ERTS. NASA Spec. Publ, 351(1), 309.
Ramesh, B., & Krishnan, N. (1991). Application of remote sensing for analysis of urban fringe dynamics—case study of Jaipur, India. Proceeding Asian Conference on Remote Sensing (ACRS).
Shao, Y., & Lunetta, R. S. (2012). Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points. ISPRS Journal of Photogrammetry and Remote Sensing, 70, 78-87. https://doi.org/10.1016/j.isprsjprs.2012.04.001
Weiss, M., Jacob, F., & Duveiller, G. (2020). Remote sensing for agricultural applications: A meta-review. Remote sensing of environment, 236, 111402. https://doi.org/10.1016/j.rse.2019.111402
Yasin, M. Y., Abdullah, J., Noor, N. M., Yusoff, M. M., & Noor, N. M. (2022, October). Landsat observation of urban growth and land use change using NDVI and NDBI analysis. In IOP Conference Series: Earth and Environmental Science, 1067(1), 012037. https://doi.org/10.1088/1755-1315/1067/1/012037
Zare Khormizie, H. , Ghafarian Malamiri, H. R. , & Mortaz, M. (2020). Evaluation of supervised classification capability of Landsat-8 and Sentinel-2A satellite images in determining type and area of pistachio cultivars. Journal of RS and GIS for Natural Resources, 11(38) 84-103. (In Persian). https://doi.org/10.30495/girs.2020.672378