Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models

This study investigates the application of various machine learning models for land use and land cover (LULC) classification in the Kerch Peninsula. The study utilizes archival field data, cadastral data, and published scientific literature for model training and testing, using Landsat-5 imagery fro...

Full description

Bibliographic Details
Main Authors: Denis Krivoguz, Sergei G. Chernyi, Elena Zinchenko, Artem Silkin, Anton Zinchenko
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/8/9/138
_version_ 1797580606226300928
author Denis Krivoguz
Sergei G. Chernyi
Elena Zinchenko
Artem Silkin
Anton Zinchenko
author_facet Denis Krivoguz
Sergei G. Chernyi
Elena Zinchenko
Artem Silkin
Anton Zinchenko
author_sort Denis Krivoguz
collection DOAJ
description This study investigates the application of various machine learning models for land use and land cover (LULC) classification in the Kerch Peninsula. The study utilizes archival field data, cadastral data, and published scientific literature for model training and testing, using Landsat-5 imagery from 1990 as input data. Four machine learning models (deep neural network, Random Forest, support vector machine (SVM), and AdaBoost) are employed, and their hyperparameters are tuned using random search and grid search. Model performance is evaluated through cross-validation and confusion matrices. The deep neural network achieves the highest accuracy (96.2%) and performs well in classifying water, urban lands, open soils, and high vegetation. However, it faces challenges in classifying grasslands, bare lands, and agricultural areas. The Random Forest model achieves an accuracy of 90.5% but struggles with differentiating high vegetation from agricultural lands. The SVM model achieves an accuracy of 86.1%, while the AdaBoost model performs the lowest with an accuracy of 58.4%. The novel contributions of this study include the comparison and evaluation of multiple machine learning models for land use classification in the Kerch Peninsula. The deep neural network and Random Forest models outperform SVM and AdaBoost in terms of accuracy. However, the use of limited data sources such as cadastral data and scientific articles may introduce limitations and potential errors. Future research should consider incorporating field studies and additional data sources for improved accuracy. This study provides valuable insights for land use classification, facilitating the assessment and management of natural resources in the Kerch Peninsula. The findings contribute to informed decision-making processes and lay the groundwork for further research in the field.
first_indexed 2024-03-10T22:53:14Z
format Article
id doaj.art-671720bac7a04c2197c70533d1d74fb1
institution Directory Open Access Journal
issn 2306-5729
language English
last_indexed 2024-03-10T22:53:14Z
publishDate 2023-08-01
publisher MDPI AG
record_format Article
series Data
spelling doaj.art-671720bac7a04c2197c70533d1d74fb12023-11-19T10:11:39ZengMDPI AGData2306-57292023-08-018913810.3390/data8090138Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning ModelsDenis Krivoguz0Sergei G. Chernyi1Elena Zinchenko2Artem Silkin3Anton Zinchenko4Department of the “Oceanology”, Southern Federal University, 340015 Rostov-on-Don, RussiaDepartment of Cyber-Physical Systems, St. Petersburg State Marine Technical University, Leninsky Prospect, 101, 198262 St. Petersburg, RussiaDepartment of Cyber-Physical Systems, St. Petersburg State Marine Technical University, Leninsky Prospect, 101, 198262 St. Petersburg, RussiaDepartment of Cyber-Physical Systems, St. Petersburg State Marine Technical University, Leninsky Prospect, 101, 198262 St. Petersburg, RussiaDepartment of Cyber-Physical Systems, St. Petersburg State Marine Technical University, Leninsky Prospect, 101, 198262 St. Petersburg, RussiaThis study investigates the application of various machine learning models for land use and land cover (LULC) classification in the Kerch Peninsula. The study utilizes archival field data, cadastral data, and published scientific literature for model training and testing, using Landsat-5 imagery from 1990 as input data. Four machine learning models (deep neural network, Random Forest, support vector machine (SVM), and AdaBoost) are employed, and their hyperparameters are tuned using random search and grid search. Model performance is evaluated through cross-validation and confusion matrices. The deep neural network achieves the highest accuracy (96.2%) and performs well in classifying water, urban lands, open soils, and high vegetation. However, it faces challenges in classifying grasslands, bare lands, and agricultural areas. The Random Forest model achieves an accuracy of 90.5% but struggles with differentiating high vegetation from agricultural lands. The SVM model achieves an accuracy of 86.1%, while the AdaBoost model performs the lowest with an accuracy of 58.4%. The novel contributions of this study include the comparison and evaluation of multiple machine learning models for land use classification in the Kerch Peninsula. The deep neural network and Random Forest models outperform SVM and AdaBoost in terms of accuracy. However, the use of limited data sources such as cadastral data and scientific articles may introduce limitations and potential errors. Future research should consider incorporating field studies and additional data sources for improved accuracy. This study provides valuable insights for land use classification, facilitating the assessment and management of natural resources in the Kerch Peninsula. The findings contribute to informed decision-making processes and lay the groundwork for further research in the field.https://www.mdpi.com/2306-5729/8/9/138machine learningLULCLandsatclassification
spellingShingle Denis Krivoguz
Sergei G. Chernyi
Elena Zinchenko
Artem Silkin
Anton Zinchenko
Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models
Data
machine learning
LULC
Landsat
classification
title Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models
title_full Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models
title_fullStr Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models
title_full_unstemmed Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models
title_short Using Landsat-5 for Accurate Historical LULC Classification: A Comparison of Machine Learning Models
title_sort using landsat 5 for accurate historical lulc classification a comparison of machine learning models
topic machine learning
LULC
Landsat
classification
url https://www.mdpi.com/2306-5729/8/9/138
work_keys_str_mv AT deniskrivoguz usinglandsat5foraccuratehistoricallulcclassificationacomparisonofmachinelearningmodels
AT sergeigchernyi usinglandsat5foraccuratehistoricallulcclassificationacomparisonofmachinelearningmodels
AT elenazinchenko usinglandsat5foraccuratehistoricallulcclassificationacomparisonofmachinelearningmodels
AT artemsilkin usinglandsat5foraccuratehistoricallulcclassificationacomparisonofmachinelearningmodels
AT antonzinchenko usinglandsat5foraccuratehistoricallulcclassificationacomparisonofmachinelearningmodels