Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction

The maritime is facing a gradual proliferation of data, which is frequently coupled with the presence of subpar information that contains missing and duplicate data, erroneous records, and flawed entries as a result of human intervention or a lack of access to sensitive and important collaborative i...

Full description

Bibliographic Details
Main Authors: Kyriakos Skarlatos, Grigorios Papageorgiou, Panagiotis Biris, Ekaterini Skamnia, Polychronis Economou, Sotirios Bersimis
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Journal of Marine Science and Engineering
Subjects:
Online Access:https://www.mdpi.com/2077-1312/12/1/97
_version_ 1797343267975593984
author Kyriakos Skarlatos
Grigorios Papageorgiou
Panagiotis Biris
Ekaterini Skamnia
Polychronis Economou
Sotirios Bersimis
author_facet Kyriakos Skarlatos
Grigorios Papageorgiou
Panagiotis Biris
Ekaterini Skamnia
Polychronis Economou
Sotirios Bersimis
author_sort Kyriakos Skarlatos
collection DOAJ
description The maritime is facing a gradual proliferation of data, which is frequently coupled with the presence of subpar information that contains missing and duplicate data, erroneous records, and flawed entries as a result of human intervention or a lack of access to sensitive and important collaborative information. Data limitations and restrictions have a crucial impact on inefficient data-driven decisions, leading to decreased productivity, augmented operating expenses, and the consequent substantial decline in a competitive edge. The missing or inadequate presentation of significant information, such as the vessel’s primary engine model, critically affects its capabilities and operating expenses as well as its environmental impact. In this study, a comprehensive study was employed, using and comparing several machine learning classification techniques to classify a ship’s main engine model, along with different imputation methods for handling the missing values and dimensionality reduction methods. The classification is based on the technical and operational characteristics of the vessel, including the physical dimensions, various capacities, speeds and consumption. Briefly, three dimensionality reduction methods (Principal Component Analysis, Uniform Manifold Approximation and Projection, and t-Distributed Stochastic Neighbor Embedding) were considered and combined with a variety of classifiers and the appropriate parameters of the dimensionality reduction methods. According to the classification results, the ExtraTreeClassifier with PCA with 4 components, the ExtraTreeClassifier with t-SNE with perplexity equal to 10 and 3 components, and the same classifier with UMAP with 10 neighbors and 3 components outperformed the rest of the combinations. This classification could provide significant information for shipowners to enhance the vessel’s operation by optimizing it.
first_indexed 2024-03-08T10:46:19Z
format Article
id doaj.art-720d760b9a254b199c2e55cee1564b19
institution Directory Open Access Journal
issn 2077-1312
language English
last_indexed 2024-03-08T10:46:19Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Journal of Marine Science and Engineering
spelling doaj.art-720d760b9a254b199c2e55cee1564b192024-01-26T17:15:48ZengMDPI AGJournal of Marine Science and Engineering2077-13122024-01-011219710.3390/jmse12010097Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality ReductionKyriakos Skarlatos0Grigorios Papageorgiou1Panagiotis Biris2Ekaterini Skamnia3Polychronis Economou4Sotirios Bersimis5Department of Business Administration, University of Piraeus, 18534 Piraeus, GreeceDepartment of Civil Engineering, University of Patras, 26504 Patras, GreeceDepartment of Civil Engineering, University of Patras, 26504 Patras, GreeceDepartment of Civil Engineering, University of Patras, 26504 Patras, GreeceDepartment of Civil Engineering, University of Patras, 26504 Patras, GreeceDepartment of Business Administration, University of Piraeus, 18534 Piraeus, GreeceThe maritime is facing a gradual proliferation of data, which is frequently coupled with the presence of subpar information that contains missing and duplicate data, erroneous records, and flawed entries as a result of human intervention or a lack of access to sensitive and important collaborative information. Data limitations and restrictions have a crucial impact on inefficient data-driven decisions, leading to decreased productivity, augmented operating expenses, and the consequent substantial decline in a competitive edge. The missing or inadequate presentation of significant information, such as the vessel’s primary engine model, critically affects its capabilities and operating expenses as well as its environmental impact. In this study, a comprehensive study was employed, using and comparing several machine learning classification techniques to classify a ship’s main engine model, along with different imputation methods for handling the missing values and dimensionality reduction methods. The classification is based on the technical and operational characteristics of the vessel, including the physical dimensions, various capacities, speeds and consumption. Briefly, three dimensionality reduction methods (Principal Component Analysis, Uniform Manifold Approximation and Projection, and t-Distributed Stochastic Neighbor Embedding) were considered and combined with a variety of classifiers and the appropriate parameters of the dimensionality reduction methods. According to the classification results, the ExtraTreeClassifier with PCA with 4 components, the ExtraTreeClassifier with t-SNE with perplexity equal to 10 and 3 components, and the same classifier with UMAP with 10 neighbors and 3 components outperformed the rest of the combinations. This classification could provide significant information for shipowners to enhance the vessel’s operation by optimizing it.https://www.mdpi.com/2077-1312/12/1/97machine learning in shippingdimensionality reductionsupervised learningmodel comparison and selectionship engine classification
spellingShingle Kyriakos Skarlatos
Grigorios Papageorgiou
Panagiotis Biris
Ekaterini Skamnia
Polychronis Economou
Sotirios Bersimis
Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
Journal of Marine Science and Engineering
machine learning in shipping
dimensionality reduction
supervised learning
model comparison and selection
ship engine classification
title Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
title_full Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
title_fullStr Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
title_full_unstemmed Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
title_short Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
title_sort ship engine model selection by applying machine learning classification techniques using imputation and dimensionality reduction
topic machine learning in shipping
dimensionality reduction
supervised learning
model comparison and selection
ship engine classification
url https://www.mdpi.com/2077-1312/12/1/97
work_keys_str_mv AT kyriakosskarlatos shipenginemodelselectionbyapplyingmachinelearningclassificationtechniquesusingimputationanddimensionalityreduction
AT grigoriospapageorgiou shipenginemodelselectionbyapplyingmachinelearningclassificationtechniquesusingimputationanddimensionalityreduction
AT panagiotisbiris shipenginemodelselectionbyapplyingmachinelearningclassificationtechniquesusingimputationanddimensionalityreduction
AT ekateriniskamnia shipenginemodelselectionbyapplyingmachinelearningclassificationtechniquesusingimputationanddimensionalityreduction
AT polychroniseconomou shipenginemodelselectionbyapplyingmachinelearningclassificationtechniquesusingimputationanddimensionalityreduction
AT sotiriosbersimis shipenginemodelselectionbyapplyingmachinelearningclassificationtechniquesusingimputationanddimensionalityreduction