The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis

This empirical investigation delves into the influence of machine learning (ML) algorithms in the realm of cross-project defect prediction, employing the AEEEEM dataset as a foundation. The primary objective is to discern the nuanced influences of various algorithms on predictive performance, with a...

Full description

Bibliographic Details
Main Authors: Bala, Yahaya Zakariyau, Samat, Pathiah Abdul, Sharif, Khaironi Yatim, Manshor, Noridayu
Format: Article
Language:English
Published: Universitas Ahmad Dahlan 2024
Online Access:http://psasir.upm.edu.my/id/eprint/113069/1/113069.pdf
_version_ 1825940247239720960
author Bala, Yahaya Zakariyau
Samat, Pathiah Abdul
Sharif, Khaironi Yatim
Manshor, Noridayu
author_facet Bala, Yahaya Zakariyau
Samat, Pathiah Abdul
Sharif, Khaironi Yatim
Manshor, Noridayu
author_sort Bala, Yahaya Zakariyau
collection UPM
description This empirical investigation delves into the influence of machine learning (ML) algorithms in the realm of cross-project defect prediction, employing the AEEEEM dataset as a foundation. The primary objective is to discern the nuanced influences of various algorithms on predictive performance, with a specific focus on the F1 score metric as evaluation criterion. Four ML algorithms have been carefully assessed in this study: random forest (RF), support vector machines (SVM), k-nearest neighbors (KNN), and logistic regression (LR). The choice of these algorithms reflects their prevalence in software defect prediction literature and their diversity. Through rigorous experimentation and analysis, the investigation unveils compelling evidence affirming the superiority of RF over its counterparts. The F1 score utilized as evaluation metric, capturing the delicate balance between precision and recall, essential in defect prediction scenarios. The nuanced examination of algorithmic efficacy provides practical insights for developers and practitioners navigating the challenges of cross-project defect prediction. By leveraging the rich and diverse AEEEEM dataset, this study ensures a comprehensive exploration of algorithmic influences across varied software projects. The findings not only contribute to the academic discourse on defect prediction but also offer practical guidance for real-world application, emphasizing the pivotal role of RF as a tool in enhancing predictive accuracy and reliability.
first_indexed 2024-12-09T02:25:07Z
format Article
id upm.eprints-113069
institution Universiti Putra Malaysia
language English
last_indexed 2024-12-09T02:25:07Z
publishDate 2024
publisher Universitas Ahmad Dahlan
record_format dspace
spelling upm.eprints-1130692024-11-15T06:59:25Z http://psasir.upm.edu.my/id/eprint/113069/ The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis Bala, Yahaya Zakariyau Samat, Pathiah Abdul Sharif, Khaironi Yatim Manshor, Noridayu This empirical investigation delves into the influence of machine learning (ML) algorithms in the realm of cross-project defect prediction, employing the AEEEEM dataset as a foundation. The primary objective is to discern the nuanced influences of various algorithms on predictive performance, with a specific focus on the F1 score metric as evaluation criterion. Four ML algorithms have been carefully assessed in this study: random forest (RF), support vector machines (SVM), k-nearest neighbors (KNN), and logistic regression (LR). The choice of these algorithms reflects their prevalence in software defect prediction literature and their diversity. Through rigorous experimentation and analysis, the investigation unveils compelling evidence affirming the superiority of RF over its counterparts. The F1 score utilized as evaluation metric, capturing the delicate balance between precision and recall, essential in defect prediction scenarios. The nuanced examination of algorithmic efficacy provides practical insights for developers and practitioners navigating the challenges of cross-project defect prediction. By leveraging the rich and diverse AEEEEM dataset, this study ensures a comprehensive exploration of algorithmic influences across varied software projects. The findings not only contribute to the academic discourse on defect prediction but also offer practical guidance for real-world application, emphasizing the pivotal role of RF as a tool in enhancing predictive accuracy and reliability. Universitas Ahmad Dahlan 2024 Article PeerReviewed text en cc_by_sa_4 http://psasir.upm.edu.my/id/eprint/113069/1/113069.pdf Bala, Yahaya Zakariyau and Samat, Pathiah Abdul and Sharif, Khaironi Yatim and Manshor, Noridayu (2024) The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis. Telecommunication Computing Electronics and Control, 22 (4). pp. 830-837. ISSN 1693-6930; eISSN: 2302-9293 https://telkomnika.uad.ac.id/index.php/TELKOMNIKA/article/view/25916 10.12928/TELKOMNIKA.v22i4.25916
spellingShingle Bala, Yahaya Zakariyau
Samat, Pathiah Abdul
Sharif, Khaironi Yatim
Manshor, Noridayu
The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis
title The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis
title_full The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis
title_fullStr The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis
title_full_unstemmed The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis
title_short The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis
title_sort influence of machine learning on the predictive performance of cross project defect prediction empirical analysis
url http://psasir.upm.edu.my/id/eprint/113069/1/113069.pdf
work_keys_str_mv AT balayahayazakariyau theinfluenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT samatpathiahabdul theinfluenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT sharifkhaironiyatim theinfluenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT manshornoridayu theinfluenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT balayahayazakariyau influenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT samatpathiahabdul influenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT sharifkhaironiyatim influenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis
AT manshornoridayu influenceofmachinelearningonthepredictiveperformanceofcrossprojectdefectpredictionempiricalanalysis