Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization

Spectrum Based Fault Localization (SBFL) uses different metrics called risk evaluation formula to guide and pinpoint faults in debugging process. The accuracy of a specific SBFL method may be limited by the used formulae and program spectra. However, it has been demonstrated recently that Genetic Pr...

Full description

Bibliographic Details
Main Authors: Adekunle Akinjobi Ajibode, Ting Shu, Zuohua Ding
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9247185/
_version_ 1818616823673782272
author Adekunle Akinjobi Ajibode
Ting Shu
Zuohua Ding
author_facet Adekunle Akinjobi Ajibode
Ting Shu
Zuohua Ding
author_sort Adekunle Akinjobi Ajibode
collection DOAJ
description Spectrum Based Fault Localization (SBFL) uses different metrics called risk evaluation formula to guide and pinpoint faults in debugging process. The accuracy of a specific SBFL method may be limited by the used formulae and program spectra. However, it has been demonstrated recently that Genetic Programming could be used to automatically design formulae directly from the program spectra. Therefore, this article presents Genetic Programming approach for proposing risk evaluation formula with the inclusion of radicals to evolve suspiciousness metric directly from the program spectra. 92 faults from Unix utilities of SIR repository and 357 real faults from Defect4J repository were used. The approach combines these data sets, used 25% of the total faults (113) to evolve the formulae and the remaining 75% (336) to validate the effectiveness of the metrics generated by our approach. The proposed approach then uses Genetic Programming to run 30 evolution to produce different 30 metrics. The GP-generated metrics consistently out-performed all the classic formulae in both single and multiple faults, especially OP2 on average of 2.25% in single faults and 3.42% in multiple faults. The experiment results conclude that the combination of Hybrid data set and radical is a good technique to evolve effective formulae for spectra-based fault localization.
first_indexed 2024-12-16T16:55:56Z
format Article
id doaj.art-146d6f04ba6a401d90634f3cd80e49ca
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-16T16:55:56Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-146d6f04ba6a401d90634f3cd80e49ca2022-12-21T22:23:53ZengIEEEIEEE Access2169-35362020-01-01819845119846710.1109/ACCESS.2020.30354139247185Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault LocalizationAdekunle Akinjobi Ajibode0https://orcid.org/0000-0002-9212-4566Ting Shu1https://orcid.org/0000-0002-5222-7608Zuohua Ding2https://orcid.org/0000-0002-9671-7836School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou, ChinaSchool of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou, ChinaSchool of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou, ChinaSpectrum Based Fault Localization (SBFL) uses different metrics called risk evaluation formula to guide and pinpoint faults in debugging process. The accuracy of a specific SBFL method may be limited by the used formulae and program spectra. However, it has been demonstrated recently that Genetic Programming could be used to automatically design formulae directly from the program spectra. Therefore, this article presents Genetic Programming approach for proposing risk evaluation formula with the inclusion of radicals to evolve suspiciousness metric directly from the program spectra. 92 faults from Unix utilities of SIR repository and 357 real faults from Defect4J repository were used. The approach combines these data sets, used 25% of the total faults (113) to evolve the formulae and the remaining 75% (336) to validate the effectiveness of the metrics generated by our approach. The proposed approach then uses Genetic Programming to run 30 evolution to produce different 30 metrics. The GP-generated metrics consistently out-performed all the classic formulae in both single and multiple faults, especially OP2 on average of 2.25% in single faults and 3.42% in multiple faults. The experiment results conclude that the combination of Hybrid data set and radical is a good technique to evolve effective formulae for spectra-based fault localization.https://ieeexplore.ieee.org/document/9247185/Debuggingfault localizationgenetic programmingSBFL
spellingShingle Adekunle Akinjobi Ajibode
Ting Shu
Zuohua Ding
Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization
IEEE Access
Debugging
fault localization
genetic programming
SBFL
title Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization
title_full Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization
title_fullStr Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization
title_full_unstemmed Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization
title_short Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization
title_sort evolving suspiciousness metrics from hybrid data set for boosting a spectrum based fault localization
topic Debugging
fault localization
genetic programming
SBFL
url https://ieeexplore.ieee.org/document/9247185/
work_keys_str_mv AT adekunleakinjobiajibode evolvingsuspiciousnessmetricsfromhybriddatasetforboostingaspectrumbasedfaultlocalization
AT tingshu evolvingsuspiciousnessmetricsfromhybriddatasetforboostingaspectrumbasedfaultlocalization
AT zuohuading evolvingsuspiciousnessmetricsfromhybriddatasetforboostingaspectrumbasedfaultlocalization