Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
Abstract Just‐In‐Time (JIT) defect prediction aims to predict the defect proneness of software changes when they are initially submitted. It has become a hot topic in software defect prediction due to its timely manner and traceability. Researchers have proposed many JIT defect prediction approaches...
Autores principales: | , , , , |
---|---|
Formato: | Artículo |
Lenguaje: | English |
Publicado: |
Hindawi-IET
2023-08-01
|
Colección: | IET Software |
Materias: | |
Acceso en línea: | https://doi.org/10.1049/sfw2.12131 |
_version_ | 1826992823409311744 |
---|---|
author | Huan Zhang Li Kuang Aolang Wu Qiuming Zhao Xiaoxian Yang |
author_facet | Huan Zhang Li Kuang Aolang Wu Qiuming Zhao Xiaoxian Yang |
author_sort | Huan Zhang |
collection | DOAJ |
description | Abstract Just‐In‐Time (JIT) defect prediction aims to predict the defect proneness of software changes when they are initially submitted. It has become a hot topic in software defect prediction due to its timely manner and traceability. Researchers have proposed many JIT defect prediction approaches. However, these approaches cannot effectively utilise line labels representing added or removed lines and ignore the noise caused by defect‐irrelevant files. Therefore, a JIT defect prediction model enhanced by the joint method of line label Fusion and file Filtering (JIT‐FF) is proposed. Firstly, to distinguish added and removed lines while preserving the original software changes information, the authors represent the code changes as original, added, and removed codes according to line labels. Secondly, to obtain semantics‐enhanced code representation, a cross‐attention‐based line label fusion method to perform complementary feature enhancement is proposed. Thirdly, to generate code changes containing fewer defect‐irrelevant files, the authors formalise the file filtering as a sequential decision problem and propose a reinforcement learning‐based file filtering method. Finally, based on generated code changes, CodeBERT‐based commit representation and multi‐layer perceptron‐based defect prediction are performed to identify the defective software changes. The experiments demonstrate that JIT‐FF can predict defective software changes more effectively. |
first_indexed | 2024-03-09T07:03:50Z |
format | Article |
id | doaj.art-119306bc943341aea65f525846d911f6 |
institution | Directory Open Access Journal |
issn | 1751-8806 1751-8814 |
language | English |
last_indexed | 2025-02-18T08:55:35Z |
publishDate | 2023-08-01 |
publisher | Hindawi-IET |
record_format | Article |
series | IET Software |
spelling | doaj.art-119306bc943341aea65f525846d911f62024-11-02T23:55:37ZengHindawi-IETIET Software1751-88061751-88142023-08-0117437839110.1049/sfw2.12131Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filteringHuan Zhang0Li Kuang1Aolang Wu2Qiuming Zhao3Xiaoxian Yang4School of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer and Information Engineering Shanghai Polytechnic University Shanghai ChinaAbstract Just‐In‐Time (JIT) defect prediction aims to predict the defect proneness of software changes when they are initially submitted. It has become a hot topic in software defect prediction due to its timely manner and traceability. Researchers have proposed many JIT defect prediction approaches. However, these approaches cannot effectively utilise line labels representing added or removed lines and ignore the noise caused by defect‐irrelevant files. Therefore, a JIT defect prediction model enhanced by the joint method of line label Fusion and file Filtering (JIT‐FF) is proposed. Firstly, to distinguish added and removed lines while preserving the original software changes information, the authors represent the code changes as original, added, and removed codes according to line labels. Secondly, to obtain semantics‐enhanced code representation, a cross‐attention‐based line label fusion method to perform complementary feature enhancement is proposed. Thirdly, to generate code changes containing fewer defect‐irrelevant files, the authors formalise the file filtering as a sequential decision problem and propose a reinforcement learning‐based file filtering method. Finally, based on generated code changes, CodeBERT‐based commit representation and multi‐layer perceptron‐based defect prediction are performed to identify the defective software changes. The experiments demonstrate that JIT‐FF can predict defective software changes more effectively.https://doi.org/10.1049/sfw2.12131software engineeringsoftware quality |
spellingShingle | Huan Zhang Li Kuang Aolang Wu Qiuming Zhao Xiaoxian Yang Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering IET Software software engineering software quality |
title | Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering |
title_full | Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering |
title_fullStr | Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering |
title_full_unstemmed | Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering |
title_short | Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering |
title_sort | just in time defect prediction enhanced by the joint method of line label fusion and file filtering |
topic | software engineering software quality |
url | https://doi.org/10.1049/sfw2.12131 |
work_keys_str_mv | AT huanzhang justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering AT likuang justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering AT aolangwu justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering AT qiumingzhao justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering AT xiaoxianyang justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering |