Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering

Abstract Just‐In‐Time (JIT) defect prediction aims to predict the defect proneness of software changes when they are initially submitted. It has become a hot topic in software defect prediction due to its timely manner and traceability. Researchers have proposed many JIT defect prediction approaches...

Descripción completa

Detalles Bibliográficos
Autores principales: Huan Zhang, Li Kuang, Aolang Wu, Qiuming Zhao, Xiaoxian Yang
Formato: Artículo
Lenguaje:English
Publicado: Hindawi-IET 2023-08-01
Colección:IET Software
Materias:
Acceso en línea:https://doi.org/10.1049/sfw2.12131
_version_ 1826992823409311744
author Huan Zhang
Li Kuang
Aolang Wu
Qiuming Zhao
Xiaoxian Yang
author_facet Huan Zhang
Li Kuang
Aolang Wu
Qiuming Zhao
Xiaoxian Yang
author_sort Huan Zhang
collection DOAJ
description Abstract Just‐In‐Time (JIT) defect prediction aims to predict the defect proneness of software changes when they are initially submitted. It has become a hot topic in software defect prediction due to its timely manner and traceability. Researchers have proposed many JIT defect prediction approaches. However, these approaches cannot effectively utilise line labels representing added or removed lines and ignore the noise caused by defect‐irrelevant files. Therefore, a JIT defect prediction model enhanced by the joint method of line label Fusion and file Filtering (JIT‐FF) is proposed. Firstly, to distinguish added and removed lines while preserving the original software changes information, the authors represent the code changes as original, added, and removed codes according to line labels. Secondly, to obtain semantics‐enhanced code representation, a cross‐attention‐based line label fusion method to perform complementary feature enhancement is proposed. Thirdly, to generate code changes containing fewer defect‐irrelevant files, the authors formalise the file filtering as a sequential decision problem and propose a reinforcement learning‐based file filtering method. Finally, based on generated code changes, CodeBERT‐based commit representation and multi‐layer perceptron‐based defect prediction are performed to identify the defective software changes. The experiments demonstrate that JIT‐FF can predict defective software changes more effectively.
first_indexed 2024-03-09T07:03:50Z
format Article
id doaj.art-119306bc943341aea65f525846d911f6
institution Directory Open Access Journal
issn 1751-8806
1751-8814
language English
last_indexed 2025-02-18T08:55:35Z
publishDate 2023-08-01
publisher Hindawi-IET
record_format Article
series IET Software
spelling doaj.art-119306bc943341aea65f525846d911f62024-11-02T23:55:37ZengHindawi-IETIET Software1751-88061751-88142023-08-0117437839110.1049/sfw2.12131Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filteringHuan Zhang0Li Kuang1Aolang Wu2Qiuming Zhao3Xiaoxian Yang4School of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer Science and Engineering Central South University Hunan ChinaSchool of Computer and Information Engineering Shanghai Polytechnic University Shanghai ChinaAbstract Just‐In‐Time (JIT) defect prediction aims to predict the defect proneness of software changes when they are initially submitted. It has become a hot topic in software defect prediction due to its timely manner and traceability. Researchers have proposed many JIT defect prediction approaches. However, these approaches cannot effectively utilise line labels representing added or removed lines and ignore the noise caused by defect‐irrelevant files. Therefore, a JIT defect prediction model enhanced by the joint method of line label Fusion and file Filtering (JIT‐FF) is proposed. Firstly, to distinguish added and removed lines while preserving the original software changes information, the authors represent the code changes as original, added, and removed codes according to line labels. Secondly, to obtain semantics‐enhanced code representation, a cross‐attention‐based line label fusion method to perform complementary feature enhancement is proposed. Thirdly, to generate code changes containing fewer defect‐irrelevant files, the authors formalise the file filtering as a sequential decision problem and propose a reinforcement learning‐based file filtering method. Finally, based on generated code changes, CodeBERT‐based commit representation and multi‐layer perceptron‐based defect prediction are performed to identify the defective software changes. The experiments demonstrate that JIT‐FF can predict defective software changes more effectively.https://doi.org/10.1049/sfw2.12131software engineeringsoftware quality
spellingShingle Huan Zhang
Li Kuang
Aolang Wu
Qiuming Zhao
Xiaoxian Yang
Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
IET Software
software engineering
software quality
title Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
title_full Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
title_fullStr Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
title_full_unstemmed Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
title_short Just‐in‐time defect prediction enhanced by the joint method of line label fusion and file filtering
title_sort just in time defect prediction enhanced by the joint method of line label fusion and file filtering
topic software engineering
software quality
url https://doi.org/10.1049/sfw2.12131
work_keys_str_mv AT huanzhang justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering
AT likuang justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering
AT aolangwu justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering
AT qiumingzhao justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering
AT xiaoxianyang justintimedefectpredictionenhancedbythejointmethodoflinelabelfusionandfilefiltering