CoreBug: Improving Effort-Aware Bug Prediction in Software Systems Using Generalized <i>k</i>-Core Decomposition in Class Dependency Networks

Complex network theory has been successfully introduced into the field of software engineering. Many works in the literature have built complex networks in software, usually called software networks, to represent software structure. Such software networks and their related graph algorithms have been...

Full description

Bibliographic Details
Main Authors: Xin Du, Tian Wang, Liuhai Wang, Weifeng Pan, Chunlai Chai, Xinxin Xu, Bo Jiang, Jiale Wang
Format: Article
Language:English
Published: MDPI AG 2022-04-01
Series:Axioms
Subjects:
Online Access:https://www.mdpi.com/2075-1680/11/5/205
Description
Summary:Complex network theory has been successfully introduced into the field of software engineering. Many works in the literature have built complex networks in software, usually called software networks, to represent software structure. Such software networks and their related graph algorithms have been proved effective in predicting software bugs. However, the software networks used were unweighted and undirected, neglecting the strength and direction of the couplings. Worse still, they ignored many important types of couplings between classes, such as <i>local variable</i>, <i>instantiates</i>, and <i>access</i>. All of these greatly affect the accuracy of the software network in representing the topological detail of software projects and ultimately affect the metrics derived from it. In this work, an improved effort-aware bug prediction approach named CoreBug is proposed. First, CoreBug uses a weighted directed class dependency network (WDCDN) to precisely describe classes and their couplings, including nine coupling types and their different coupling strengths and directions. Second, a generalized <i>k</i>-core decomposition is introduced to compute the <i>coreness</i> of each class in the WDCDN. Third, CoreBug combines the <i>coreness</i> of each class with its <i>relative risk</i>, as returned by the logistic regression, to quantify the risk of a given class being buggy. Empirical results on eighteen Java projects show that CoreBug is superior to the state-of-the-art approaches according to the average ranking of the Friedman test.
ISSN:2075-1680