Multiview Transfer Learning for Software Defect Prediction

Most software defect prediction models usually assume that enough historical training instances with labels are available. Additionally, the training data and the predicted instances should share the same features to ensure the prediction accuracy. However, in practice, there are many datasets with...

Full description

Bibliographic Details
Main Authors: Jinyin Chen, Yitao Yang, Keke Hu, Qi Xuan, Yi Liu, Chao Yang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8600320/
_version_ 1818416100819337216
author Jinyin Chen
Yitao Yang
Keke Hu
Qi Xuan
Yi Liu
Chao Yang
author_facet Jinyin Chen
Yitao Yang
Keke Hu
Qi Xuan
Yi Liu
Chao Yang
author_sort Jinyin Chen
collection DOAJ
description Most software defect prediction models usually assume that enough historical training instances with labels are available. Additionally, the training data and the predicted instances should share the same features to ensure the prediction accuracy. However, in practice, there are many datasets with different granularities containing information in different dimensions. Therefore, it is valuable to effectively use the small scale and different dimensions of data as training instances to improve the prediction performance of the model. We propose a heterogeneous data orienting multiview transfer learning for software defect prediction, denoted as MTDP, which can achieve different dimensions and granularities features to automatically learn labels through neural network models. With this multiview transfer method, lots of training instances are provided for software defect prediction model to ensure the effectiveness of training labels. The proposed MTDP method has four main stages: 1) build heterogeneous transfer models; 2) transfer heterogeneous instances to generate quasi-real instances; 3) label quasi-real instances through co-training and then expand the training set; and (4) construct improved software defect prediction models. The experimental results show that the quasi-real instances have similar effects compared with real instances. Moreover, the software defect prediction performance can be improved by introducing the quasi-real instances into the training dataset.
first_indexed 2024-12-14T11:45:31Z
format Article
id doaj.art-2f6b9b7f717b461a95d254d19bad50c8
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-14T11:45:31Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-2f6b9b7f717b461a95d254d19bad50c82022-12-21T23:02:36ZengIEEEIEEE Access2169-35362019-01-0178901891610.1109/ACCESS.2018.28907338600320Multiview Transfer Learning for Software Defect PredictionJinyin Chen0Yitao Yang1Keke Hu2Qi Xuan3Yi Liu4https://orcid.org/0000-0002-4066-689XChao Yang5College of Information Engineering, Zhejiang University of Technology, Hangzhou, ChinaCollege of Information Engineering, Zhejiang University of Technology, Hangzhou, ChinaCollege of Information Engineering, Zhejiang University of Technology, Hangzhou, ChinaCollege of Information Engineering, Zhejiang University of Technology, Hangzhou, ChinaInstitute of Process Equipment and Control Engineering, Zhejiang University of Technology, Hangzhou, ChinaInstitute of Process Equipment and Control Engineering, Zhejiang University of Technology, Hangzhou, ChinaMost software defect prediction models usually assume that enough historical training instances with labels are available. Additionally, the training data and the predicted instances should share the same features to ensure the prediction accuracy. However, in practice, there are many datasets with different granularities containing information in different dimensions. Therefore, it is valuable to effectively use the small scale and different dimensions of data as training instances to improve the prediction performance of the model. We propose a heterogeneous data orienting multiview transfer learning for software defect prediction, denoted as MTDP, which can achieve different dimensions and granularities features to automatically learn labels through neural network models. With this multiview transfer method, lots of training instances are provided for software defect prediction model to ensure the effectiveness of training labels. The proposed MTDP method has four main stages: 1) build heterogeneous transfer models; 2) transfer heterogeneous instances to generate quasi-real instances; 3) label quasi-real instances through co-training and then expand the training set; and (4) construct improved software defect prediction models. The experimental results show that the quasi-real instances have similar effects compared with real instances. Moreover, the software defect prediction performance can be improved by introducing the quasi-real instances into the training dataset.https://ieeexplore.ieee.org/document/8600320/Multiview transfer learningsoftware defect predictionactive learningneural network
spellingShingle Jinyin Chen
Yitao Yang
Keke Hu
Qi Xuan
Yi Liu
Chao Yang
Multiview Transfer Learning for Software Defect Prediction
IEEE Access
Multiview transfer learning
software defect prediction
active learning
neural network
title Multiview Transfer Learning for Software Defect Prediction
title_full Multiview Transfer Learning for Software Defect Prediction
title_fullStr Multiview Transfer Learning for Software Defect Prediction
title_full_unstemmed Multiview Transfer Learning for Software Defect Prediction
title_short Multiview Transfer Learning for Software Defect Prediction
title_sort multiview transfer learning for software defect prediction
topic Multiview transfer learning
software defect prediction
active learning
neural network
url https://ieeexplore.ieee.org/document/8600320/
work_keys_str_mv AT jinyinchen multiviewtransferlearningforsoftwaredefectprediction
AT yitaoyang multiviewtransferlearningforsoftwaredefectprediction
AT kekehu multiviewtransferlearningforsoftwaredefectprediction
AT qixuan multiviewtransferlearningforsoftwaredefectprediction
AT yiliu multiviewtransferlearningforsoftwaredefectprediction
AT chaoyang multiviewtransferlearningforsoftwaredefectprediction