publication . Preprint . 2019

Transfer-Learning Oriented Class Imbalance Learning for Cross-Project Defect Prediction

Tong, Haonan; Liu, Bin; Wang, Shihai; Li, Qiuying;
Open Access English
  • Published: 24 Jan 2019
Abstract
Cross-project defect prediction (CPDP) aims to predict defects of projects lacking training data by using prediction models trained on historical defect data from other projects. However, since the distribution differences between datasets from different projects, it is still a challenge to build high-quality CPDP models. Unfortunately, class imbalanced nature of software defect datasets further increases the difficulty. In this paper, we propose a transferlearning oriented minority over-sampling technique (TOMO) based feature weighting transfer naive Bayes (FWTNB) approach (TOMOFWTNB) for CPDP by considering both classimbalance and feature importance problems. ...
Subjects
free text keywords: Computer Science - Software Engineering, Computer Science - Machine Learning
Related Organizations
Download from
59 references, page 1 of 4

[1] Q. Song, Z. Jia, M. Shepperd, S. Ying, J. Liu, A general software defect-proneness prediction framework, IEEE Transactions on Software Engineering 37 (3) (2011) 356- 370. doi:10.1109/TSE.2010.90. [OpenAIRE]

[2] T. Gyimothy, R. Ferenc, I. Siket, Empirical validation of object-oriented metrics on open source software for fault pre IEEE Trans. Softw. Eng. 31 (10) (2005) 897-910. doi:10.1109/TSE.2005.112. URL https://doi.org/10.1109/TSE.2005.112 [OpenAIRE]

[3] C. Catal, B. Diri, A systematic review of software fault prediction studies, Expert Syst. Appl. 36 (4) (2009) 7346-7354. doi:10.1016/j.eswa.2008.10.027. URL http://dx.doi.org/10.1016/j.eswa.2008.10.027 [OpenAIRE]

[4] B. Kitchenham, O. P. Brereton, D. Budgen, M. Turner, J. Bailey, S. Linkman, Systematic literature reviews in software engineering a systematic literature review, Information and Software Technology 51 (1) (2009) 7 - 15, special Section - Most Cited Articles in 2002 and Regular Research Papers.

[5] T. Menzies, Z. Milton, B. Turhan, B. Cukic, Y. Jiang, A. Bener, Defect prediction from static code features: Current results, limitations, new approaches, Automated Software Engg. 17 (4) (2010) 375-407. doi:10.1007/s10515-010-0069-5. URL http://dx.doi.org/10.1007/s10515-010-0069-5

[6] T. Hall, S. Beecham, D. Bowes, D. Gray, S. Counsell, A systematic literature review on fault prediction performance in software engineering, IEEE Transactions on Software Engineering 38 (6) (2012) 1276-1304. doi:10.1109/TSE.2011.103. [OpenAIRE]

[7] M. Shepperd, Q. Song, Z. Sun, C. Mair, Data quality: Some comments on the nasa software defect datasets, IEEE Transactions on Software Engineering 39 (9) (2013) 1208-1215. doi:10.1109/TSE.2013.11. URL http://dx.doi.org/10.1109/TSE.2013.11 [OpenAIRE]

[8] C. Catal, Software fault prediction: A literature review and current trends, Expert Systems with Applications 38 (4) (2011) 4626 - 4636. doi:https://doi.org/10.1016/j.eswa.2010.10.024. URL http://www.sciencedirect.com/science/article/pii/S0957417410011681 [OpenAIRE]

[9] B. Ghotra, S. McIntosh, A. E. Hassan, Revisiting the impact of classification techniques on the performance of defect in: Proceedings of the 37th International Conference on Software Engineering - Volume 1, ICSE '15, IEEE Press, Piscataway, NJ, USA, 2015, pp. 789-800. URL http://dl.acm.org/citation.cfm?id=2818754.2818850

[10] S. S. Rathore, S. Kumar, Linear and non-linear heterogeneous ensemble methods to predict the number of faults in sof Knowledge-Based Systems 119 (Supplement C) (2017) 232 - 256. doi:https://doi.org/10.1016/j.knosys.2016.12.017. URL http://www.sciencedirect.com/science/article/pii/S0950705116305202

[11] S. Kanmani, V. R. Uthariaraj, V. Sankaranarayanan, P. Thambidurai, Object-oriented software fault prediction using neural networks, Inf. Softw. Technol. 49 (5) (2007) 483-492. doi:10.1016/j.infsof.2006.07.005. URL http://dx.doi.org/10.1016/j.infsof.2006.07.005 [OpenAIRE]

[12] K. O. Elish, M. O. Elish, Predicting defect-prone software modules using support vector machines, Journal of Systems and Software 81 (5) (2008) 649 - 660, software Process and Product Measurement. doi:https://doi.org/10.1016/j.jss.2007.07.040. URL http://www.sciencedirect.com/science/article/pii/S016412120700235X [OpenAIRE]

[13] K. Gao, T. M. Khoshgoftaar, H. Wang, N. Seliya, Choosing software metrics for defect prediction: An investigation on feature selection techniques, Softw. Pract. Exper. 41 (5) (2011) 579-606. doi:10.1002/spe.1043. URL http://dx.doi.org/10.1002/spe.1043 [OpenAIRE]

[15] G.Czibula,Z.Marian,I.G.Czibula,Softwaredefectpredictionusing relationalassociation rulemining, Information Sciences 264 (2014) 260 - 278, serious Games. doi:https://doi.org/10.1016/j.ins.2013.12.031. URLhttp://www.sciencedirect.com/science/article/pii/S0020025513008876

[16] X. Yang, K. Tang, X. Yao, A Learning-to-Rank Approach to Software Defect Prediction, IEEE TRANSACTIONS ON RELIABILITY 64 (1) (2015) 234-246. doi:{10.1109/TR.2014.2370891}.

59 references, page 1 of 4
Powered by OpenAIRE Open Research Graph
Any information missing or wrong?Report an Issue