Comparative Analysis of Random Forests with Statistical and Machine Learning Methods in Predicting Fault-Prone Classes

View Sample PDF

Author(s): Ruchika Malhotra (Delhi Technological University, India), Arvinder Kaur (GGS Indraprastha University, India)and Yogesh Singh (GGS Indraprastha University, India)
Copyright: 2012
Pages: 22
Source title: Cross-Disciplinary Applications of Artificial Intelligence and Pattern Recognition: Advancing Technologies
Source Author(s)/Editor(s): Vijay Kumar Mago (Simon Fraser University, Canada)and Nitin Bhatia (DAV College, India)
DOI: 10.4018/978-1-61350-429-1.ch023

Keywords: Artificial Intelligence / Computer Science & IT / Information Science Reference

Purchase

View Comparative Analysis of Random Forests with Statistical and Machine Learning Methods in Predicting Fault-Prone Classes on the publisher's website for pricing and purchasing information.

Abstract

There are available metrics for predicting fault prone classes, which may help software organizations for planning and performing testing activities. This may be possible due to proper allocation of resources on fault prone parts of the design and code of the software. Hence, importance and usefulness of such metrics is understandable, but empirical validation of these metrics is always a great challenge. Random Forest (RF) algorithm has been successfully applied for solving regression and classification problems in many applications. In this work, the authors predict faulty classes/modules using object oriented metrics and static code metrics. This chapter evaluates the capability of RF algorithm and compares its performance with nine statistical and machine learning methods in predicting fault prone software classes. The authors applied RF on six case studies based on open source, commercial software and NASA data sets. The results indicate that the prediction performance of RF is generally better than statistical and machine learning models. Further, the classification of faulty classes/modules using the RF method is better than the other methods in most of the data sets.

The IRMA Community

Research IRM

Comparative Analysis of Random Forests with Statistical and Machine Learning Methods in Predicting Fault-Prone Classes

Purchase

Abstract

Related Content

IRMA Sponsors