
Improving Logging Prediction on Imbalanced Datasets: A Case Study on Open Source Java Projects

Author(s): Sangeeta Lal (Jaypee Institute of Information Technology Noida, Department of CSE & IT, Noida, Uttar Pradesh, India), Neetu Sardana (Jaypee Institute of Information Technology Noida, Department of CSE & IT, Noida, Uttar Pradesh, India), and Ashish Sureka (ABB Corporate Research Center, Bangalore, India)
Copyright: 2020
Pages: 33
Source title: Cognitive Analytics: Concepts, Methodologies, Tools, and Applications
Source Author(s)/Editor(s): Information Resources Management Association (USA)
DOI: 10.4018/978-1-7998-2460-2.ch039


Abstract

Logging is an important yet difficult decision for OSS developers. Machine-learning models are useful in improving several steps of OSS development, including logging. Several recent studies propose machine-learning models to predict logged code constructs. The prediction performance of these models is limited by the class-imbalance problem, since the number of logged code constructs is small compared to the number of non-logged code constructs. No previous study analyzes the class-imbalance problem for logged code construct prediction. The authors first analyze the performance of the J48, RF, and SVM classifiers for predicting logged catch-blocks and if-blocks on imbalanced datasets. Second, the authors propose LogIm, an ensemble and threshold-based machine-learning model. Third, the authors evaluate the performance of LogIm on three open-source projects. On average, LogIm improves the performance of the baseline classifiers J48, RF, and SVM by 7.38%, 9.24%, and 4.6% for catch-block logging prediction, and by 12.11%, 14.95%, and 19.13% for if-block logging prediction.
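To illustrate the general idea behind ensemble- and threshold-based prediction on imbalanced data, the sketch below averages the positive-class probabilities of several classifiers and then, instead of the default 0.5 cut-off, selects the probability threshold that maximizes F1 on a validation set. This is a minimal, self-contained illustration of threshold-moving in general; the function names and the F1 selection criterion are illustrative assumptions, not the authors' actual LogIm implementation.

```python
# Minimal sketch of threshold-moving for imbalanced binary classification.
# Positive class (label 1) = logged code construct; it is assumed rare.

def f1_at_threshold(probs, labels, threshold):
    """F1 score of the positive class when predicting 1 iff prob >= threshold."""
    tp = fp = fn = 0
    for p, y in zip(probs, labels):
        pred = 1 if p >= threshold else 0
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 1 and y == 0:
            fp += 1
        elif pred == 0 and y == 1:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def best_threshold(probs, labels, candidates=None):
    """Scan candidate thresholds and return the one with the highest F1."""
    if candidates is None:
        candidates = [i / 100 for i in range(1, 100)]
    return max(candidates, key=lambda t: f1_at_threshold(probs, labels, t))

def ensemble_probs(model_probs):
    """Average the positive-class probabilities of several base classifiers."""
    return [sum(ps) / len(ps) for ps in zip(*model_probs)]
```

Because logged constructs are the minority class, a classifier's raw probabilities for them tend to sit below 0.5, so the tuned threshold typically ends up well under the default cut-off, trading a few false positives for substantially better recall on the rare class.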
