Published January 1, 2011 | Version v1
Journal article Open

Thresholds based outlier detection approach for mining class outliers: An empirical case study on software measurement datasets

  • 1. Inst Informat Technol, Sci & Technol Res Council Turkey TUBITAK, Natl Res Inst Elect & Cryptol UEKAE, TR-41470 Kocaeli, Turkey

Description

Predicting the fault-proneness labels of software program modules is an emerging software quality assurance activity, and the quality of datasets collected from previous software versions affects the performance of fault prediction models. In this paper, we propose an outlier detection approach that uses metrics thresholds and class labels to identify class outliers. We evaluate our approach on public NASA datasets from the PROMISE repository. Experiments reveal that this novel outlier detection method improves the performance of robust software fault prediction models based on the Naive Bayes and Random Forests machine learning algorithms. (C) 2010 Elsevier Ltd. All rights reserved.
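
The description names the ingredients of the approach (metrics thresholds and class labels) but not the exact rule. The following is a minimal sketch of one plausible reading, not the paper's algorithm: a module is treated as a class outlier when its label disagrees with what its metrics suggest. The metric names, threshold values, majority rule, and all function names are assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical thresholds, chosen only for illustration (not from the paper).
THRESHOLDS: Dict[str, float] = {
    "loc": 65.0,
    "cyclomatic_complexity": 10.0,
    "operand_count": 125.0,
}


@dataclass
class Module:
    name: str
    metrics: Dict[str, float]
    faulty: bool  # class label recorded in the dataset


def exceeded_count(module: Module, thresholds: Dict[str, float]) -> int:
    """Count how many metrics lie strictly above their threshold."""
    return sum(
        1
        for metric, limit in thresholds.items()
        if module.metrics.get(metric, 0.0) > limit
    )


def is_class_outlier(
    module: Module,
    thresholds: Dict[str, float] = THRESHOLDS,
    majority: float = 0.5,
) -> bool:
    """Assumed rule: the label contradicts the threshold evidence."""
    exceeded = exceeded_count(module, thresholds)
    if module.faulty:
        # Labeled faulty although no metric exceeds its threshold.
        return exceeded == 0
    # Labeled not faulty although most metrics exceed their thresholds.
    return exceeded > majority * len(thresholds)


def remove_class_outliers(modules: List[Module]) -> List[Module]:
    """Return the modules that are not flagged as class outliers."""
    return [m for m in modules if not is_class_outlier(m)]


modules = [
    Module("a", {"loc": 20, "cyclomatic_complexity": 3, "operand_count": 40}, False),
    Module("b", {"loc": 300, "cyclomatic_complexity": 25, "operand_count": 400}, False),
    Module("c", {"loc": 10, "cyclomatic_complexity": 1, "operand_count": 5}, True),
    Module("d", {"loc": 200, "cyclomatic_complexity": 30, "operand_count": 50}, True),
]
cleaned = remove_class_outliers(modules)
print([m.name for m in cleaned])
```

In this sketch the filtered list would then serve as training data for a fault prediction model; whether the paper filters in one pass or in several stages is not stated in the description.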

Files

bib-0cf933c1-90ca-4cd5-97a9-77714418c849.txt

Files (209 Bytes)

md5:3dfe1663813b89b45dc0e4d0d12a8555