Integrating Defect Data, Code Review Data, and Version Control Data for Defect Analysis and Prediction

Lee, Tao-Chun

Lee, Tao-Chun

2013

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In this thesis, we present a new approach to integrating software system defect data: Defect reports, code reviews and code commits. We propose to infer defect types by keywords. We index defect reports into groups by the keywords found in the descriptions of those reports, and study the properties of each group by leveraging code reviews and code commits. Our approach is more scalable than previous studies that consider defects classified by manual inspections, because indexing is automatic and can be applied uniformly to large defect dataset. Also our approach can analyze defects from programming errors, performance issues, high-level design to user interface, a more comprehensive variety than previous studies using static program analysis. By applying our approach to Honeywell Automation and Control Solutions (ACS) projects, with roughly 700 defects, we found that some defect types could be five times more than other defect types, which gave clues to the dominant root causes of the defects. We found certain defect types clustered in certain source files. We found that 20%-50% of the files usually contained more than 80% of the defects. Finally, we applied a known defect prediction algorithm to predict the hot files of the defects for the defect types of interest. We achieved defect hit rate 50%-90%.

Details

Title Integrating Defect Data, Code Review Data, and Version Control Data for Defect Analysis and Prediction

Author(s) Lee, Tao-Chun

Advisor(s)

Candea, George

Date 2013

Keywords

Defect statistics; defect analysis; defect prediction

Laboratories DSLAB

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > DSLAB - Dependable Systems Laboratory
Work outside EPFL
Student projects

Work type Master's Thesis

Record creation date 2014-06-26

Actions

Preview

Select file: