000200040 001__ 200040
000200040 005__ 20190316235930.0
000200040 037__ $$aSTUDENT
000200040 245__ $$aIntegrating Defect Data, Code Review Data, and Version Control Data for Defect Analysis and Prediction
000200040 269__ $$a2013
000200040 260__ $$c2013
000200040 336__ $$aStudent Projects
000200040 520__ $$aIn this thesis, we present a new approach to integrating software system defect data: Defect reports, code reviews and code commits. We propose to infer defect types by keywords. We index defect reports into groups by the keywords found in the descriptions of those reports, and study the properties of each group by leveraging code reviews and code commits. Our approach is more scalable than previous studies that consider defects classified by manual inspections, because indexing is automatic and can be applied uniformly to large defect dataset. Also our approach can analyze defects from programming errors, performance issues, high-level design to user interface, a more comprehensive variety than previous studies using static program analysis. By applying our approach to Honeywell Automation and Control Solutions (ACS) projects, with roughly 700 defects, we found that some defect types could be five times more than other defect types, which gave clues to the dominant root causes of the defects. We found certain defect types clustered in certain source files. We found that 20%-50% of the files usually contained more than 80% of the defects. Finally, we applied a known defect prediction algorithm to predict the hot files of the defects for the defect types of interest. We achieved defect hit rate 50%-90%.
000200040 6531_ $$aDefect statistics
000200040 6531_ $$adefect analysis
000200040 6531_ $$adefect prediction
000200040 700__ $$aLee, Tao-Chun
000200040 720_2 $$0241982$$aCandea, George$$edir.$$g172241
000200040 8564_ $$s974562$$uhttps://infoscience.epfl.ch/record/200040/files/MSDefenceTalk_160913.pdf$$zn/a
000200040 8564_ $$s976570$$uhttps://infoscience.epfl.ch/record/200040/files/MSThesisFinalVersionTao_280813.pdf$$yn/a$$zn/a
000200040 909C0 $$0252225$$pDSLAB$$xU11275
000200040 909CO $$ooai:infoscience.tind.io:200040$$pIC$$qGLOBAL_SET
000200040 917Z8 $$x210458
000200040 917Z8 $$x210458
000200040 917Z8 $$x210458
000200040 917Z8 $$x210458
000200040 917Z8 $$x210458
000200040 937__ $$aEPFL-STUDENT-200040
000200040 973__ $$aOTHER
000200040 980__ $$aSTUDENT$$bMASTERS