Data Mining Static Code Attributes to Learn Defect Predictors


The value of using static code attributes to learn defect predictors has been widely debated. Prior work has explored issues like the merits of "McCabes versus Halstead versus lines of code counts" for generating defect predictors. We show here that such debates are irrelevant since how the attributes are used to build predictors is much more important than which particular attributes are used. Also, contrary to prior pessimism, we show that such defect predictors are demonstrably useful and, on the data studied here, yield predictors with a mean probability of detection of 71 percent and mean false alarms rates of 25 percent. These predictors would be useful for prioritizing a resource-bound exploration of code that has yet to be inspected

DOI: 10.1109/TSE.2007.256941

Extracted Key Phrases

11 Figures and Tables

Showing 1-10 of 46 references

Elements of Software Science

  • M Halstead
  • 1977
Highly Influential
16 Excerpts

Data Mining

  • I H Witten, E Frank
  • 2005
Showing 1-10 of 456 extracted citations
Citations per Year

762 Citations

Semantic Scholar estimates that this publication has received between 654 and 891 citations based on the available data.

See our FAQ for additional information.