A Case Study of Bias in Bug-Fix Datasets

@article{Nguyen2010ACS,
  title={A Case Study of Bias in Bug-Fix Datasets},
  author={Thanh H. D. Nguyen and Bram Adams and Ahmed E. Hassan},
  journal={2010 17th Working Conference on Reverse Engineering},
  year={2010},
  pages={259-268}
}
Software quality researchers build software quality models by recovering traceability links between bug reports in issue tracking repositories and source code files. However, all too often the data stored in issue tracking repositories is not explicitly tagged or linked to source code. Researchers have to resort to heuristics to tag the data (e.g., to determine if an issue is a bug report or a work item), or to link a piece of code to a particular issue or bug. Recent studies by Bird et al. and… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 40 CITATIONS

Studying the Evolution of Build Systems

VIEW 4 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Mining Software Defects: Should We Consider Affected Releases?

  • 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)
  • 2019
VIEW 2 EXCERPTS
CITES BACKGROUND

ConPredictor: Concurrency Defect Prediction in Real-World Applications

  • IEEE Transactions on Software Engineering
  • 2018
VIEW 1 EXCERPT
CITES BACKGROUND

Does Geographical Distance Effect Distributed Development Teams: How Aggregation Bias in Software Artifacts Causes Contradictory Findings

  • 2016 IEEE 27th International Symposium on Software Reliability Engineering (ISSRE)
  • 2016
VIEW 3 EXCERPTS
CITES BACKGROUND

Predicting Build Co-changes with Source Code Change and Commit Categories

  • 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER)
  • 2016
VIEW 1 EXCERPT
CITES BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 15 REFERENCES

Fair and balanced?: bias in bug-fix datasets

  • ESEC/SIGSOFT FSE
  • 2009
VIEW 10 EXCERPTS
HIGHLY INFLUENTIAL

Predicting Defects for Eclipse

  • Third International Workshop on Predictor Models in Software Engineering (PROMISE'07: ICSE Workshops 2007)
  • 2007
VIEW 7 EXCERPTS
HIGHLY INFLUENTIAL

Using Software Dependencies and Churn Metrics to Predict Field Failures: An Empirical Case Study

  • First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007)
  • 2007
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Predicting faults using the complexity of code changes

  • 2009 IEEE 31st International Conference on Software Engineering
  • 2009
VIEW 1 EXCERPT