Autocorrelation and Linkage Cause Bias in Evaluation of Relational Learners

David D. Jensen and Jennifer Neville
Two common characteristics of relational data sets, concentrated linkage and relational autocorrelation, can cause traditional methods of evaluation to greatly overestimate the accuracy of induced models on test sets. We identify these characteristics, define quantitative measures of their severity, and explain how they produce this bias. We show how linkage and autocorrelation affect estimates of model accuracy by applying FOIL to synthetic data and to data drawn from the Internet Movie…