Robust Decision Trees: Removing Outliers from Databases

  title={Robust Decision Trees: Removing Outliers from Databases},
  author={George H. John},
Finding and removing outliers is an important problem in data mining. Errors in large databases can be extremely common, so an important property of a data mining algorithm is robustness with respect to errors in the database. Most sophisticated methods in machine learning address this problem to some extent, but not fully, and can be improved by addressing the problem more directly. In this paper we examine C4.5, a decision tree algorithm that is already quite robust few algorithms have been… CONTINUE READING
Highly Influential
This paper has highly influenced a number of papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 220 citations. REVIEW CITATIONS

3 Figures & Tables



Citations per Year

221 Citations

Semantic Scholar estimates that this publication has 221 citations based on the available data.

See our FAQ for additional information.