Cross-Validated Variable Selection in Tree-Based Methods Improves Predictive Performance

@article{Painsky2017CrossValidatedVS,
  title={Cross-Validated Variable Selection in Tree-Based Methods Improves Predictive Performance},
  author={Amichai Painsky and S. Rosset},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2017},
  volume={39},
  pages={2142--2153}
}
  • Amichai Painsky, S. Rosset
  • Published 2017
  • Computer Science, Mathematics, Medicine
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
Recursive partitioning methods producing tree-like models are a long-standing staple of predictive modeling. However, a fundamental flaw in the partitioning (or splitting) rule of commonly used tree-building methods precludes them from treating different types of variables equally. This most clearly manifests in these methods’ inability to properly utilize categorical variables with a large number of categories, which are ubiquitous in the new age of big data. We propose a framework to…
20 Citations
Decision tree underfitting in mining of gene expression data. An evolutionary multi-test tree approach
  • 9
Trees-Based Models for Correlated Data
On the Universality of the Logistic Loss Function
  • 21
The Algorithmic Classification Trees
  • I. Povkhan, Maksym Lupei
  • Computer Science
  • 2020 IEEE Third International Conference on Data Stream Mining & Processing (DSMP)
  • 2020
  • 1
Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss
  • 6
