On the chance accuracies of large collections of classifiers

@inproceedings{Palatucci2008OnTC,
  title={On the chance accuracies of large collections of classifiers},
  author={Mark Palatucci and Andrew Carlson},
  booktitle={ICML},
  year={2008}
}
We provide a theoretical analysis of the chance accuracies of large collections of classifiers. We show that on problems with small numbers of examples, some classifier can perform well by random chance, and we derive a theorem to explicitly calculate this accuracy. We use this theorem to provide a principled feature selection criterion for sparse, high-dimensional problems. We evaluate this method on microarray and fMRI datasets and show that it performs very close to the optimal accuracy… CONTINUE READING