Author pages are created from data sourced from our academic publisher partnerships and public sources.

- Publications
- Influence

Wrappers for Feature Subset Selection

- R. Kohavi, G. H. John
- Computer Science
- Artif. Intell.
- 1 December 1997

Abstract In the feature subset selection problem, a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention, while ignoring the rest.… Expand

A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection

- R. Kohavi
- Computer Science
- IJCAI
- 20 August 1995

We review accuracy estimation methods and compare the two most common methods crossvalidation and bootstrap. Recent experimental results on artificial data and theoretical re cults in restricted… Expand

An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Methods for voting classification algorithms, such as Bagging and AdaBoost, have been shown to be very successful in improving the accuracy of certain classifiers for artificial and real-world… Expand

Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid

- R. Kohavi
- Computer Science
- KDD
- 2 August 1996

Naive-Bayes induction algorithms were previously shown to be surprisingly accurate on many classification tasks even when the conditional independence assumption on which they are based is violated.… Expand

Supervised and Unsupervised Discretization of Continuous Features

- J. Dougherty, R. Kohavi, M. Sahami
- Computer Science
- ICML
- 9 July 1995

Many supervised machine learning algorithms require a discrete feature space. In this paper, we review previous work on continuous feature discretization, identify defining characteristics of the… Expand

Irrelevant Features and the Subset Selection Problem

- George H. John, R. Kohavi, Karl Pfleger
- Computer Science
- ICML
- 10 July 1994

We address the problem of finding a subset of features that allows a supervised induction algorithm to induce small high-accuracy concepts. We examine notions of relevance and irrelevance, and show… Expand

The Power of Decision Tables

- R. Kohavi
- Computer Science
- ECML
- 25 April 1995

We evaluate the power of decision tables as a hypothesis space for supervised learning algorithms. Decision tables are one of the simplest hypothesis spaces possible, and usually they are easy to… Expand

Controlled experiments on the web: survey and practical guide

- R. Kohavi, Roger Longbotham, D. Sommerfield, Randal M. Henne
- Computer Science
- Data Mining and Knowledge Discovery
- 1 February 2009

The web provides an unprecedented opportunity to evaluate ideas quickly using controlled experiments, also called randomized experiments, A/B tests (and their generalizations), split tests,… Expand

The Case against Accuracy Estimation for Comparing Induction Algorithms

- F. Provost, T. Fawcett, R. Kohavi
- Computer Science
- ICML
- 24 July 1998

We analyze critically the use of classi cation accuracy to compare classi ers on natural data sets, providing a thorough investigation using ROC analysis, standard machine learning algorithms, and… Expand

- 1,130
- 44
- Open Access

Bias Plus Variance Decomposition for Zero-One Loss Functions

- R. Kohavi, D. Wolpert
- Computer Science
- ICML
- 3 July 1996

We present a bias variance decomposition of expected misclassi cation rate the most commonly used loss function in supervised classi cation learning The bias variance decomposition for quadratic loss… Expand

- 644
- 44
- Open Access