• Corpus ID: 2205579

Progress in data-based bandwidth selection for kernel density estimation

  title={Progress in data-based bandwidth selection for kernel density estimation},
  author={Chris Jones and J. S. Marron and Simon J. Sheather},
  journal={Computational Statistics},
We review the extensive recent literature on automatic, data-based selection of a global smoothing parameter in univariate kernel density estimation. Proposals are presented in a unified framework, making considerable reference to their theoretical properties as we go. The results of a major simulation study of the practical performance of many of these methods are summarised. Also, our remarks are further consolidated by describing a small portion of our practical experience on real datasets… 

Figures from this paper

Bandwidth Selection Methods for Kernel Density Estimation - A Review of Performance
Existing methods are reviewed and compared on a set of designs that exhibits features like few bumps and exponentially falling tails concentrating thereby on small and moderate sample sizes to help solve practical issues like fully automatic procedures, implementation and performance.
Bandwidth selection for kernel density estimation: a review of fully automatic selectors
This work reviews existing methods and compares them on a set of designs that exhibit few bumps and exponentially falling tails and finds that a mixture of simple plug-in and cross-validation methods produces bandwidths with a quite stable performance.
Bandwidth Selection in Kernel Density Estimation
A new bandwidth selector has promising behavior with respect to the visual error criterion, especially in the cases of limited sample sizes, and a new data-driven bandwidth selector is proposed which is thought to be without these drawbacks.
Bandwidth Selection in
This paper summarizes the most important arguments for each criterion and gives an overview over the existing bandwidth selection methods and it will be shown how the presented bandwidth selectors can be implemented in a much faster way.
Cross‐validation Bandwidth Matrices for Multivariate Kernel Density Estimation
The results suggest that SCV for full bandwidth matrices is the most reliable of the CV methods, and observe that experience from the univariate setting can sometimes be a misleading guide for understanding bandwidth selection in the multivariate case.
Exploring the Use of Variable Bandwidth Kernel Density Estimators
The use of one implementation of a variable kernel estimator in conjunction with several rules and procedures for bandwidth selection applied to several real datasets showed fewer modes than those chosen by the Silverman test, especially those distributions in which multimodality was caused by several noisy minor modes.
A Brief Survey of Bandwidth Selection for Density Estimation
Abstract There has been major progress in recent years in data-based bandwidth selection for kernel density estimation. Some “second generation” methods, including plug-in and smoothed bootstrap
Practical bandwidth selection in deconvolution kernel density estimation
Practical aspects of kernel smoothing for binary regression and density estimation
This thesis explores the practical use of kernel smoothing in three areas: binary regression, density estimation and Poisson regression sample size calculations. Both nonparametric and


A reliable data-based bandwidth selection method for kernel density estimation
The key to the success of the current procedure is the reintroduction of a non- stochastic term which was previously omitted together with use of the bandwidth to reduce bias in estimation without inflating variance.
On a class of kernel density estimate bandwidth selectors
A class of data-based bandwidth selection procedures for kernel density estimation is investigated. These procedures yield bandwidth estimates ĥ which have a (poor) relative error rate of convergence
An automatic bandwidth selector for kernel density estimation
SUMMARY To select a proper bandwidth is a critical step in kernel density estimation. It is well known that the bandwidth selected by cross-validation has a large variability. This difficulty limits
Lower bounds for bandwidth selection in density estimation
If the error criterion is Integrated Squared Error then, even under very strong assumptions on the underlying density, relative error of the selected bandwidth cannot be reduced below ordern−1/10 (as the sample size grows), and performance in minimizing MISE should become the benchmark for measuring performance of bandwidth selection methods.
The Maximal Smoothing Principle in Density Estimation
Abstract We propose a widely applicable method for choosing the smoothing parameters for nonparametric density estimators. It has come to be realized in recent years (e.g., see Hall and Marron 1987;
Local Bandwidth Selection for Density Estimation
The problem of choosing a local value of the bandwidth h for a kernel density estimate is considered. Estimates of the density f at a given point are needed in the estimation of the asymptotic
Best Possible Constant for Bandwidth Selection
The point of this paper is to show how semiparametric arguments can be employed to calculate the best possible constant coefficient, i.e. an analog of the usual Fisher Information, in this convergence.
Rejoinder to ``Practical performance of several data driven bandwidth selectors"
It is seen that, while most of the bandwidth selectors gain some in terms of variance reduction, some of them lose a lot in Terms of increased bias resulting in inferior overall performance when compared to Least Squares Cross-Validation.