Automatic Construction and Natural-Language Description of Nonparametric Regression Models


This paper presents the beginnings of an automatic statistician, focusing on regression problems. Our system explores an open-ended space of statistical models to discover a good explanation of a data set, and then produces a detailed report with figures and naturallanguage text. Our approach treats unknown regression functions nonparametrically using Gaussian processes, which has two important consequences. First, Gaussian processes can model functions in terms of high-level properties (e.g. smoothness, trends, periodicity, changepoints). Taken together with the compositional structure of our language of models this allows us to automatically describe functions in simple terms. Second, the use of flexible nonparametric models and a rich language for composing them in an open-ended manner also results in stateof-the-art extrapolation performance evaluated over 13 real time series data sets from various domains.

Extracted Key Phrases

14 Figures and Tables

Citations per Year

64 Citations

Semantic Scholar estimates that this publication has 64 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Lloyd2014AutomaticCA, title={Automatic Construction and Natural-Language Description of Nonparametric Regression Models}, author={James Robert Lloyd and David K. Duvenaud and Roger B. Grosse and Joshua B. Tenenbaum and Zoubin J. C. Ghahramani}, booktitle={AAAI}, year={2014} }