# NONSEPARABLE DYNAMIC NEAREST NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS.

@article{Datta2015NONSEPARABLEDN, title={NONSEPARABLE DYNAMIC NEAREST NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS.}, author={Abhirup Datta and Sudipto Banerjee and Andrew O. Finley and Nicholas A. S. Hamm and Martijn Schaap}, journal={The annals of applied statistics}, year={2015}, volume={10 3}, pages={ 1286-1316 } }

Particulate matter (PM) is a class of malicious environmental pollutants known to be detrimental to human health. Regulatory efforts aimed at curbing PM levels in different countries often require high resolution space-time maps that can identify red-flag regions exceeding statutory concentration limits. Continuous spatio-temporal Gaussian Process (GP) models can deliver maps depicting predicted PM levels and quantify predictive uncertainty. However, GP-based approaches are usually thwarted by…

## 101 Citations

### Bag of DAGs: Flexible Nonstationary Modeling of Spatiotemporal Dependence

- Computer Science
- 2022

A class of nonstationary processes that characterize varying directional associations in space and time for point-referenced data that lead to interpretable nonstationarity and scalability for large data due to sparsity of all DAGs in the bag.

### On nearest‐neighbor Gaussian process models for massive spatial data

- Computer ScienceWiley interdisciplinary reviews. Computational statistics
- 2016

A multivariate data analysis is presented which demonstrates how the nearest neighbor approach yields inference indistinguishable from the full rank GP despite being several times faster.

### Nearest-Neighbor Geostatistical Models for Non-Gaussian Data

- Computer Science
- 2021

This work develops a class of nearest neighbor mixture transition distribution process (NNMP) models that provides flexibility and scalability for non-Gaussian geostatistical data, and develops conditions to construct general NNMP models with pre-specified stationary marginal distributions.

### Bag of DAGs: Flexible & Scalable Modeling of Spatiotem- poral Dependence

- Computer Science
- 2021

A main contribution of this paper is the development of a well-defined nonstationary process using multiple yet simple directed acyclic graphs (DAGs), which leads to computational efficiency, flexibility, and interpretability.

### An Additive Gaussian Process Approximation for Large Spatio-Temporal Data

- Computer Science
- 2017

A new additive spatio-temporal Gaussian process approximation to model complex dependence structures for large spatio/temporal data and inferential and computational benefits of the proposed method are presented.

### Scalable penalized spatiotemporal land-use regression for ground-level nitrogen dioxide.

- Environmental ScienceThe annals of applied statistics
- 2021

Nitrogen dioxide (NO2) is a primary constituent of traffic-related air pollution and has well established harmful environmental and human-health impacts. Knowledge of the spatiotemporal distribution…

### Efficient Spatio-Temporal Gaussian Regression via Kalman Filtering

- Mathematics, Computer ScienceAutom.
- 2020

### High-Dimensional Bayesian Geostatistics.

- Computer ScienceBayesian analysis
- 2017

Two approaches can be described as model-based solutions for big spatiotemporal datasets that ensure that the algorithmic complexity has ~ n floating point operations (flops), where n the number of spatial locations (per iteration).

### Spatial meshing for general Bayesian multivariate models

- Computer Science
- 2022

This article introduces Bayesian models of spatially referenced data in which the likelihood or the latent process (or both) are not Gaussian and introduces the simplified manifold preconditioner adaptation (SiMPA) algorithm which uses second order information about the target but avoids expensive matrix operations.

### An additive approximate Gaussian process model for large spatio‐temporal data

- Computer ScienceEnvironmetrics
- 2019

A new computationally efficient additive approximate Gaussian process that incorporates a computational‐complexity‐reduction method and a separable covariance function, which can flexibly capture various spatio‐temporal dependence structures is proposed.

## References

SHOWING 1-10 OF 85 REFERENCES

### Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets

- Computer ScienceJournal of the American Statistical Association
- 2016

A class of highly scalable nearest-neighbor Gaussian process (NNGP) models to provide fully model-based inference for large geostatistical datasets are developed and it is established that the NNGP is a well-defined spatial process providing legitimate finite-dimensional Gaussian densities with sparse precision matrices.

### Gaussian predictive process models for large spatial data sets

- Computer ScienceJournal of the Royal Statistical Society. Series B, Statistical methodology
- 2008

This work achieves the flexibility to accommodate non‐stationary, non‐Gaussian, possibly multivariate, possibly spatiotemporal processes in the context of large data sets in the form of a computational template encompassing these diverse settings.

### Fixed Rank Filtering for Spatio-Temporal Data

- Environmental Science
- 2010

Datasets from remote-sensing platforms and sensor networks are often spatial, temporal, and very large. Processing massive amounts of data to provide current estimates of the (hidden) state from…

### SUPPLEMENT: NON-SEPARABLE DYNAMIC NEAREST-NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS

- Mathematics
- 2016

where for any positive number x, [x] denotes the greatest integer not exceeding x. So the eligible set for a space-time point consists of m-nearest neighbors from the time levels j and j − 1, [m/2]…

### Bayesian hierarchical spatio‐temporal smoothing for very large datasets

- Mathematics
- 2012

Spatio‐temporal statistics is prone to the curse of dimensionality: one manifestation of this is inversion of the data–covariance matrix, which is not in general feasible for very‐large‐to‐massive…

### Bayesian dynamic modeling for large space-time datasets using Gaussian predictive processes

- Computer ScienceJ. Geogr. Syst.
- 2012

This paper extends the applicability of a previously proposed class of dynamic space-time models by enabling them to accommodate large datasets by using a low-rank predictive process to reduce the dimensionality of the data and ease the computational burden of estimating the spatio-temporal process of interest.

### A spatially varying coefficient model for mapping PM10 air quality at the European scale

- Environmental Science
- 2015

### Dynamic Models for Spatio-Temporal Data

- Environmental Science
- 2007

We propose a model for nonstationary spatio-temporal data. To account for spatial variability, we model the mean function at each time period as a locally-weighted mixture of linear regressions. To…

### Nonstationary multivariate process modeling through spatially varying coregionalization

- Mathematics
- 2004

Models for the analysis of multivariate spatial data are receiving increased attention these days. In many applications it will be preferable to work with multivariate spatial processes to specify…

### Tapering spatio temporal models

- Mathematics, Computer Science
- 2011

Since spatio-temporal data are tipically of large dimension it is proposed to perform estimation both through maximum likelihood by means of the EM algorithm and a modi ed version of it exploiting the covariance tapering likelihood function.