Eigenvector Spatial Filtering for Large Data Sets: Fixed and Random Effects Approaches

  title={Eigenvector Spatial Filtering for Large Data Sets: Fixed and Random Effects Approaches},
  author={Daisuke Murakami and Daniel A. Griffith},
  journal={Geographical Analysis},
Eigenvector spatial filtering (ESF) is a spatial modeling approach, which has been applied in urban and regional studies, ecological studies, and so on. However, it is computationally demanding, and may not be suitable for large data modeling. The objective of this study is developing fast ESF and random effects ESF (RE-ESF), which are capable of handling very large samples. To achieve it, we accelerate eigen-decomposition and parameter estimation, which make ESF and RE-ESF slow. The former is… 
Spatially varying coefficient modeling for large datasets: Eliminating N from spatial regressions
Abstract While spatially varying coefficient (SVC) modeling is popular in applied science, its computational burden is substantial. This is especially true if a multiscale property of SVC is
The Importance of Scale in Spatially Varying Coefficient Modeling
The objective of this study is to show that capturing the “spatial scale” of each data relationship is crucially important to make SVC modeling more stable and, in doing so, adds flexibility.
spfilteR: An R Package for Semiparametric Spatial Filtering with Eigenvectors in (Generalized) Linear Models
Eigenvector-based spatial filtering constitutes a highly flexible semiparametric approach to account for spatial autocorrelation in a regression framework. It combines judiciously selected
A memory-free spatial additive mixed modeling for big spatial data
This study develops a spatial additive mixed modeling approach estimating spatial and non-spatial effects from large samples, such as millions of observations, with a Moran coefficient-based approach and applies it to an income analysis using United States (US) data in 2015.
Balancing Spatial and Non‐Spatial Variation in Varying Coefficient Modeling: A Remedy for Spurious Correlation
This study discusses the importance of balancing spatial and non-spatial variation in spatial regression modeling. Unlike spatially varying coefficients (SVC) modeling, which is popular in spatial
spmoran: An R package for Moran's eigenvector-based spatial regression analysis
The objective of this study is illustrating how to use "spmoran," which is an R package for Moran's eigenvector-based spatial regression analysis, which applies ESF and RE-ESF models for a land price analysis.
Low rank spatial econometric models
This article presents a re-structuring of spatial econometric models in a linear mixed model framework. To that end, it proposes low rank spatial econometric models that are robust to the existence
House price estimation using an eigenvector spatial filtering approach
Purpose Numerous geo-statistical methods have been developed to analyse the spatial dimension and composition of house prices. Despite these advances, spatial filtering remains an under-researched
Eigenvector Spatial Filtering-Based Logistic Regression for Landslide Susceptibility Assessment
The ESFLR model adequately addressed the spatial autocorrelation of residuals by reducing the Moran’s I value of the residuals to 0.0270, and is an effective and flexible method for landslide analysis.
Disentangling spatial and environmental effects: flexible methods for community ecology and macroecology
This work compared how methods in four broad families of statistical models estimated the contribution of the environment and space to variation in species binary occurrence and abundance and provided general guidelines on the usefulness of different models under different ecological and sampling scenarios.


Random effects specifications in eigenvector spatial filtering: a simulation study
The main findings of this simulation are that in many cases, parameter estimates of the extended RE-ESF are more accurate than other ESF models; the elimination of the spatial component confounding with explanatory variables results in biased parameter estimates; efficiency of an accuracy maximization-based conventional ESF is comparable to RE- ESF inMany cases.
Application of Lasso to the Eigenvector Selection Problem in Eigenvector Based Spatial Filtering
Eigenvector based spatial filtering is one of the well-used approaches to model spatial autocorrelation among the observations or errors in a regression model. In this approach, subset of
Fixed rank kriging for very large spatial data sets
Spatial statistics for very large spatial data sets is challenging. The size of the data set, "n", causes problems in computing optimal spatial predictors such as kriging, since its computational
Semiparametric Filtering of Spatial Autocorrelation: The Eigenvector Approach
In the context of spatial regression analysis, several methods can be used to control for the statistical effects of spatial dependencies among observations. Maximum likelihood or Bayesian approaches
Accounting for Spatial Autocorrelation in Linear Regression Models Using Spatial Filtering with Eigenvectors
Ordinary least squares linear regression models are frequently used to analyze and model spatial phenomena. These models are useful and easily interpreted, and the assumptions, strengths, and
Interpretation and Computation of Estimates from Regression Models using Spatial Filtering
Abstract Spatial filtering in various forms has become a popular way to address spatial dependence in statistical models (Griffith, 2003; Tiefelsdorf & Griffith, 2007). However, spatial filtering
Gaussian predictive process models for large spatial data sets.
This work achieves the flexibility to accommodate non-stationary, non-Gaussian, possibly multivariate, possibly spatiotemporal processes in the context of large data sets in the form of a computational template encompassing these diverse settings.
The SAR Model for Very Large Datasets: A Reduced Rank Approach
The SAR model is widely used in spatial econometrics to model Gaussian processes on a discrete spatial lattice, but for large datasets, fitting it becomes computationally prohibitive, and hence, its
Eigenvector selection with stepwise regression techniques to construct eigenvector spatial filters
A simulation experiment shows that, with a well-prepared candidate eigenvector set, ESF can effectively account for spatial autocorrelation and achieve computational efficiency, and a nonlinear equation is proposed for constructing an ideal candidate eigervector set based on the results of the simulation experiment.
Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets
A class of highly scalable nearest-neighbor Gaussian process (NNGP) models to provide fully model-based inference for large geostatistical datasets are developed and it is established that the NNGP is a well-defined spatial process providing legitimate finite-dimensional Gaussian densities with sparse precision matrices.