Corpus ID: 219177347

Random Spatial Forests

@article{Wai2020RandomSF,
  title={Random Spatial Forests},
  author={Travis Hee Wai and Michael Tesauro Young and Adam Szpiro},
  journal={arXiv: Methodology},
  year={2020}
}
We introduce random spatial forests, a method of bagging regression trees allowing for spatial correlation. Our main contribution is the development of a computationally efficient tree building algorithm which selects each split of the tree adjusting for spatial correlation. We evaluate two different approaches for estimation of random spatial forests, a pseudo-likelihood approach combining random forests with kriging and a non-parametric version for a general class of spatial smoothers. We… Expand

Figures and Tables from this paper

References

SHOWING 1-10 OF 35 REFERENCES
Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables
TLDR
A random forest for spatial predictions framework (RFsp) where buffer distances from observation points are used as explanatory variables, thus incorporating geographical proximity effects into the prediction process, and appears to be especially attractive for building multivariate spatial prediction models that can be used as “knowledge engines” in various geoscience fields. Expand
Comparing spatial regression to random forests for large environmental data sets
TLDR
It is found that the spatial regression model with transformations, and a subsequent selection of significant covariates, has cross-validation performance comparable to random forests, and prediction interval coverage is close to nominal for each method. Expand
Random Forests
  • L. Breiman
  • Mathematics, Computer Science
  • Machine Learning
  • 2004
TLDR
Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression. Expand
Fixed rank kriging for very large spatial data sets
Spatial statistics for very large spatial data sets is challenging. The size of the data set, "n", causes problems in computing optimal spatial predictors such as kriging, since its computationalExpand
Improve ground-level PM2.5 concentration mapping using a random forests-based geostatistical approach.
TLDR
A random forests-based geostatistical approach to improve one of the most commonly used satellite-derived, gridded PM2.5 datasets with a refined spatial resolution (0.01°) and enhanced accuracy by combining the random forests machine learning method and the kriging family of methods. Expand
Gaussian predictive process models for large spatial data sets.
TLDR
This work achieves the flexibility to accommodate non-stationary, non-Gaussian, possibly multivariate, possibly spatiotemporal processes in the context of large data sets in the form of a computational template encompassing these diverse settings. Expand
A comparison of statistical and machine learning methods for creating national daily maps of ambient PM2.5 concentration.
TLDR
It is suggested that future exposure assessment methods for regional PM2.5 incorporate information from neighboring sites when deriving predictions at unsampled locations or attempt to account for spatial dependence. Expand
About regression-kriging: From equations to case studies
TLDR
This paper discusses the characteristics of regression-kriging (RK), its strengths and limitations, and illustrates these with a simple example and three case studies, and addresses pragmatic issues: implementation of RK in existing software packages, comparison ofRK with alternative interpolation techniques, and practical limitations to using RK. Expand
Covariance Tapering for Likelihood-Based Estimation in Large Spatial Data Sets
Maximum likelihood is an attractive method of estimating covariance parameters in spatial models based on Gaussian processes. But calculating the likelihood can be computationally infeasible forExpand
That BLUP is a Good Thing: The Estimation of Random Effects
In animal breeding, Best Linear Unbiased Prediction, or BLUP, is a technique for estimating genetic merits. In general, it is a method of estimating random effects. It can be used to derive theExpand
...
1
2
3
4
...