Spatial clustering of the failure to geocode and its implications for the detection of disease clustering.

@article{Zimmerman2008SpatialCO,
  title={Spatial clustering of the failure to geocode and its implications for the detection of disease clustering.},
  author={Dale L. Zimmerman and Xiangming Fang and Soumya Mazumdar},
  journal={Statistics in Medicine},
  year={2008},
  volume={27},
  number={21},
  pages={4254--4266}
}
Geocoding a study population as completely as possible is an important data assimilation component of many spatial epidemiologic studies. Unfortunately, complete geocoding is rare in practice. The failure of a substantial proportion of study subjects' addresses to geocode has consequences for spatial analyses, some of which are not yet fully understood. This article explicitly demonstrates that the failure to geocode can be spatially clustered, and it investigates the implications of this for… 
Reestimating a minimum acceptable geocoding hit rate for conducting a spatial analysis
TLDR
Results indicate that variations in intensity, clustering and aggregation levels lead to different minimum acceptable geocoding match rates, and specific techniques such as cluster detection seem to be especially sensitive to the existence of non-geocoded data, so the highly approved 85% geocoded rate may need to be raised.
Geographic variability in geocoding success for West Nile virus cases in South Dakota.
Minimum geocoding match rates: an international study of the impact of data and areal unit sizes
TLDR
An international investigation into the impact of the (in)ability to geocode an address on the resulting spatial pattern is conducted, finding that the level of geocoding success depends on the number of points and the number of areal units under analysis, but generally showing that the necessary levels of geocoding success are lower than found in previous research.
Local indicators of geocoding accuracy (LIGA): theory and application
TLDR
This paper finds that a family of curves describes the relationship between perturbability and positional error, and uses these curves to evaluate sensitivity of alternative spatial weight specifications to positional error both globally and locally.
Geocoding Error, Spatial Uncertainty, and Implications for Exposure Assessment and Environmental Epidemiology
TLDR
Surfaces of spatial patterns in errors are developed in order to identify locations in the study area where exposures may be over-/under-estimated and methods for quantifying and interpreting geocoding error with respect to exposure misclassification are suggested.
Using Imputation to Provide Location Information for Nongeocoded Addresses
TLDR
This manuscript develops and evaluates a set of imputation strategies for dealing with missing spatial information from nongeocoded addresses and indicates that the imputation strategy based on using available population-based age, gender, and race information performed the best overall at the county, tract, and block group levels.
Evaluation of geoimputation strategies in a large case study
TLDR
Characteristics of the estimated records such as the demographic profile and population density information provide a measure of certainty of geographic imputation in geoimputation methods.

References

SHOWING 1-10 OF 37 REFERENCES
Estimating the intensity of a spatial point process from locations coarsened by incomplete geocoding.
TLDR
Substantial improvements in the estimation quality of coarsened-data analyses relative to analyses of only the observations that geocode are demonstrated via simulation and an example from a rural health study in Iowa.
Geographic bias related to geocoding in epidemiologic studies
TLDR
Geographic bias in GIS analyses with unrepresentative data owing to missing geocodes is described, using as an example a spatial analysis of prostate cancer incidence among whites and African Americans in Virginia, 1990–1999.
Geocoding Addresses from a Large Population-based Study: Lessons Learned
TLDR
This work develops an iterative geocoding process that would achieve a high match rate in a large population-based health study and provides practical information for investigators who are considering the use of GIS in their population health research.
Conceptual and practical issues in the detection of local disease clusters: a study of mortality in Hamilton, Ontario
Recent advances in local spatial statistics and operational computing capacity have led to growing interest in the detection of disease clusters for public health surveillance and for improving
Positional Accuracy of Geocoded Addresses in Epidemiologic Research
TLDR
The suitability of geocoding for epidemiologic research depends on the level of spatial resolution required to assess exposure, and although sources of error in positional accuracy for geocoded addresses exist, geocoding of addresses is, for the most part, very accurate.
Modeling the probability distribution of positional errors incurred by residential address geocoding
TLDR
Mixtures of bivariate t distributions with few components appear to be flexible enough to fit many positional error datasets associated with geocoding, yet parsimonious enough to be feasible for nascent applications of measurement-error methodology to spatial epidemiology.
Positional error in automated geocoding of residential addresses
  • M. Cayo, T. Talbot
  • Environmental Science
    International journal of health geographics
  • 2003
TLDR
This study evaluated the positional error caused during automated geocoding of residential addresses and how this error varies between population densities, and an alternative method of geocoding using residential property parcel data.
Positional Accuracy of Two Methods of Geocoding
TLDR
GPS measurements at homes in a case–control study of non-Hodgkin lymphoma in Iowa indicate greater positional errors for rural addresses compared with town addresses.
Locational uncertainty in georeferencing public health datasets
TLDR
This study assessed the potential locational bias introduced using street centerline data and evaluated georeferencing effects on a location-dependent, exposure assessment process.
Geocoding Health Data: The Use of Geographic Codes in Cancer Prevention and Control, Research, and Practice - Edited by Gerard Rushton, Marc P. Armstrong, Josephine Gittler, Barry R. Greene, Claire E. Pavlik, Michele M. West, and Dale L. Zimmerman
TLDR
This volume focuses on the use of cancer-related health data and is specifically designed to be a reference for cancer registries; however, the principles and applications are easily transferable for use in other disciplines.