Measuring European population stratification with microarray genotype data.

@article{Bauchet2007MeasuringEP,
  title={Measuring European population stratification with microarray genotype data.},
  author={Marc Bauchet and Brian P Mcevoy and Laurel Pearson and Ellen E. Quillen and Tamara Sarkisian and Kristine Hovhannesyan and Ranjan Deka and Daniel G. Bradley and Mark David Shriver},
  journal={American journal of human genetics},
  year={2007},
  volume={80 5},
  pages={
          948-56
        }
}
A proper understanding of population genetic stratification--differences in individual ancestry within a population--is crucial in attempts to find genes for complex traits through association mapping. We report on genomewide typing of approximately 10,000 single-nucleotide polymorphisms in 297 individuals, to explore population structure in Europeans of known and unknown ancestry. The results reveal the presence of several significant axes of stratification, most prominently in a northern… Expand

Figures, Tables, and Topics from this paper

Inferring Geographic Coordinates of Origin for Europeans Using Small Panels of Ancestry Informative Markers
TLDR
A systematic exploration of the extent to which geographic coordinates of origin within Europe can be predicted, with small panels of SNPs, and proposes SNPs that can prove extremely useful in a variety of different settings, such as stratification correction or genetic ancestry testing, and the study of the history of European populations. Expand
Discerning the Ancestry of European Americans in Genetic Association Studies
TLDR
Empirical patterns of population structure in European Americans are investigated, analyzing 4,198 samples from four genome-wide association studies to show that components roughly corresponding to northwest European, southeast European, and Ashkenazi Jewish ancestry are the main sources of European American population structure. Expand
Optimal selection of genetic variants for adjustment of population stratification in European association studies
TLDR
This work compares the correction performance of classical and robust principal components with the use of AIMs selected according to four different methods, and finds the first classical and the first two robust PCs achieved the lowest type I error, followed at some distance by the first eight $IN$-AIMs. Expand
Accounting for ancestry: population substructure and genome-wide association studies.
TLDR
It is suggested that it will be important to explore results in homogeneous population subsets as the authors seek to define the extent to which genomic variation influences complex phenotypes. Expand
Application of Ancestry Informative Markers to Association Studies in European Americans
TLDR
A broad assessment of European American population structure is provided, and several sets of ancestry informative markers (AIMs) are provided for inferring ancestry in European Americans, a structured population that is often sampled in association studies. Expand
Analysis and Application of European Genetic Substructure Using 300 K SNP Information
TLDR
Two sets of European substructure ancestry informative markers (ESAIMs) were identified that provide substantial substructure information that can be used for improving error rates in association testing of candidate genes and in replication studies of WGA scans. Expand
Investigation of the fine structure of European populations with applications to disease association studies
TLDR
A method to predict the ethnic origin of samples by comparing the sample genotypes with those from a reference set of samples of known origin is presented, which can be performed using just summary information on the known samples, and individual genotype data are not required. Expand
European Population Genetic Substructure: Further Definition of Ancestry Informative Markers for Distinguishing among Diverse European Ethnic Groups
TLDR
The studies demonstrated that most allele frequency differences between different European groups could be controlled effectively in analyses using these AIM sets and should be widely applicable to ongoing studies to confirm and delineate specific disease susceptibility candidate regions without the necessity of performing additional genome-wide SNP studies in additional subject sets. Expand
Ancestry informative markers for fine-scale individual assignment to worldwide populations
TLDR
A systematic evaluation of the resolution that can be achieved for the inference of genetic ancestry, even when small panels of genetic markers are used and a decision tree for the prediction of individual ancestry is proposed. Expand
Title : Genes mirror geography within Europe
Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedlyExpand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 31 REFERENCES
Demonstrating stratification in a European American population
TLDR
The failure of standard methods to detect stratification in case-control association studies indicates that new methods may be required, and a SNP in the gene LCT that varies widely in frequency across Europe was strongly associated with height. Expand
Large-scale SNP analysis reveals clustered and continuous patterns of human genetic variation
TLDR
Analysis of 11,555 single nucleotide polymorphisms in 203 individuals from 12 diverse human populations provides a valuable resource for the definition of marker panels to detect and control for population stratification in population-based gene identification studies. Expand
Principal components analysis corrects for stratification in genome-wide association studies
TLDR
This work describes a method that enables explicit detection and correction of population stratification on a genome-wide scale and uses principal components analysis to explicitly model ancestry differences between cases and controls. Expand
The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs
TLDR
A non-uniform distribution of human genetic substructure is illustrated, an instructional and useful paradigm for education and research and strong correlations between inter-marker distance and both locus-specific FST levels and branch lengths are demonstrated. Expand
European Population Substructure: Clustering of Northern and Southern Populations
TLDR
Using a genome-wide single nucleotide polymorphism (SNP) panel, population structure in a diverse group of Europeans and European Americans was observed, with a consistent and reproducible distinction between “northern” and “southern" European population groups. Expand
Genomic control for association studies.
TLDR
The performance of the genomic control method is quite good for plausible effects of liability genes, which bodes well for future genetic analyses of complex disorders. Expand
Detecting association in a case‐control study while correcting for population stratification
TLDR
A way to measure and correct for stratification by genotyping a moderate number of unlinked genetic markers in the same set of cases and controls in which a candidate association was found is described. Expand
Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure
TLDR
Analysis of the 993-locus dataset corroborates earlier results: if enough markers are used with a sufficiently large worldwide sample, individuals can be partitioned into genetic clusters that match major geographic subdivisions of the globe, with some individuals from intermediate geographic locations having mixed membership in the clusters that correspond to neighboring regions. Expand
Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.
TLDR
Extensions to the method of Pritchard et al. for inferring population structure from multilocus genotype data are described and methods that allow for linkage between loci are developed, which allows identification of subtle population subdivisions that were not detectable using the existing method. Expand
Inference of population structure using multilocus genotype data.
TLDR
A model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations that can be applied to most of the commonly used genetic markers, provided that they are not closely linked. Expand
...
1
2
3
4
...