Genotype imputation for genome-wide association studies

@article{Marchini2010GenotypeIF,
  title={Genotype imputation for genome-wide association studies},
  author={Jonathan Marchini and Bryan N. Howie},
  journal={Nature Reviews Genetics},
  year={2010},
  volume={11},
  pages={499-511}
}
In the past few years genome-wide association (GWA) studies have uncovered a large number of convincingly replicated associations for many complex human diseases. Genotype imputation has been used widely in the analysis of GWA studies to boost power, fine-map associations and facilitate the combination of results across studies using meta-analysis. This Review describes the details of several different statistical methods for imputing genotypes, illustrates and discusses the factors that… 
Genotype Imputation from Large Reference Panels.
TLDR
An overview of genotype imputation is presented and the computational techniques that make it possible to impute genotypes from reference panels with millions of individuals are described.
False positive findings during genome-wide association studies with imputation: Influence of allele frequency and imputation accuracy.
TLDR
It is shown that the imputation accuracy is diminished if the real genotypes contain minor alleles, which raises concerns about results from analysis of association studies when rare variants are studied, particularly when case-control studies are unbalanced.
Statistical analysis for genome-wide association study
TLDR
This paper presents an overview of widely used approaches and strategies for analysis of GWAS, offered a general consideration to deal with GWAS data.
The meta-analysis of genome-wide association studies
TLDR
This review distinguishes between the use of meta-analysis to validate previously reported genetic associations and its use for gene discovery, and advocates viewing gene discovery as an exploratory screen that requires independent replication instead of treating it as the application of hundreds of thousands of statistical tests.
Statistical Analysis of GWAS
TLDR
This chapter discusses the statistical analysis of genome-wide association studies, mainly concerned with downstream analysis of association, where the genotypes of each individual at each SNP have been established.
FAPI: Fast and accurate P-value Imputation for genome-wide association study
TLDR
This work develops a fast and accurate P-value imputation (FAPI) method that utilizes summary statistics of common variants only and develops a metric to detect abnormal association at a variant and shows that it had a significantly greater power compared with LD-PAC, a method that quantifies the evidence of spurious associations based on likelihood ratio.
Meta-analysis in genome-wide association studies.
TLDR
This review discusses the key methodological issues in the set-up, information gathering and processing, and analysis of meta-analyses of genome-wide association datasets, and illustrates the application ofMeta-analysis methods in the elucidation of common genetic variants associated with Type 2 diabetes.
Meta-analysis methods for genome-wide association studies and beyond
TLDR
Both widely applied and newer statistical methods for GWAS meta-analysis are overviewed, including issues of interpretation and assessment of sources of heterogeneity, and challenges and solutions surrounding the goals of making meta- analysis data publicly available and building powerful consortia are discussed.
Genome-Wide Association Analysis of Imputed Rare Variants: Application to Seven Common Complex Diseases
TLDR
It is demonstrated here, by simulation, that imputation from an existing scaffold of genome‐wide genotype data up to high‐density reference panels has the potential to identify rare variant associations with complex traits, without the need for costly re‐sequencing experiments.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 58 REFERENCES
Practical aspects of imputation-driven meta-analysis of genome-wide association studies.
TLDR
This review collects a series of guidelines, practical detail and learned experiences from a variety of individuals who have contributed to the subject of imputation.
Missing data imputation and haplotype phase inference for genome-wide association studies
TLDR
This article discusses common features of the best algorithms for haplotype phase inference and missing data imputation in large-scale data sets, as well as some important differences between classes of methods.
A new multipoint method for genome-wide association studies by imputation of genotypes
TLDR
This work proposes a coherent analysis framework that treats the genome-wide association problem as one involving missing or uncertain genotypes, and proposes a model-based imputation method for inferring genotypes at observed or unobserved SNPs, leading to improved power over existing methods for multipoint association mapping.
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies
TLDR
Current algorithms can accurately impute genotypes for untyped markers, which enables researchers to pool data between studies conducted using different SNP sets, and dense marker sets outperform sparser ones in terms of imputation yield and accuracy.
A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
TLDR
It is found that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%–20% lower than those of the closest competing method.
Imputation-Based Analysis of Association Studies: Candidate Regions and Quantitative Traits
TLDR
A new framework for the analysis of association studies is introduced, designed to allow untyped variants to be more effectively and directly tested for association with a phenotype, and results in increased power to detect association, even in cases in which the causal variant is typed.
Genotype-imputation accuracy across worldwide human populations.
Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes
TLDR
The results illustrate the value of large discovery and follow-up samples for gaining further insights into the inherited basis of T2D, and detect at least six previously unknown loci with robust evidence for association.
Analyses and Comparison of Accuracy of Different Genotype Imputation Methods
TLDR
This study investigated and compared the performance of five popular genotype imputation methods, MACH, IMPUTE, fastPHASE, PLINK and Beagle, and showed that a stronger LD and a lower MAF for an untyped marker produced better ARs for all the five methods.
Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip
TLDR
It is argued that the statistical power to detect a causative variant should be the major criterion in study design and that, when taking budgetary considerations into account, the most powerful design may not always correspond to the chip with the highest coverage.
...
1
2
3
4
5
...