Candidate single nucleotide polymorphism selection using publicly available tools: a guide for epidemiologists.

@article{Bhatti2006CandidateSN,
  title={Candidate single nucleotide polymorphism selection using publicly available tools: a guide for epidemiologists.},
  author={Parveen Bhatti and Deanna M. Church and Joni L. Rutter and Jeffery P. Struewing and Alice J. Sigurdson},
  journal={American Journal of Epidemiology},
  year={2006},
  volume={164},
  number={8},
  pages={794--804}
}
Single nucleotide polymorphisms (SNPs) are the most common form of human genetic variation, with millions present in the human genome. Because only 1% might be expected to confer more than modest individual effects in association studies, the selection of predictive candidate variants for complex disease analyses is formidable. Technologic advances in SNP discovery and the ever-changing annotation of the genome have led to massive informational resources that can be difficult to master across… 

An integrative scoring system for ranking SNPs by their potential deleterious effects
TLDR
A new integrative scoring system for prioritizing SNPs based on their possible deleterious effects within a probabilistic framework is proposed and improves upon other function-assessment systems for SNPs by assigning significantly higher FS scores to already known disease-related SNPs than to neutral SNPs.
Ranking single nucleotide polymorphisms by potential deleterious effects
TLDR
A new integrative scoring system for prioritizing SNPs based on their possible deleterious effects in a probabilistic framework is proposed and the evaluation result is provided on the OMIM (Online Mendelian Inheritance in Man) database, which is one of the most widely-used databases of human genes and genetic disorders.
F-SNP: computationally predicted functional SNPs for disease association studies
TLDR
The Functional Single Nucleotide Polymorphism (F-SNP) database integrates information obtained from 16 bioinformatics tools and databases about the functional effects of SNPs to identify and focus on SNPs with potential deleterious effect to human health.
Selection of Genes and Single Nucleotide Polymorphisms for Fine Mapping Starting From a Broad Linkage Region
TLDR
An empirical two-step fine mapping approach is described, in which candidate genes are prioritized using a bioinformatics approach, and the top genes are chosen for further SNP selection with a linkage disequilibrium based method (Tagger).
Bioinformatic Tools for the Search of Disease-Associated Variations
TLDR
This work states that genome-wide association studies involving more than one million single nucleotide polymorphisms (SNPs), and even large-scale genome resequencing, remain challenging and require sound knowledge of the available bioinformatic data resources.
Exploring functional variant discovery in non-coding regions with SInBaD
TLDR
A new model, SInBaD (Sequence-Information-Based-Decision-model), relies on nucleotide conservation information to evaluate any annotated human variant in all known exons, introns, splice junctions and promoter regions; it demonstrates high prediction accuracy, and variants predicted as functional have a significantly higher occurrence in cancer patients.
Positional conservation and amino acids shape the correct diagnosis and population frequencies of benign and damaging personal amino acid mutations.
TLDR
It is found that accurate discrimination between benign and deleterious mutations is strongly influenced by the long-term (among species) history of positions that harbor those mutations.
In silico analysis of Single Nucleotide Polymorphisms (SNPs) in the Heparin-Binding EGF-like Growth Factor (HBEGF) gene and their allelic profiles in the Sri Lankan population: a comprehensive approach to prioritise SNPs for candidate gene studies
TLDR
A comprehensive approach to select the most appropriate SNP markers for molecular epidemiological studies is illustrated using the Heparin-Binding EGF-like growth factor gene to understand the genetic identity of the Sinhalese in relation to HapMap populations.
Single-Nucleotide Polymorphisms (SNP) Mining and Their Effect on the Tridimensional Protein Structure Prediction in a Set of Immunity-Related Expressed Sequence Tags (EST) in Atlantic Salmon (Salmo salar)
TLDR
Results suggest that the candidate SNPs identified may play a crucial role in the immune response of the Atlantic salmon phenotype with low FASTSNP predicted score risk.

References

SHOWING 1-10 OF 63 REFERENCES
An Evolutionary Perspective on Single-Nucleotide Polymorphism Screening in Molecular Cancer Epidemiology
TLDR
The concept that SNPs altering the conserved amino acids are more likely to be associated with cancer susceptibility is supported, and a molecular evolutionary approach may hold great promise for prioritizing SNPs to be genotyped in future molecular epidemiological studies.
Characterization of single-nucleotide polymorphisms in coding regions of human genes
TLDR
The cSNPs most likely to influence disease, those that alter the amino acid sequence of the encoded protein, are found at a lower rate and with lower allele frequencies than silent substitutions, which likely reflects selection acting against deleterious alleles during human evolution.
SNPselector: a web tool for selecting SNPs for genetic association studies
TLDR
A web-based program, SNPselector, to automate the process of manually choosing thousands of genotyping SNPs for tens or hundreds of genes, and prioritizes these SNPs on their tagging for linkage disequilibrium, SNP allele frequencies and source, function, regulatory potential and repeat status.
Human non-synonymous SNPs: server and survey.
TLDR
A World Wide Web server is presented to predict the effect of an nsSNP on protein structure and function and the dependence of selective pressure on the structural and functional properties of proteins is studied.
Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.
TLDR
It is demonstrated that, although common variation tends to be shared between populations, tagSNPs should be selected separately for populations with different ancestries.
Haplotype tagging for the identification of common disease genes
TLDR
This work shows how knowledge of the common haplotypes and the SNPs that tag them can be used to explain the often complex patterns of LD between adjacent markers and provide key fine-mapping data within regions of strong LD.
TAMAL: an integrated approach to choosing SNPs for genetic studies of human complex traits
TLDR
The TAMAL (Technology And Money Are Limiting) web site is developed to help identify promising SNPs for further investigation and is available free of charge at http://neoref.ils.unc.edu/tamal.
Tag SNP selection for association studies
TLDR
A quasi‐empirical approach towards evaluating the power of large studies with this calculation based upon the SNP genotype and haplotype frequencies estimated in a haplotype discovery panel.
Identifying functional genetic variants in DNA repair pathway using protein conservation analysis.
TLDR
The analysis of nsSNPs in 88 DNA repair genes and their functional evaluation based on the conservation of amino acids among the protein family members demonstrated that >30% of variants of DNA repair proteins are highly likely to affect the function of the proteins drastically.
PupasView: a visual tool for selecting suitable SNPs, with putative pathological effect in genes, for genotyping purposes
TLDR
PupasView is the first resource that integrates phenotypic effects caused by SNPs at both the translational and the transcriptional level and will be of much help in studies of multifactorial disorders, where the use of functional SNPs will increase the sensitivity of the identification of the genes responsible for the disease.