Interpreting functional effects of coding variants: challenges in proteome-scale prediction, annotation and assessment
@article{Shameer2016InterpretingFE, title={Interpreting functional effects of coding variants: challenges in proteome-scale prediction, annotation and assessment}, author={Khader Shameer and Lokesh P. Tripathi and Krishna R. Kalari and Joel T. Dudley and Ramanathan Sowdhamini}, journal={Briefings in Bioinformatics}, year={2016}, volume={17}, number={5}, pages={841--862} }
Accurate assessment of genetic variation in human DNA sequencing studies remains a nontrivial challenge in clinical genomics and genome informatics. Ascribing functional roles and/or clinical significances to single nucleotide variants identified from a next-generation sequencing study is an important step in genome interpretation. Experimental characterization of all the observed functional variants is yet impractical; thus, the prediction of functional and/or regulatory impacts of the various…
19 Citations
A method for scoring the cell type-specific impacts of noncoding variants in personal genomes
- Biology
- Proceedings of the National Academy of Sciences
- 2020
A prioritization tool called OpenCausal is developed which takes as inputs a personal genome and a reference context-specific TF expression profile and returns a list of noncoding variants prioritized according to their impact on chromatin accessibility for any given genomic region of interest.
Accurately annotate compound effects of genetic variants using a context-sensitive framework
- Biology
- Nucleic acids research
- 2017
COPE, a gene-centric variant annotation tool that integrates the entire sequential context in evaluating the functional effects of intra-genic variants, is introduced, identifying numerous cases of multiple-variant compound effects that frequently led to false-positive and false-negative loss-of-function calls by conventional variant-centric tools.
Pleiotropic Variability Score: A Genome Interpretation Metric to Quantify Phenomic Associations of Genomic Variants
- Biology
- 2021
A new genome interpretation metric called Pleiotropic Variability Score (PVS), which uses semantic reasoning to score the relatedness of a genetic variant’s associated phenotypes based on those phenotypes’ relationships in the human phenotype ontology and disease ontology, is developed.
Network-Based Analysis for Biological Discovery
- Biology
- Encyclopedia of Bioinformatics and Computational Biology
- 2019
miRMut: Annotation of mutations in miRNA genes from human whole-exome or whole-genome sequencing
- Biology
- STAR protocols
- 2022
VCF.Filter: interactive prioritization of disease-linked genetic variants from sequencing data
- Biology
- Nucleic Acids Res.
- 2017
VCF.Filter is developed to facilitate the search for disease-linked variants, providing a standalone Java program with a user-friendly interface for interactive variant filtering and annotation, and its support for custom annotations and filtering criteria.
parSMURF, a High Performance Computing tool for the genome-wide detection of pathogenic variants
- Computer Science
- bioRxiv
- 2020
Several prediction problems in Computational Biology and Genomic Medicine are characterized by both big data as well as a high imbalance between examples to be learned, whereby positive examples can…
parSMURF, a high-performance computing tool for the genome-wide detection of pathogenic variants
- Computer Science
- GigaScience
- 2020
Several prediction problems in computational biology and genomic medicine are characterized by both big data as well as a high imbalance between examples to be learned, whereby…
Integrative Analysis of Multi-Omics Data
- Biology
- Encyclopedia of Bioinformatics and Computational Biology
- 2019
Identification of differentially expressed peptides in high‐throughput proteomics data
- Biology
- Briefings Bioinform.
- 2018
It is found that three or four replicates are the minimum requirement for high‐throughput data analysis and confident assignment of significant changes, and that the empirical Bayes method (limma) achieves the highest sensitivity and is recommended for performing differential expression analysis at the peptide level.
References
SHOWING 1-10 OF 197 REFERENCES
Incorporating molecular and functional context into the analysis and prioritization of human variants associated with cancer
- Biology
- J. Am. Medical Informatics Assoc.
- 2012
By aggregating mutations with known disease association at the domain level, the DS-score method was able to discover domain positions enriched with multiple occurrences of deleterious mutations while incorporating relevant functional annotations and can be incorporated into translational bioinformatics tools to characterize rare and novel variants within large-scale sequencing studies.
In silico functional profiling of human disease‐associated and polymorphic amino acid substitutions
- Biology
- Human mutation
- 2010
A range of bioinformatic tools, designed to predict structural and functional sites in protein sequences, were employed to demonstrate that intrinsic biases exist in terms of the distribution of different types of human AAS with respect to specific structural, functional and pathological features.
A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes
- Biology
- Science
- 2012
Functional and evolutionary differences between LoF-tolerant and recessive disease genes and a method for using these differences to prioritize candidate genes found in clinical sequencing studies are described.
ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data
- Biology
- Nucleic acids research
- 2010
The ANNOVAR tool to annotate single nucleotide variants and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP is developed.
Single-Nucleotide Polymorphism Bioinformatics: A Comprehensive Review of Resources
- Biology
- Circulation. Cardiovascular genetics
- 2009
The principal aim of this review is to provide a comprehensive overview of available bioinformatics resources relating to human genetics research, with an emphasis on SNP-centered resources.
A method and server for predicting damaging missense mutations
- Biology
- Nature Methods
- 2010
A new method and corresponding software tool, PolyPhen-2, is presented; it differs from the earlier tool PolyPhen1 in its set of predictive features, alignment pipeline, and method of classification, and its performance, as shown by its receiver operating characteristic curves, was consistently superior.
A general framework for estimating the relative pathogenicity of human genetic variants
- Biology
- Nature Genetics
- 2014
The ability of CADD to prioritize functional, deleterious and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current single-annotation method.
An Abundance of Rare Functional Variants in 202 Drug Target Genes Sequenced in 14,002 People
- Biology
- Science
- 2012
It is concluded that because of rapid population growth and weak purifying selection, human populations harbor an abundance of rare variants, many of which are deleterious and have relevance to understanding disease risk.
Choice of transcripts and software has a large effect on variant annotation
- Biology
- Genome Medicine
- 2013
The extent of differences in annotation of 80 million variants from a whole-genome sequencing study is quantified, the types of apparent errors made by Annovar and VEP are characterised, and their impact on the analysis of DNA variants in genome sequencing studies is discussed.
Associating disease-related genetic variants in intergenic regions to the genes they impact
- Biology
- PeerJ
- 2014
This hybrid method outperforms a genomic distance baseline on a small test set of expression quantitative trait loci, as well as either method individually, and shows the potential for this method to uncover relationships between intergenic SNPs and target genes across chromosomes.