Corpus ID: 1572419

Deep Learning for Genomics: A Concise Overview

@article{Yue2018DeepLF,
  title={Deep Learning for Genomics: A Concise Overview},
  author={Tianwei Yue and Haohan Wang},
  journal={ArXiv},
  year={2018},
  volume={abs/1802.00810}
}
Advancements in genomic research such as high-throughput sequencing techniques have driven modern genomic studies into "big data" disciplines. This data explosion is constantly challenging conventional methods used in genomics. In parallel with the urgent demand for robust algorithms, deep learning has succeeded in a variety of fields such as vision, speech, and text processing. Yet genomics entails unique challenges to deep learning since we are expecting from deep learning a superhuman… Expand
A primer on deep learning in genomics
TLDR
A perspective and primer on deep learning applications for genome analysis and successful applications in the fields of regulatory genomics, variant calling and pathogenicity scores are provided. Expand
Application of deep learning in genomics.
TLDR
The basic concepts in machine learning and artificial neural network are described, followed by elaboration on the workflow of using convolutional neural network in genomics and synthetic biology at the levels of DNA, RNA and protein are provided. Expand
Gene Expression and Protein Function: A Survey of Deep Learning Methods
TLDR
This survey provides an overview of the various types of problems in this domain and the neural architectures that work for these data sets and point out existing deep learning work from other domains that has the potential to be applied to these domains. Expand
Identifying viruses from metagenomic data using deep learning
TLDR
Powered by deep learning and high throughput sequencing metagenomic data, DeepVirFinder significantly improved the accuracy of viral identification and will assist the study of viruses in the era of metagenomics. Expand
Identifying viruses from metagenomic data by deep learning.
TLDR
A reference-free and alignment-free machine learning method, DeepVirFinder, for predicting viral sequences in metagenomic data using deep learning techniques that will significantly accelerate the discovery rate of viruses. Expand
Deep Learning and its Applications : A Survey
26 www.ijerm.com  Abstract— Deep learning has gained its attractions towards the researchers, academicians and several other organizations in the research areas of big data, genomics, naturalExpand
Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning
TLDR
Important aspects of TEs are discussed, such as their structure, importance in the evolution and architecture of the host, and their current classifications and nomenclatures, and current methods and their limitations are addressed. Expand
DeepGx: Deep Learning Using Gene Expression for Cancer Classification
TLDR
This paper proposes a solution to transform the 1-dimensional gene expression values into 2-dimensional images and considers the overall features of the genes and computes features that are needed in the classification task of the deep learning model by using the convolutional neural network (CNN). Expand
Opportunities and challenges for the computational interpretation of rare variation in clinically important genes
TLDR
It is argued that a genomic learning healthcare system must allow for continuous collection and assessment of rare variants, emerging machine learning methods will enable algorithms to predict the clinical impact of rare variant on protein function, and ethical considerations must inform the construction and deployment of all rare-variation triage strategies. Expand
Opportunities and Challenges for Interpreting Rare Variation in Clinically Important Genes
TLDR
A framework for a genomic learning healthcare system with an initial focus on early-onset treatable disease in newborns and actionable pharmacogenomics is proposed and it is argued that emerging machine learning methods will enable algorithms to predict the clinical impact of rare variants on protein function. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 229 REFERENCES
Deep learning in bioinformatics
TLDR
This review will provide valuable insights and serve as a starting point for researchers to apply deep learning approaches in their bioinformatics studies and suggest future research directions. Expand
Genome-Wide Prediction of cis-Regulatory Regions Using Supervised Deep Learning Methods
TLDR
DECRES is introduced, the first supervised deep learning approach for the identification of enhancer and promoter regions in the human genome, and key experimental features that contribute to the predictive performance are identified. Expand
Deep Learning to Analyze RNA-Seq Gene Expression Data
TLDR
A first approximation on the use of deep learning for the analysis of RNA-Seq gene expression profiles data indicates that a straightforward application of deep nets implementations available in public scientific tools and under the conditions described within this work is not enough to outperform simpler models like LASSO. Expand
Opportunities and obstacles for deep learning in biology and medicine
TLDR
It is found that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art. Expand
A review on machine learning principles for multi-view biological data integration
TLDR
It is shown that Bayesian models are able to use prior information and model measurements with various distributions, and a range of deep neural networks can be integrated in multi-modal learning for capturing the complex mechanism of biological systems. Expand
DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences
TLDR
The DanQ model, a novel hybrid convolutional and bi-directional long short-term memory recurrent neural network framework for predicting noncoding function de novo from sequence, improves considerably upon other models across several metrics. Expand
Learning structure in gene expression data using deep architectures, with an application to gene clustering
TLDR
This paper uses deep architectures pre-trained in an unsupervised manner using denoising autoencoders as a preprocessing step for a popular unsuper supervised learning task, and proposes that these architectures can be treated as empirical versions of Deep Belief Networks (DBNs). Expand
PEDLA: predicting enhancers with a deep learning-based algorithmic framework
TLDR
This work developed a deep learning-based algorithmic framework named PEDLA, which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues, and demonstrated that PEDla framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. Expand
Reverse-complement parameter sharing improves deep learning models for genomics
TLDR
It is shown that conventional deep learning models that do not explicitly model this property can produce substantially different predictions on forward and reverse-complement versions of the same DNA sequence, and four new convolutional neural network layers are presented that leverage the reverse- complement property of genomic DNA sequence. Expand
Basset: Learning the regulatory code of the accessible genome with deep convolutional neural networks
TLDR
An open source package Basset is introduced to apply deep convolutional neural networks to learn the functional activity of DNA sequences from genomics data and offers a powerful computational approach to annotate and interpret the noncoding genome. Expand
...
1
2
3
4
5
...