A Comprehensive Overview of Online Resources to Identify and Predict Bacterial Essential Genes

@article{Peng2017ACO,
  title={A Comprehensive Overview of Online Resources to Identify and Predict Bacterial Essential Genes},
  author={Chong Peng and Yan Lin and Hao Luo and F. Gao},
  journal={Frontiers in Microbiology},
  year={2017},
  volume={8}
}
Genes critical for the survival or reproduction of an organism in certain circumstances are classified as essential genes. Essential genes play a significant role in deciphering the survival mechanism of life. They may be greatly applied to pharmaceutics and synthetic biology. The continuous progress of experimental method for essential gene identification has accelerated the accumulation of gene essentiality data which facilitates the study of essential genes in silico. In this article, we… 

Figures and Tables from this paper

Network-based methods for predicting essential genes or proteins: a survey
TLDR
The advanced methods for network-based prediction of essential genes/proteins are surveyed and the challenges and directions for future research are presented.
CEG 2.0: an updated database of clusters of essential genes including eukaryotic organisms
TLDR
The first database to store prokaryotic essential genes in clusters, CEG (Clusters of Essential Genes), was constructed, and the service of essential gene prediction for both proKaryotes and eukaryotes was provided.
Machine learning approach to gene essentiality prediction: a review
TLDR
This review examines various methods applied to essential gene prediction task, their strengths, limitations and the factors responsible for effective computational prediction of essential genes to present the standard procedure and resources available for predicting essential genes in organisms.
Comparative Gene Essentiality across the Bacterial Domain
TLDR
It is shown there is a modest positive correlation between genome size and number of essential genes, and a group of genes that while being highly conserved are also typically non-essential are found, suggesting that some housekeeping genes confer a significant survival benefit in nature while being non- essential in vitro.
Geptop 2.0: Accurately Select Essential Genes from the List of Protein-Coding Genes in Prokaryotic Genomes.
TLDR
An automatically computational tool named Geptop is developed to select essential genes from the set of protein-coding genes in a prokaryotic genome, which utilizes the strategy of reciprocally best hit for homology search and evolutionary distance for weight assigning.
Essential gene prediction using limited gene essentiality information–An integrative semi-supervised machine learning strategy
TLDR
A novel integrative approach has been proposed for essential gene prediction that shows universality in application to both Prokaryotes and Eukaryotes with limited labeled data.
Expression-based prediction of human essential genes and candidate lncRNAs in cancer cells
TLDR
A new machine learning approach, XGEP, is developed to predict essential genes and candidate lncRNAs in cancer cells through utilization of relevant features derived from the TCGA transcriptome dataset through collaborative embedding and achieves significantly higher performance than previous models.
Variability of Bacterial Essential Genes Among Closely Related Bacteria: The Case of Escherichia coli
TLDR
The quality of the genome-sequences in the database is not enough to make accurate predictions about the conservation of essential-genes among members of this bacterial species, mainly in the case of Escherichia coli.
The Escherichia coli bcsB gene is a conditional essential gene in the context of functional cellulose synthesis
TLDR
It is proposed that bcsB is a conditionally essential gene in E. coli in strains that have a functional capacity to synthesize cellulose, presumably due to the activity of BcsB in the translocation of this polymer across the periplasm.
...
...

References

SHOWING 1-10 OF 135 REFERENCES
Three computational tools for predicting bacterial essential genes.
TLDR
The CEFG research group has constructed three online services to predict essential genes in bacterial genomes, applicable for single gene sequences without annotated functions, single genes with definite names, and complete genomes of bacterial strains.
Advances and perspectives in computational prediction of microbial gene essentiality.
TLDR
The advances made in the use of computational methods to predict microbial gene essentiality are reviewed, perspectives for the future of these techniques and the possible practical applications of essential genes are reviewed.
Identifying Bacterial Essential Genes Based on a Feature-Integrated Method
TLDR
A novel computational method is developed that can effectively predict bacterial essential genes by extracting and integrating homologous features, protein domain feature, gene intrinsic features, and network topological features, which could have practical applications in accurately investigating essential genes from broad bacterial species and also provide helpful guidelines for the minimal cell.
Predicting essential genes in fungal genomes.
TLDR
This work identifies 14 characteristic sequence features potentially associated with essentiality, such as localization signals, codon adaptation, GC content, and overall hydrophobicity, and trained a machine learning classifier capable of predicting essential genes in S. mikatae and verified a subset of the predictions with eight in vivo knockouts.
Predicting essential genes based on network and sequence analysis.
TLDR
Employing the topological features in the protein interaction network and the sequence properties, a machine learning classifier capable of predicting essential genes is built.
Combining Genome-Scale Experimental and Computational Methods To Identify Essential Genes in Rhodobacter sphaeroides
TLDR
Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives and transposon sequencing is used to identify genes that are essential under several growth conditions, enabling a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.
Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review
TLDR
The state-of-the-art methods for identifying essential genes and proteins based on machine learning and network topological features are shown, the progress and limitations of current methods are pointed out, and the challenges and directions for further research are discussed.
DEG 10, an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements
TLDR
DEG 10 includes essential genomic elements under different conditions in three domains of life, with customizable BLAST tools.
Protein Localization Analysis of Essential Genes in Prokaryotes
TLDR
A comprehensive protein localization analysis of essential genes in 27 prokaryotes including 24 bacteria, 2 mycoplasmas and 1 archaeon has been performed and shows that proteins encoded by essential genes are enriched in internal location sites, while exist in cell envelope with a lower proportion compared with non-essential ones.
A new computational strategy for predicting essential genes
TLDR
FWM can remarkably improve the accuracy of essential gene prediction and may be used as an alternative method for other classification work and contribute substantially to the knowledge of the minimum gene sets required for living organisms and the discovery of new drug targets.
...
...