An improved method for identification of small non-coding RNAs in bacteria using support vector machine

@article{Barman2017AnIM,
  title={An improved method for identification of small non-coding RNAs in bacteria using support vector machine},
  author={Ranjan Kumar Barman and A. Mukhopadhyay and Santasabuj Das},
  journal={Scientific Reports},
  year={2017},
  volume={7}
}
Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains a challenge and grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method… 
Prioritizing bona fide bacterial small RNAs with machine learning classifiers
TLDR
The results indicate that even though there is limited sRNA sequence conservation across different bacterial species, there are intrinsic features in the genomic context of sRNAs that are conserved across taxa.
Computational Prediction of sRNA in Acinetobacter baumannii.
TLDR
This method improves the identification of sRNA in Acinetobacter and other bacterial species by being faster and more accurate than the current sRNAPredict3 tool.
Predicting small RNAs in bacteria via sequence learning ensemble method
TLDR
The sequence learning ensemble method, which uses the linear weighted sum of outputs from the individual feature-based predictors to predict sRNAs, and the genetic algorithm is adopted to optimize the parameters in the ensemble system, outperforms existing state-of-the-art sRNA prediction methods.
Sequence-based bacterial small RNAs prediction using ensemble learning strategies
TLDR
WAEM and NNEM can produce better results than existing state-of-the-art sRNA prediction methods and are helpful for understanding the biological mechanism of bacteria.
Prevalence of small base-pairing RNAs derived from diverse genomic loci.
Expansion of the SOS regulon of Vibrio cholerae through extensive transcriptome analysis and experimental validation
TLDR
It is shown that genotoxic stress induces a pervasive transcriptional response, affecting almost 20% of the V. cholerae genes, and that the SOS regulon is larger than previously known, and its syntenic organization is conserved among Vibrio species.
The Helitron family classification using SVM based on Fourier transform features applied on an unbalanced dataset
TLDR
This paper characterize and classify Helitron types using spectral features and support vector machine (SVM) classification technique and demonstrated that some specific classes which do not show similarities are easily discriminated with important accuracy rates exceeding 90%.
A Review of Artificial Intelligence Applications in Bacterial Genomics
TLDR
The characteristics of bacterial genomics are introduced, and artificial intelligence applications inacterial genomics research are summarized from three aspects: gene finding, gene function prediction and gene expression network construction.
Supervised and unsupervised algorithms for bioinformatics and data science.
Challenges at the interface of control engineering and synthetic biology
TLDR
This paper provides examples of biological implementations of negative feedback systems, and discusses progress made toward realisation of other feedback and control architectures, and explores future directions for work in the field, including new approaches for theoretical design of biological control systems.
...
1
2
...

References

SHOWING 1-10 OF 55 REFERENCES
A multiobjective method for robust identification of bacterial small non-coding RNAs
TLDR
This work presents a novel methodology designed to identify bacterial sRNAs by incorporating the knowledge encoded by different sRNA prediction methods and optimally aggregating them as potential predictors, which ensures robustness during detection and prediction even in genomes with distinct nucleotide composition.
sRNAscanner: A Computational Tool for Intergenic Small RNA Detection in Bacterial Genomes
TLDR
Data derived from this study has successfully identified six novel S. Typhimurium sRNA genes and computational specificity analysis suggests that ∼40% of sRNAscanner hits with high cumulative sum of scores represent genuine, undiscovered s RNA genes.
A computational approach to identify genes for functional RNAs in genomic sequences.
TLDR
This work developed a machine learning approach using neural networks and support vector machines to extract common features among known RNAs for prediction of new RNA genes in the unannotated regions of prokaryotic and archaeal genomes and achieved a significant improvement in accuracy.
Computational Small RNA Prediction in Bacteria
TLDR
In this review, a plethora of small RNA prediction methods and tools that have been reported in the past decade are discussed comprehensively and assessed based on their attributes, compatibility, and their prediction accuracy.
De novo computational prediction of non-coding RNA genes in prokaryotic genomes
TLDR
This work presents a novel de novo prediction algorithm for ncRNA genes using features derived from the sequences and structures of known nc RNA genes in comparison to decoys, and enables the identification of both cis- and trans-acting ncRNAs in partially or completely sequenced microbial genomes without requiring homology or structural conservation.
Deep sequencing of Salmonella RNA associated with heterologous Hfq proteins in vivo reveals small RNAs as a major target class and identifies RNA processing phenotypes
TLDR
Evidence is provided of a conserved inherent sRNA-binding property of Hfq, which may facilitate the lateral transmission of regulatory sRNAs among distantly related species and suggests that the expression of heterologous RNA-binding proteins combined with deep sequencing analysis of RNA ligands can be used as a molecular tool to dissect individual steps of RNA metabolism in vivo.
Genome-wide detection of novel regulatory RNAs in E. coli.
TLDR
By applying a deep-sequencing approach, the vast majority of the previously validated regulatory elements are detected and quantified and 10 new sRNAs and nine new regulatory leader sequences are identified in the intergenic regions of E. coli.
Target identification of small noncoding RNAs in bacteria.
Deep Sequencing Analysis of Small Noncoding RNA and mRNA Targets of the Global Post-Transcriptional Regulator, Hfq
TLDR
This work found that the mRNAs of hilD (master regulator of the SPI-1 invasion genes) and flhDC (flagellar master regulator) were bound by Hfq, and presented a generic strategy that is ideal for the systems-level analysis of the post-transcriptional regulons of RNA-binding proteins and for sRNA discovery in a wide range of bacteria.
Systematic deletion of Salmonella small RNA genes identifies CyaR, a conserved CRP‐dependent riboregulator of OmpX synthesis
TLDR
The Hfq‐dependent CyaR (formerly RyeE) sRNA is identified as an ompX repressor and cyaR expression is tightly controlled by the cyclic AMP receptor protein, CRP, which represents a new link between porin repression and nutrient availability that is likely to be widely conserved among enterobacteria.
...
1
2
3
4
5
...