Analyzing Modular RNA Structure reveals Low Global Structural Entropy in microRNA Sequence

@article{Shaw2011AnalyzingMR,
  title={Analyzing Modular RNA Structure reveals Low Global Structural Entropy in microRNA Sequence},
  author={Timothy Isham Shaw and Amir Manzour and Yingfeng Wang and Russell L. Malmberg and Liming Cai},
  journal={Journal of bioinformatics and computational biology},
  year={2011},
  volume={9 2},
  pages={
          283-98
        }
}
Secondary structure remains the most exploitable feature for noncoding RNA (ncRNA) gene finding in genomes. However, methods based on secondary structure prediction may generate superfluous amount of candidates for validation and have yet to deliver the desired performance that can complement experimental efforts in ncRNA gene finding. This paper investigates a novel method, unpaired structural entropy (USE) as a measurement for the structure fold stability of ncRNAs. USE proves to be effective… 

Figures and Tables from this paper

Identifying RNA 5-methylcytosine sites via pseudo nucleotide compositions.
TLDR
In the present study, a support vector machine based-method is proposed to identify m5C sites in Homo sapiens and it was observed by the jackknife cross-validation that the overall success rate achieved by the proposed model is 90.42%.
mSRFR: a machine learning model using microalgal signature features for ncRNA classification
This work presents mSRFR (microalgae SMOTE Random Forest Relief model), a classification tool for noncoding RNAs (ncRNAs) in microalgae, including green algae, diatoms, golden algae, and
Transcriptome Sequencing and Annotation for the Jamaican Fruit Bat (Artibeus jamaicensis)
TLDR
The Jamaican fruit bat transcriptome dataset is a resource that should provide additional candidate markers for studying bat evolution and ecology, and tools for analysis of the host response and pathology of disease.

References

SHOWING 1-10 OF 38 REFERENCES
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine
TLDR
The local structure-sequence features reflect discriminative and conserved characteristics of miRNAs, and the successful ab initio classification of real and pseudo pre-miRNAs opens a new approach for discovering new mi RNAs.
RNA stem-loops: to be or not to be cleaved by RNAse III.
TLDR
Comparing known pre-miRNAs to other predicted noncoding transcripts obtained from comparative genomics scans, using the structure comparison program RNAforester, suggests that many nonc coding RNAs may be under selection to dodge the RNAi pathway.
Identification and Classification of Conserved RNA Secondary Structures in the Human Genome
TLDR
A general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functionalRNAs encoded in the human genome is developed and used to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs.
MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans
TLDR
A computational learning method SVM (support vector machine) was implemented to build a high throughput and good performance computational pre-miRNA prediction tool called MiRFinder, designed for genome-wise, pair-wise sequences from two related species.
Unique folding of precursor microRNAs: quantitative evidence and implications for de novo identification.
TLDR
This large-scale characterization analysis reveals that pre-miRs are significantly different from other types of ncRNAs, pseudohairpins, m RNAs, and genomic background according to the nonparametric Kruskal-Wallis ANOVA (p<0.001).
Fast and reliable prediction of noncoding RNAs
TLDR
All of the known noncoding RNAs and cis-acting elements with high significance are recovered and compelling evidence for many other conserved RNA secondary structures not described so far to the authors' knowledge is found.
Structured RNAs in the ENCODE selected regions of the human genome.
TLDR
A computational study to detect functional RNA structures within the ENCODE regions of the human genome using three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures.
Prediction of viral microRNA precursors based on human microRNA precursor sequence and structural features
TLDR
An efficient prediction method is developed based on the hypothesis that sequence and structure features which discriminate between host microRNA precursor hairpins and pseudo microRNAs are shared by viral microRNA as they depend on host machinery for the processing of microRNA precursors.
Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences
TLDR
The majority of the microRNA sequences clearly exhibit a folding free energy that is considerably lower than that for shuffled sequences, indicating a high tendency in the sequence towards a stable secondary structure.
Self Containment, a Property of Modular RNA Structures, Distinguishes microRNAs
TLDR
It is found that miRNA stem loops exhibit high self containment, consistent with the requirement for structural invariance imposed by the miRNA biogenesis pathway, while most other structured RNAs do not and a trend toward natural selection for self containment in most functional RNA classes is found.
...
...