Global importance of RNA secondary structures in protein-coding sequences

  title={Global importance of RNA secondary structures in protein-coding sequences},
  author={Markus Fricke and Ruman Gerst and Bashar Ibrahim and Michael Niepmann and Manja Marz},
  pages={579 - 583}
Motivation: The protein‐coding sequences of messenger RNAs are the linear template for translation of the gene sequence into protein. Nevertheless, the RNA can also form secondary structures by intramolecular base‐pairing. Results: We show that the nucleotide distribution within codons is biased in all taxa of life on a global scale. Thereby, RNA secondary structures that require base‐pairing between the position 1 of a codon with the position 1 of an opposing codon (here named RNA secondary… 

Figures and Tables from this paper

Conserved Secondary Structures in Viral mRNAs

This is the first compilation of potentially functional conserved RNA structures in viral coding regions, covering the complete RefSeq viral database, and was able to recover structural elements from previous studies and discovered a variety of novel structured regions.

Widespread selection for high and low secondary structure in coding sequences across all domains of life

It is demonstrated that selection for high and low secondary structure is a widespread phenomenon, and another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models.

Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life

It is shown that codon composition and amino acid identity are main determinants of RNA secondary structure, and that the arrangement of synonymous codons within coding sequences is non-random, demonstrating that selection for high and low levels of secondary structure is a widespread phenomenon.

Widespread non-modular overlapping codes in the coding regions

Current knowledge related to overlapping codes inside the coding regions, such as the influence of synonymous codon usage on translation speed (and, in turn, the effect of translation speed on protein folding), ribosomal frameshifting, mRNA stability, methylation, splicing, transcription and more are reviewed.

Ribosome Pausing at Inefficient Codons at the End of the Replicase Coding Region Is Important for Hepatitis C Virus Genome Replication

Using ribosome profiling of cells replicating full-length infectious HCV genomes, it is uncovered that ribosomes accumulate at the HCV stop codon and about 30 nucleotides upstream of it, which may allow the enzymatically active replicase core to find its genuine RNA template in cis, while the protein is still held in place by being stuck with its C-terminus in the exit tunnel of the paused Ribosome.

Hepatitis C Virus Translation Regulation

The liver-specific microRNA-122 (miR-122) stimulates HCV IRES-dependent translation, most likely by stabilizing a certain structure of the IRES that is required for initiation.

Cellular Gene Expression during Hepatitis C Virus Replication as Revealed by Ribosome Profiling

After establishing HCV replication, the lack of global changes in cellular gene expression indicates an adaptation to chronic infection, while the downregulation of mitochondrial respiratory chain genes indicates how a virus may further contribute to cancer cell-like metabolic reprogramming (“Warburg effect”) even in the hepatocellular carcinoma cells used here.

Hepatitis C Virus Downregulates Core Subunits of Oxidative Phosphorylation, Reminiscent of the Warburg Effect in Cancer Cells

HCV downregulates the expression of mitochondrial oxidative phosphorylation complex core subunits quite early after infection and serves to provide more anabolic metabolites upstream of the citric acid cycle, such as amino acids, pentoses and NADPH for cancer cell growth.

HCV Genetic Diversity Can Be Used to Infer Infection Recency and Time since Infection

Genetic diversity in HCV correlates with TSI and is a proxy for infection recency and TSI, even several years post-infection.

sRNARFTarget: a fast machine-learning-based approach for transcriptome-wide sRNA target prediction

Bacterial small regulatory RNAs (sRNAs) are key regulators of gene expression in many processes related to adaptive responses. A multitude of sRNAs have been identified in many bacterial species;



Concurrent Neutral Evolution of mRNA Secondary Structures and Encoded Proteins

A statistical analysis of retroviral mRNA supports the hypothesis that the natural genetic code is adapted to such complementary coding, and finds that preservation of RNA secondary structure by compensatory mutations is evolutionary compatible with the efficient search for new variants on the protein level.

A periodic pattern of mRNA secondary structure created by the genetic code

The first transcriptome-wide in silico analysis of the human and mouse mRNA foldings found a pronounced periodic pattern of nucleotide involvement in mRNA secondary structure, and it is demonstrated that the third degenerate codon sites contribute most strongly to mRNA stability.

Widespread selection for local RNA secondary structure in coding regions of bacterial genes.

Several significant associations suggest functional roles for RNA structures in RNA processing, regulation of mRNA stability, and translational control, including stronger secondary structure bias in the coding regions of intron-containing yeast genes than in intronless genes, and significantly higher folding potential in polycistronics messages than in monocistronic messages in Escherichia coli.

The impact of RNA structure on coding sequence evolution in both bacteria and eukaryotes

It is concluded that structurally sensitive sites in mRNA sequences normally have less nucleotide divergence in all species the authors analyzed, and is helpful to the development of a codon model with RNA structure information.

Secondary structure and coding potential of the coat protein gene of bacteriophage MS2.

  • L. A. Ball
  • Biology, Chemistry
    Nature: New biology
  • 1973
The strongest evidence for the presence of base-paired secondary structure comes from the nucleotide sequences which have been determined for parts of the RNAs of bacteriophages MS21 and R172 and for a 6S RNA of bacteriaiophage λ3.

Tracing Specific Synonymous Codon–Secondary Structure Correlations Through Evolution

The GAU-α-helix correlation is also strong in non-human mammalian and vertebrate proteins but is much weaker or insignificant in S. cerevisiae and plants, suggesting the existence of a novel evolutionary selection mechanism.

Deciphering the rules by which dynamics of mRNA secondary structure affect translation efficiency in Saccharomyces cerevisiae

The results showed that the combined effect of mRNA secondary structure and codon usage in highly translated mRNAs causes a short ribosomal distance in structural regions, which in turn eliminates the structures during translation, leading to a high elongation rate.

Dynamics of translation by single ribosomes through mRNA secondary structures

Single-molecule fluorescence resonance energy transfer is used to determine reaction rates for specific steps within the elongation cycle as the Escherichia coli ribosome encounters stem-loop or pseudoknot mRNA secondary structures, finding that unfolding of mRNAsecondary structures is more closely coupled to E-site tRNA dissociation than to tRNA translocation.

The large extent of putative secondary nucleic acid structure in random nucleotide sequences or amino acid derived messenger-RNA

  • W. Fitch
  • Biology
    Journal of Molecular Evolution
  • 2005
One cannot predict with any confidence the secondary structure of messenger RNA from amino acids sequences, even when the presence of extensive numbers of variants are known and the amino acid sequences of other orthologous proteins are also available.

Cis-acting RNA elements in human and animal plus-strand RNA viruses