A search for patterns in the nucleotide sequence of the MS2 genome

Authors: John W. Erickson and Gary George Altman
Journal: Journal of Mathematical Biology
Summary: The nucleotide sequence of the RNA of the bacteriophage MS2 was examined by computer for internal patterns. We used a technique which analyzes a nucleotide sequence as a Markov chain. This led us to discover patterns within the translated and untranslated regions of the RNA in addition to those patterns formed by the codons. One of the more surprising results of this analysis was the discovery that the non-coding sequences in the genome are as highly ordered, although in a different…
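As a rough illustration of what treating a nucleotide sequence as a Markov chain can look like, the sketch below estimates first-order transition probabilities between adjacent bases and computes a likelihood-ratio (G) statistic for independence of neighboring bases. It is not the authors' program: the function names, the choice of a first-order chain, and the use of the G statistic are all assumptions made for illustration.

```python
from collections import Counter
from math import log

ALPHABET = "ACGU"  # RNA bases; assumed alphabet for this sketch


def transition_counts(seq):
    """Count adjacent base pairs (x followed by y) in an RNA sequence."""
    pairs = Counter(zip(seq, seq[1:]))
    return {x: {y: pairs.get((x, y), 0) for y in ALPHABET} for x in ALPHABET}


def transition_probs(counts):
    """Turn each row of pair counts into estimated probabilities P(y | x)."""
    probs = {}
    for x, row in counts.items():
        total = sum(row.values())
        probs[x] = {y: (n / total if total else 0.0) for y, n in row.items()}
    return probs


def g_statistic(counts):
    """Likelihood-ratio statistic G = 2 * sum(O * ln(O / E)).

    E is the pair count expected if adjacent bases were independent.
    Cells with zero observed count contribute nothing to the sum.
    """
    row_tot = {x: sum(counts[x].values()) for x in ALPHABET}
    col_tot = {y: sum(counts[x][y] for x in ALPHABET) for y in ALPHABET}
    n = sum(row_tot.values())
    g = 0.0
    for x in ALPHABET:
        for y in ALPHABET:
            obs = counts[x][y]
            if obs:
                expected = row_tot[x] * col_tot[y] / n
                g += 2 * obs * log(obs / expected)
    return g


if __name__ == "__main__":
    demo = "ACGUACGUACGU"  # toy sequence, not MS2 data
    counts = transition_counts(demo)
    print(transition_probs(counts)["A"])
    print(g_statistic(counts))
```

For a 4 x 4 table of pair counts, G is approximately chi-square distributed with 9 degrees of freedom when adjacent bases are independent, so a large value suggests neighbor dependence. Higher-order chains or separate analyses per codon position would need wider context windows than this sketch uses.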
Statistical Predictions of Coding Regions in Prokaryotic Genomes by Using Inhomogeneous Markov Models
The chapter talks about higher-order models and models of typical and atypical genes, a category of special interest for evolutionary studies, as well as for studies of pathogenic bacteria whose pathogenicity islands or antibiotic-resistance genes could be relatively recent additions to the whole genome.
On the informational content of viral DNA.
Methyltransferases as tools to alter the specificity of restriction endonucleases.
This chapter describes enzymatic strategies to generate large DNA fragments and statistical tools that can help researchers choose the restriction enzymes most likely to generate large fragments in the genome in question, provided a sequence database is available to search.
Between a chicken and a grape: estimating the number of human genes
The history of efforts to establish the human gene count is reviewed and the evidence behind the current best estimate of 22,333 genes is explained. Comparisons with other species show that nothing about the human gene count is exceptional and that it is not particularly different from that of other mammalian species.
Markov chain analysis finds a significant influence of neighboring bases on the occurrence of a base in eucaryotic nuclear DNA sequences both protein-coding and noncoding
A considerable sample of eucaryotic nuclear DNA sequences has significant local structure over subsequences of three to five contiguous bases, and this structure occurs throughout the total length of the sequence.
Context-Dependent Evolutionary Models for Non-Coding Sequences: An Overview of Several Decades of Research and an Analysis of Laurasiatheria and Primate Evolution
G. Baele. Evolutionary Biology, 2011.
This paper discusses various approaches presented in recent years to model context-dependent evolution, and presents new results on two mammalian datasets to shed light on so-called lineage-dependent, context-dependent evolution.
Between a chicken and a grape: estimating the number of human genes
Many people expected the question 'How many genes in the human genome?' to be resolved with the publication of the genome sequence in 2001, but estimates continue to fluctuate.
Modelling the ancestral sequence distribution and model frequencies in context-dependent models for primate non-coding sequences
It is shown that the combination of a dependency scheme at the ancestral root sequence and a context-dependent evolutionary model across the remainder of the tree allows for accurate estimation of the model's parameters.


Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene
The complete primary chemical structure of a viral genome has now been established, and biological properties, such as ribosome binding and codon interactions, can now be discussed on a molecular basis.
Nucleotide Sequence of the Gene Coding for the Bacteriophage MS2 Coat Protein
By characterization of fragments, isolated from a nuclease digest of MS2 RNA, the entire nucleotide sequence of the coat gene was established. A “flower”-like model is proposed for the secondary
Nucleotide sequence of bacteriophage φX174 DNA
The sequence identifies many of the features responsible for the production of the proteins of the nine known genes of the organism, including initiation and termination sites for the proteins and RNAs.
A test for nucleotide sequence homology.
3'-Terminal nucleotide sequence (n equals 361) of bacteriophage MS2 RNA.
32P-Labeled MS2 RNA was partially digested with ribonuclease T1 and a unique reading frame could be deduced, which indicated that the replicase gene ends with a U-A-G termination signal and is followed by a 174-nucleotide-long untranslated segment.
Tests for Contingency Tables and Markov Chains
A number of useful tests for contingency tables and finite stationary Markov chains are presented in this paper based on the use of the notions of information theory. A consistent and simple approach
Studies on the bacteriophage MS2 6. The nucleoside 5'-triphosphate end groups of the replicative intermediate and the replicative form.
The viral proteins synthesized in non-suppressor cells by amber mutants in the A protein cistron of the RNA bacteriophage MS2 were analyzed and it was shown that the absence of fragment is not due to selective proteolytic breakdown.
Prediction of alpha-helical regions in proteins of known sequence.