Comparing Reverse Complementary Genomic Words Based on Their Distance Distributions and Frequencies

  title={Comparing Reverse Complementary Genomic Words Based on Their Distance Distributions and Frequencies},
  author={Ana Helena Tavares and Jakob Raymaekers and Peter J. Rousseeuw and Raquel M. Silva and Carlos A. C. Bastos and Armando Pinho and Paula Brito and Vera Afreixo},
  journal={Interdisciplinary Sciences: Computational Life Sciences},
In this work, we study reverse complementary genomic word pairs in the human DNA, by comparing both the distance distribution and the frequency of a word to those of its reverse complement. Several measures of dissimilarity between distance distributions are considered, and it is found that the peak dissimilarity works best in this setting. We report the existence of reverse complementary word pairs with very dissimilar distance distributions, as well as word pairs with very similar distance… 
Local Symmetry of Non-Coding Genetic Sequences
A formal definition of DNA strand local symmetry is presented, characterized in terms of generalized logits and tested for the longest non-coding sequences of bacterial genomes, suggesting that even neutral mutations are skewed with respect to leading and lagging strands.
Extrapolation of significant genes and transcriptional regulatory networks involved in Zea mays in response in UV-B stress
Results infer that the major changes occurred in the cell cycle, transcriptional regulation, post-transcriptional modification, phytohormones, flavonoids biosynthesis, and chromatin remodeling of the maize.
de novo transcriptomic profiling of differentially expressed genes in grass halophyte Urochondra setulosa under high salinity
This study offers the first comprehensive evaluation of Urochondra setulosa leaf transcriptome, providing useful information to understand the gene regulation at extremely saline levels.


The Symmetry of Oligonucleotide Distance Distributions in the Human Genome
It is concluded that in the human genome, symmetric oligonucleotides have equivalent occurrence frequency and, additionally, they present similar distance distributions, which suggests a stronger parity rule than Chargaff’s.
Dissimilar Symmetric Word Pairs in the Human Genome
In this work we explore the dissimilarity between symmetric word pairs, by comparing the inter-word distance distribution of a word to that of its reversed complement. We propose a new measure of
Analysis of single-strand exceptional word symmetry in the human genome: new measures.
The global symmetry present in the human genome is clearly exceptional and significant and the chromosomes present distinct exceptional symmetry profiles.
Genome analysis with inter-nucleotide distances
The main contribution is a method to obtain genomic signatures for complete genomes, based on the inter-nucleotide distances, that are able to discriminate between different species and using these signatures and hierarchical clustering, it is possible to build phylogenetic trees.
Why are complementary DNA strands symmetric?
It is shown that ubiquitous reverse-complement symmetry does not result from a single cause, such as point mutation or recombination, but rather emerges from the combined effects of a wide spectrum of mechanisms operating at multiple orders and length scales.
Asymptotically increasing compliance of genomes with Chargaff's second parity rules through inversions and inverted transpositions
A quantitative formulation of the hypothesis that inversions and inverted transposition could be a major contributing if not dominant factor in the almost universal validity of the CIImono and CIIoligo rules is proposed.
Tandem repeats finder: a program to analyze DNA sequences.
A new algorithm for finding tandem repeats which works without the need to specify either the pattern or pattern size is presented and its ability to detect tandem repeats that have undergone extensive mutational change is demonstrated.
Strand Symmetry: Characteristics and Origins
It is concluded that the phenomenon of strand symmetry would be the original feature and relic of the primordial genome and a new hypothesis is proposed about its origin as well as about the constitution of early genomes.