DNA Symphony: A new method to represent genomic sequences

Abstract

Complete genomic sequences from biological species can be graphically represented by a “genomic signature”. This representation provides information about the oligonucleotide frequencies considering different size of k-mers. Moreover, genomic sequences can also be represented by an audio signal, obtained by translating each oligonucleotide or protein into a certain range of audio frequencies. Although audio representation strategies provide an interesting result, they only use part of the genomic sequence. To date no method exists which contemplates the complete genome sequence. This work proposes a new method for audio representation of genomes by composing a polyphonic signal using a set of complete genomic sequences. This method is described here by first extracting the genomic signature for each sequence. Then, to obtain the audio signal, two-dimensional genomic signatures are transformed into a one-dimensional sequence by normalizing each value into an audible spectrum. Finally each signal, depending on the number of sequences, is played on a different channel to generate a polyphonic track. The experimental results and the audio analysis suggest that the described method, preserves the main patterns and genome structure from the original sequence.

3 Figures and Tables

Cite this paper

@inproceedings{Medina2013DNASA, title={DNA Symphony: A new method to represent genomic sequences}, author={Rosario A. Medina and Harieth M. Bernedo Cordova and Jes{\'u}s P. Mena-Chalco}, year={2013} }