Eugene V. Korotkov

Learn More
An earlier reported method for revealing latent periodicity of the nucleotide sequences has been considerably modified in a case of small samples, by applying a Monte Carlo method. This improved method has been used to search for the latent periodicity of some nucleotide sequences of the EMBL data bank. The existence of the nucleotide sequences' latent(More)
A method of informational decomposition has been developed, allowing one to reveal hidden periodicity in any symbol sequence. The informational decomposition is calculated without conversion of a symbol sequence into a numerical one, which facilitates finding periodicities in a symbol sequence. The method permits introducing an analog of the autocorrelation(More)
The definition of a phase shift of triplet periodicity (TP) is introduced. The mathematical algorithm for detection of TP phase shift of nucleotide sequences has been developed. Gene sequences from Kegg-46 data bank were analyzed with a purpose of searching genes with a phase shift of TP. The presence of a phase shift of triplet periodicity has been shown(More)
The existence of a typical latent periodicity of 21 bases from the Tar chemoreceptor gene of Escherichia coli (E. coli) (MCP II) in the bacterial genes has been investigated in this work. Among 583 annotated bacterial genes and ORFs in the GenBank, in which the typical periodicity has been found, the chemoreceptors' genes constituted the most numerous group(More)
We introduce a new concept of triplet periodicity class (TPC) and a measure of similarity between such classes. We performed classification of 472288 triplet periodicity (TP) regions found in 578868 genes from 29th release of KEGG databank. Totally 2520 classes were obtained. They contain 94% of 472288 found cases of TP. For 92% of TP regions contained in(More)
Development of mathematical methods for study of symbolical sequence periodicity gets special significance nowadays. First of all it is concerned with the successful determination of DNA sequences from various genomes and accumulation of a great number of amino acid sequences. Therefore there is a problem for mathematics and biologists to be solved to(More)
A method of latent periodicity search being developed. Mutual information is used to reveal of DNA or mRNA sequence latent periodicity. The latent periodicity of DNA sequence is a periodicity with low level of homology between any two periods inside DNA sequence. The mutual information between artificial numerical sequence and DNA sequence is calculated.(More)
In our previous studies, we found statistically significant latent periodicity in a huge amount of protein sequences and a proposition was made that many of periodicity patterns correspond to structural or functional features of protein families. However, it was unable to test since latent periodicity is feebly marked and often interrupted by insertions and(More)
A method of latent periodicity search is developed. We use mutual information to reveal the latent periodicity of mRNA sequences. The latent periodicity of an mRNA sequence is a periodicity with a low level of similarity between any two periods inside the mRNA sequence. The mutual information between an artificial numerical sequence and an mRNA sequence is(More)