Analysis of the Statistical Characteristics in Mining of Frequent Sequences

@inproceedings{Tumasonis2005AnalysisOT,
  title={Analysis of the Statistical Characteristics in Mining of Frequent Sequences},
  author={Romanas Tumasonis and Gintautas Dzemyda},
  booktitle={Intelligent Information Systems},
  year={2005}
}
The paper deals with the search and analysis of the subsequences in large volume sequences (texts, DNA sequences, etc.). A new algorithm ProMFS for mining frequent sequences is proposed and investigated. It is based on the estimated probabilistic-statistical characteristics of the appearance of elements of the sequence and their order. The algorithm builds a new much shorter sequence and makes decisions on the main sequence in accordance with the results of analysis of the shorter one. 

Topics from this paper.