A Novel Variable-order Markov Model for Clustering Categorical Sequences

@article{Xiong2014ANV,
  title={A Novel Variable-order Markov Model for Clustering Categorical Sequences},
  author={Tengke Xiong and Shengrui Wang and Qingshan Jiang and Joshua Zhexue Huang},
  journal={IEEE Transactions on Knowledge and Data Engineering},
  year={2014},
  volume={26},
  pages={2339-2353}
}
Clustering categorical sequences is an important and difficult data mining task. Despite recent efforts, the challenge remains, due to the lack of an inherently meaningful measure of pairwise similarity. In this paper, we propose a novel variable-order Markov framework, named weighted conditional probability distribution (WCPD), to model clusters of categorical sequences. We propose an efficient and effective approach to solve the challenging problem of model initialization. To initialize the… CONTINUE READING

Citations

Publications citing this paper.

References

Publications referenced by this paper.
Showing 1-10 of 36 references

Similar Papers

Loading similar papers…