Toward unsupervised correlation preserving discretization

  title={Toward unsupervised correlation preserving discretization},
  author={Sameep Mehta and Srinivasan Parthasarathy and Hui Yang},
  journal={IEEE Transactions on Knowledge and Data Engineering},
Discretization is a crucial preprocessing technique used for a variety of data warehousing and mining tasks. In this paper, we present a novel PCA-based unsupervised algorithm for the discretization of continuous attributes in multivariate data sets. The algorithm leverages the underlying correlation structure in the data set to obtain the discrete intervals and ensures that the inherent correlations are preserved. Previous efforts on this problem are largely supervised and consider only… CONTINUE READING


Publications citing this paper.
Showing 1-10 of 24 extracted citations

An effective discretization method for disposing high-dimensional data

Inf. Sci. • 2014
View 5 Excerpts
Highly Influenced

An ICA-Based Multivariate Discretization Algorithm

KSEM • 2006
View 3 Excerpts
Highly Influenced

Data discretization: taxonomy and big data challenge

Wiley Interdiscip. Rev. Data Min. Knowl. Discov. • 2016
View 1 Excerpt

Clustering to determine predictive model for news reports analysis and econometric modeling

2015 IEEE 2nd International Conference on Recent Trends in Information Systems (ReTIS) • 2015
View 1 Excerpt


Publications referenced by this paper.

Similar Papers

Loading similar papers…