Significant Pattern Mining on Continuous Variables
@article{Sugiyama2017SignificantPM, title={Significant Pattern Mining on Continuous Variables}, author={M. Sugiyama and K. Borgwardt}, journal={ArXiv}, year={2017}, volume={abs/1702.08694} }
Significant pattern mining, the search for sets of binary features that are statistically significantly enriched in a class of objects, is of fundamental importance in a wide range of applications from economics to statistical genetics. Still, all existing approaches make the restrictive assumption that the features are binary and require a binarization of continuous data during preprocessing, which often leads to a loss of information. Here, we solve the open problem of significant pattern… CONTINUE READING
Figures, Tables, and Topics from this paper
2 Citations
A Modified C4.5 Classification Algorithm: With the Discretization Method in Calculating the Goodness Score Equivalent
- Mathematics
- 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS)
- 2019
References
SHOWING 1-10 OF 45 REFERENCES
Significant Pattern Mining with Confounding Variables
- Psychology, Computer Science
- PAKDD
- 2016
- 10
- Highly Influential
Itemsets for Real-Valued Datasets
- Computer Science, Mathematics
- 2013 IEEE 13th International Conference on Data Mining
- 2013
- 7
- PDF
Finding significant combinations of features in the presence of categorical covariates
- Computer Science, Mathematics
- NIPS
- 2016
- 23
- PDF
Mining gene expression data with pattern structures in formal concept analysis
- Computer Science
- Inf. Sci.
- 2011
- 196
- Highly Influential
- PDF
Significant Subgraph Mining with Multiple Testing Correction
- Mathematics, Computer Science
- SDM
- 2015
- 25
- PDF
Fast and Memory-Efficient Significant Pattern Mining via Permutation Testing
- Mathematics, Computer Science
- KDD
- 2015
- 38
- PDF
Revisiting Numerical Pattern Mining with Formal Concept Analysis
- Mathematics, Computer Science
- IJCAI
- 2011
- 87
- Highly Influential
- PDF
Kingfisher: an efficient algorithm for searching for both positive and negative dependency rules with statistical significance measures
- Mathematics, Computer Science
- Knowledge and Information Systems
- 2011
- 35
- Highly Influential
- PDF