Data Mining Algorithms for Virtual Screening of Bioactive Compounds
@inproceedings{Deshpande2007DataMA, title={Data Mining Algorithms for Virtual Screening of Bioactive Compounds}, author={M. Deshpande and M. Kuramochi and G. Karypis}, year={2007} }
In this chapter we study the problem of classifying chemical compound datasets. We present a sub-structure-based classification algorithm that decouples the sub-structure discovery process from the classification model construction and uses frequent subgraph discovery algorithms to find all topological and geometric sub-structures present in the dataset. The advantage of this approach is that during classification model construction, all relevant sub-structures are available allowing the… CONTINUE READING
Topics from this paper
2 Citations
Cheminformatics Explorations of Natural Products.
- Medicine
- Progress in the chemistry of organic natural products
- 2019
- 2
Perspectives on Knowledge Discovery Algorithms Recently Introduced in Chemoinformatics: Rough Set Theory, Association Rule Mining, Emerging Patterns, and Formal Concept Analysis
- Computer Science, Medicine
- J. Chem. Inf. Model.
- 2015
- 21
References
SHOWING 1-10 OF 74 REFERENCES
Comparisons of classification methods for screening potential compounds
- Computer Science
- Proceedings 2001 IEEE International Conference on Data Mining
- 2001
- 13
Data analysis of high-throughput screening results: application of multidomain clustering to the NCI anti-HIV data set.
- Biology, Medicine
- Journal of medicinal chemistry
- 2002
- 31
Mining molecular fragments: finding relevant substructures of molecules
- Computer Science
- 2002 IEEE International Conference on Data Mining, 2002. Proceedings.
- 2002
- 485
- PDF
Recursive Partitioning Analysis of a Large Structure-Activity Data Set Using Three-Dimensional Descriptors1
- Computer Science
- J. Chem. Inf. Comput. Sci.
- 1998
- 83
Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning
- Computer Science, Medicine
- J. Chem. Inf. Comput. Sci.
- 1999
- 129
Analysis of Large Screening Data Sets via Adaptively Grown Phylogenetic-Like Trees
- Computer Science, Medicine
- J. Chem. Inf. Comput. Sci.
- 2002
- 42
Warmr: a data mining tool for chemical data
- Computer Science, Medicine
- J. Comput. Aided Mol. Des.
- 2001
- 80