A Maximum Common Subgraph Kernel Method for Predicting the Chromosome Aberration Test

Abstract

The chromosome aberration test is frequently used for the assessment of the potential of chemicals and drugs to elicit genetic damage in mammalian cells in vitro. Due to the limitations of experimental genotoxicity testing in early drug discovery phases, a model to predict the chromosome aberration test yielding high accuracy and providing guidance for structure optimization is urgently needed. In this paper, we describe a machine learning approach for predicting the outcome of this assay based on the structure of the investigated compound. The novelty of the proposed method consists in combining a maximum common subgraph kernel for measuring the similarity of two chemical graphs with the potential support vector machine for classification. In contrast to standard support vector machine classifiers, the proposed approach does not provide a black box model but rather allows to visualize structural elements with high positive or negative contribution to the class decision. In order to compare the performance of different methods for predicting the outcome of the chromosome aberration test, we compiled a large data set exhibiting high quality, reliability, and consistency from public sources and configured a fixed cross-validation protocol, which we make publicly available. In a comparison to standard methods currently used in pharmaceutical industry as well as to other graph kernel approaches, the proposed method achieved significantly better performance.

DOI: 10.1021/ci900367j

Cite this paper

@article{Mohr2010AMC, title={A Maximum Common Subgraph Kernel Method for Predicting the Chromosome Aberration Test}, author={Johannes Mohr and Brijnesh J. Jain and Andreas Sutter and Antonius ter Laak and Thomas Steger-Hartmann and Nikolaus Heinrich and Klaus Obermayer}, journal={Journal of chemical information and modeling}, year={2010}, volume={50 10}, pages={1821-38} }