Development of Linear, Ensemble, and Nonlinear Models for the Prediction and Interpretation of the Biological Activity of a Set of PDGFR Inhibitors

Abstract

A QSAR modeling study has been done with a set of 79 piperazyinylquinazoline analogues which exhibit PDGFR inhibition. Linear regression and nonlinear computational neural network models were developed. The regression model was developed with a focus on interpretative ability using a PLS technique. However, it also exhibits a good predictive ability after outlier removal. The nonlinear CNN model had superior predictive ability compared to the linear model with a training set error of 0.22 log(IC50) units (R2 = 0.93) and a prediction set error of 0.32 log(IC50) units (R2 = 0.61). A random forest model was also developed to provide an alternate measure of descriptor importance. This approach ranks descriptors, and its results confirm the importance of specific descriptors as characterized by the PLS technique. In addition the neural network model contains the two most important descriptors indicated by the random forest model.

DOI: 10.1021/ci049849f

21 Figures and Tables

Statistics

01020'05'06'07'08'09'10'11'12'13'14'15'16'17
Citations per Year

84 Citations

Semantic Scholar estimates that this publication has 84 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Guha2004DevelopmentOL, title={Development of Linear, Ensemble, and Nonlinear Models for the Prediction and Interpretation of the Biological Activity of a Set of PDGFR Inhibitors}, author={Rajarshi Guha and Peter C. Jurs}, journal={Journal of chemical information and computer sciences}, year={2004}, volume={44 6}, pages={2179-89} }