Comparative chemometric modeling of cytochrome 3A4 inhibitory activity of structurally diverse compounds using stepwise MLR, FA-MLR, PLS, GFA, G/PLS and ANN techniques.
Article Details
- CitationCopy to clipboard
Roy K, Pratim Roy P
Comparative chemometric modeling of cytochrome 3A4 inhibitory activity of structurally diverse compounds using stepwise MLR, FA-MLR, PLS, GFA, G/PLS and ANN techniques.
Eur J Med Chem. 2009 Jul;44(7):2913-22. doi: 10.1016/j.ejmech.2008.12.004. Epub 2008 Dec 16.
- PubMed ID
- 19128860 [ View in PubMed]
- Abstract
Twenty-eight structurally diverse cytochrome 3A4 (CYP3A4) inhibitors have been subjected to quantitative structure-activity relationship (QSAR) studies. The analyses were performed with electronic, spatial, topological, and thermodynamic descriptors calculated using Cerius 2 version 10 software. The statistical tools used were linear [multiple linear regression with factor analysis as preprocessing step (FA-MLR), stepwise MLR, partial least squares (PLS), genetic function algorithm (GFA), genetic PLS (G/PLS)] and non-linear methods [artificial neural network (ANN)]. All the five linear modeling methods indicate the importance of n-octanol/water partition coefficient (logP) along with different topological and electronic parameters. The best model obtained from the training set (stepwise regression) based on highest external predictive R(2) value and lowest RMSEP value also showed good internal predictive power. Other models like FA-MLR, PLS, GFA and G/PLS are also of statistically significant internal and external validation characteristics. The best model [according to r(m)(2) for the test set, as defined by P.P. Roy, K. Roy, QSAR Comb. Sci. 27 (2008) 302-313] obtained from ANN showed a good r(2) value (determination coefficient between observed and predicted values) for the test set compounds, which was superior to those of other statistical models except the stepwise regression derived model. However, based upon the r(m)(2) value (test set), which penalizes a model for large differences between observed and predicted values, the stepwise MLR model was found to be inferior to other methods except PLS. Considering r(m)(2) value for the whole set, the G/PLS derived model appears to be the best predictive model for this data set. For choosing the best predictive model from among comparable models, r(m)(2) for the whole set calculated based on leave-one-out predicted values of the training set and model-derived predicted values for the test set compounds is suggested to be a good criterion.
DrugBank Data that Cites this Article
- Binding Properties
Drug Target Property Measurement pH Temperature (°C) Bromocriptine Cytochrome P450 3A4 IC 50 (nM) 3.00E+03 N/A N/A Details Chloroquine Cytochrome P450 3A4 IC 50 (nM) 3.50E+05 N/A N/A Details Cimetidine Cytochrome P450 3A4 IC 50 (nM) 1.00E+06 N/A N/A Details Clotrimazole Cytochrome P450 3A4 IC 50 (nM) 5.00E+01 N/A N/A Details Dihydroergotamine Cytochrome P450 3A4 IC 50 (nM) 3.00E+03 N/A N/A Details Diltiazem Cytochrome P450 3A4 IC 50 (nM) 2.17E+05 N/A N/A Details Econazole Cytochrome P450 3A4 IC 50 (nM) 4.31E+02 N/A N/A Details Fluconazole Cytochrome P450 3A4 IC 50 (nM) 2.50E+04 N/A N/A Details Metyrapone Cytochrome P450 3A4 IC 50 (nM) 4.93E+03 N/A N/A Details Miconazole Cytochrome P450 3A4 IC 50 (nM) 8.51E+02 N/A N/A Details Nifedipine Cytochrome P450 3A4 IC 50 (nM) 4.70E+04 N/A N/A Details Omeprazole Cytochrome P450 3A4 IC 50 (nM) 7.80E+04 N/A N/A Details