...
首页> 外文期刊>Journal of enzyme inhibition and medicinal chemistry. >Development of linear and nonlinear predictive QSAR models and their external validation using molecular similarity principle for anti-HIV indolyl aryl sulfones.
【24h】

Development of linear and nonlinear predictive QSAR models and their external validation using molecular similarity principle for anti-HIV indolyl aryl sulfones.

机译:线性和非线性预测QSAR模型的开发以及使用抗HIV吲哚基芳基砜的分子相似性原理进行外部验证。

获取原文
获取原文并翻译 | 示例
           

摘要

Quantitative structure-activity relationship (QSAR) studies have been carried out on indolyl aryl sulfones, a class of novel HIV-1 non-nucleoside reverse transcriptase inhibitors, using physicochemical, topological and structural parameters along with appropriate indicator variables. The statistical tools used were linear methods (e.g., stepwise regression analysis, partial least squares (PLS), factor analysis followed by multiple regression (FA-MLR), genetic function approximation combined with multiple linear regression (GFA-MLR) and GFA followed by PLS or G/PLS and nonlinear method (artificial neural network or ANN). In case of physicochemical parameters, GFA-MLR generated the best Equation (n = 97, R(2) = 0.862, Q(2) = 0.821). Using topological parameters, the best Equation (based on leave-one-out Q(2)) was obtained with stepwise regression technique (n = 97, R(2) = 0.867, Q(2) = 0.811). When topological and physicochemical parameters were used in combination, statistical quality increased to a great extent (n = 97, R(2) = 0.891, Q(2) = 0.849 from stepwise regression). Furthermore, the whole dataset had been divided into test (25% of whole dataset) and training (remaining 75%) sets. Models were developed based on the training set and predictive potential of such models was checked from the test set. The selection of the training set was based on K-means clustering of the standardized descriptors (topological and physicochemical). In this case also the best results were obtained with stepwise regression (n = 72, R(2) = 0.906, Q(2) = 0.853) but external predictive capacity of this model ([image omitted] ) was inferior to the model developed from GFA-MLR technique (R(2) = 0.883, Q(2) = 0.823, [image omitted] ). However, the squared regression coefficient between observed activity and predicted activity values of the test set compounds for the best linear model, i.e., GFA-MLR (r(2) = 0.736) was lower in comparison to the best nonlinear model developed using artificial neural network (r(2) = 0.781). Thus, based on external validation, the ANN models were superior to the linear models. The predictive potential of the best linear Equation (stepwise regression model) was superior to that of the previously published CoMFA (Q(2) = 0.81, SDEP(Test) = 0.89) on the same data set (Ragno R. et al., J Med Chem 2006, 49, 3172-3184). Furthermore, the physicochemical parameter based models also supported the previous observations based on docking (Ragno R. et al., J Med Chem 2005, 48, 213-223).
机译:使用物理化学,拓扑学和结构参数以及合适的指示变量,对吲哚基芳基砜(一类新型HIV-1非核苷逆转录酶抑制剂)进行了定量构效关系(QSAR)研究。所使用的统计工具为线性方法(例如逐步回归分析,偏最小二乘(PLS),因子分析,然后进行多元回归(FA-MLR),遗传函数近似与多元线性回归(GFA-MLR)和GFA组合,然后进行PLS或G / PLS和非线性方法(人工神经网络或ANN)在理化参数的情况下,GFA-MLR生成了最佳方程(n = 97,R(2)= 0.862,Q(2)= 0.821)。拓扑参数,通过逐步回归技术(n = 97,R(2)= 0.867,Q(2)= 0.811)获得最佳方程式(基于留一法式Q(2))。结合使用,统计质量得到了很大程度的提高(逐步回归的n = 97,R(2)= 0.891,Q(2)= 0.849),而且,整个数据集被划分为测试(占整体的25%数据集)和训练(剩余75%)集。根据训练集和suc的预测潜力开发了模型从测试集中检查了h个模型。训练集的选择基于标准化描述符(拓扑和物理化学)的K-均值聚类。在这种情况下,逐步回归也能获得最佳结果(n = 72,R(2)= 0.906,Q(2)= 0.853),但是该模型的外部预测能力([图像省略])不如已开发的模型来自GFA-MLR技术(R(2)= 0.883,Q(2)= 0.823,[省略的图像])。但是,与使用人工神经网络开发的最佳非线性模型相比,最佳线性模型(即GFA-MLR(r(2)= 0.736))在测试组化合物的观测活性和预测活性值之间的平方回归系数要低。网络(r(2)= 0.781)。因此,基于外部验证,ANN模型优于线性模型。在同一数据集上,最佳线性方程(逐步回归模型)的预测潜力优于先前发布的CoMFA(Q(2)= 0.81,SDEP(Test)= 0.89)(Ragno R.等, J Med Chem 2006,49,3172-3184)。此外,基于理化参数的模型也支持基于对接的先前观察(Ragno R.等人,J Med Chem 2005,48,213-223)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号