首页> 外文期刊>Analytical methods >Quantitative structure–activity relationship (QSAR) study of carcinogenicity of polycyclic aromatic hydrocarbons (PAHs) in atmospheric particulate matter by random forest (RF)
【24h】

Quantitative structure–activity relationship (QSAR) study of carcinogenicity of polycyclic aromatic hydrocarbons (PAHs) in atmospheric particulate matter by random forest (RF)

机译:随机森林(RF)对大气颗粒物中多环芳烃(PAHs)致癌性的定量构效关系(QSAR)研究

获取原文
           

摘要

The carcinogenicity or toxicity information of a substance can be quickly and easily obtained by using a quantitative structure–activity relationship (QSAR) model. In this study, the carcinogenicity of PAHs was analyzed and predicted by using a random forest (RF) model with the molecular structure information and carcinogenicity data of PAHs. The molecular structure information of 91 PAHs was represented by molecular descriptors (such as structure descriptors, topology descriptors, molecular connectivity index and geometric descriptors) which were calculated by using Dragon5.4 software. The model parameters (ntree and mtry) and input variables were optimized and evaluated with respect to the accuracy, positive predictive value (PPV), negative predictive value (NPV) and out-of-bag (OOB) error. Then, based on the optimized model parameters and input variables, the RF, partial least squares-discriminant analysis (PLS-DA) and artificial neural network (ANN) models were constructed to predict the carcinogenicity of PAHs. The results show that the classification accuracy, PPV, NPV and modeling time are 0.9333, 0.8889, 1.0000 and 10.40 s for the RF model, respectively, which shows a better predictive ability than the PLS-DA and ANN models for the prediction of the carcinogenicity of PAHs. Therefore, it is demonstrated that RF are a very promising method for the accurate prediction of the carcinogenicity of PAHs.
机译:通过使用定量结构-活性关系(QSAR)模型,可以快速轻松地获得物质的致癌性或毒性信息。在这项研究中,通过使用具有PAHs分子结构信息和致癌性数据的随机森林(RF)模型来分析和预测PAHs的致癌性。 91种多环芳烃的分子结构信息由分子描述符(如结构描述符,拓扑描述符,分子连接指数和几何描述符)表示,这些描述符是使用Dragon5.4软件计算的。优化并评估了模型参数(ntree和mtry)和输入变量的准确性,正预测值(PPV),负预测值(NPV)和袋外误差(OOB)。然后,基于优化的模型参数和输入变量,构建了RF,偏最小二乘判别分析(PLS-DA)和人工神经网络(ANN)模型来预测PAHs的致癌性。结果表明,RF模型的分类准确度,PPV,NPV和建模时间分别为0.9333、0.8889、1.0000和10.40 s,显示出比PLS-DA和ANN模型更好的预测致癌性的能力。多环芳烃。因此,证明RF是准确预测PAHs致癌性的非常有前途的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号