Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets

Korotcov Alexandru; Tkachenko Valery; Russo Daniel P.; Ekins Sean

首页> 外文期刊>Molecular pharmaceutics >Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets

【24h】

Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets

机译：使用不同药物发现数据集的多机学习方法和度量的深度学习比较

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Machine learning methods have been applied to many data sets in pharmaceutical research for several decades. The relative ease and availability of fingerprint type molecular descriptors paired with Bayesian methods resulted in the widespread use of this approach for a diverse array of end points relevant to drug discovery. Deep learning is the latest machine learning algorithm attracting attention for many of pharmaceutical applications from docking to virtual screening. Deep learning is based on an artificial neural network with multiple hidden layers and has found considerable traction for many artificial intelligence applications. We have previously suggested the need for a comparison of different machine learning methods with deep learning across an array of varying data sets that is applicable to pharmaceutical research. End points relevant to pharmaceutical research include absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) properties, as well as activity against pathogens and drug discovery data sets. In this study, we have used data sets for solubility, probe-likeness, hERG, KCNQ1, bubonic plague, Chagas, tuberculosis, and malaria to compare different machine learning methods using FCFP6 fingerprints. These data sets represent whole cell screens, individual proteins, physicochemical properties as well as a data set with a complex end point. Our aim was to assess whether deep learning offered any improvement in testing when assessed using an array of metrics including AUC, F1 score, Cohens kappa, Matthews correlation coefficient and others. Based on ranked normalized scores for the metrics or data sets Deep Neural Networks (DNN) ranked higher than SVM, which in turn was ranked higher than all the other machine learning methods. Visualizing these properties for training and test sets using radar type plots indicates when models are inferior or perhaps over trained. These results also suggest the need for assessing deep learning further using multiple metrics with much larger scale comparisons, prospective testing as well as assessment of different fingerprints and DNN architectures beyond those used.

机译：机器学习方法已应用于几十年的药物研究中的许多数据集。与贝叶斯方法配对的指纹型分子描述符的相对缓解性和可用性导致这种方法广泛使用与药物发现相关的各种终点。深度学习是最新的机器学习算法吸引了许多药物应用中的注意力从对接到虚拟筛选。深度学习基于具有多个隐藏层的人工神经网络，并为许多人工智能应用发现了相当大的牵引力。我们之前建议需要比较不同的机器学习方法，并在适用于制药研究的不同数据集中进行深入学习的不同机器学习方法。与药物研究相关的终点包括吸收，分布，代谢，排泄和毒性（ADME / TOX）性质，以及对抗病原体和药物发现数据集的活性。在这项研究中，我们使用了数据集进行溶解度，探针相似，HERG，KCNQ1，Bubonic Plague，Chagas，结核病和疟疾，以比较使用FCFP6指纹的不同机器学习方法。这些数据集代表整个细胞屏幕，单个蛋白质，物理化学特性以及具有复杂终点的数据集。我们的目的是评估使用包括AUC，F1得分，Cohens Kappa，Matthews相关系数等的一系列指标评估时深入学习是否在评估时提供了任何改进。基于测量的指标或数据的规范化分数设置深神经网络（DNN）排名高于SVM，又排名高于所有其他机器学习方法。使用雷达类型图来可视化这些属性进行培训和测试集，指示何时何时劣等或可能在培训中。这些结果还建议需要使用多种度量进行评估深度学习，这些指标具有更大的比较，预期测试以及对所使用的不同指纹和DNN架构的评估。

著录项

来源
《Molecular pharmaceutics》 |2017年第12期|共14页
作者
Korotcov Alexandru; Tkachenko Valery; Russo Daniel P.; Ekins Sean;
展开▼
作者单位

Sci Data Software LLC 14914 Bradwill Court Rockville MD 20850 USA;

Sci Data Software LLC 14914 Bradwill Court Rockville MD 20850 USA;

Collaborat Pharmaceut Inc 840 Main Campus Dr Lab 3510 Raleigh NC 27606 USA;

Collaborat Pharmaceut Inc 840 Main Campus Dr Lab 3510 Raleigh NC 27606 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类药学;
关键词
deep learning; drug discovery; machine learning; pharmaceutics; support vector machine;

机译：深入学习;药物发现;机器学习;药剂;支持向量机;

相似文献

外文文献
中文文献
专利

1. Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets [J] . Korotcov Alexandru, Tkachenko Valery, Russo Daniel P., Molecular pharmaceutics . 2017,第12期

机译：使用不同药物发现数据集的多机学习方法和度量的深度学习比较
2. From machine learning to deep learning: progress in machine intelligence for rational drug discovery [J] . Lu Zhang, Jianjun Tan, Dan Han, Drug discovery today . 2017,第11期

机译：从机器学习到深度学习：理性药物发现的机器智能进展
3. Comparison of machine learning and deep learning techniques in promoter prediction across diverse species [J] . Nikita Bhandari, Satyajeet Khare, Rahee Walambe, PeerJ Computer Science . 2021,第a期

机译：不同物种启动子预测中机器学习与深层学习技术的比较
4. Performance comparison of Extreme Learning Machines and other machine learning methods on WBCD data set [C] . Ömer Selim Keskin, Akif Durdu, Muhammet Fatih Aslan, Signal Processing and Communications Applications Conference . 2021

机译：极端学习机与其他机器学习方法对WBCD数据集的性能比较
5. Illuminating Understudied Kinases and Facilitating Drug Discovery through Integrative Protein Kinase Resources and Machine Learning Methods [D] . Huang, Liang-Chin. 2021

机译：通过整合蛋白激酶资源和机器学习方法照亮升级的激酶并促进药物发现
6. Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Datasets [O] . Alexandru Korotcov, Valery Tkachenko, Daniel P Russo, -1

机译：使用多种药物发现数据集将深度学习与多种机器学习方法和指标进行比较
7. A Very Large-Scale Bioactivity Comparison of Deep Learning and Multiple Machine Learning Algorithms for Drug Discovery [O] . Thomas R. Lane, Daniel H. Foil, Eni Minerali, 2020

机译：深度学习和多种机器学习算法的一种非常大的生物活动比较药物发现

Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets

摘要

著录项

相似文献

相关主题

期刊订阅