首页> 美国卫生研究院文献>Chemical Science >MoleculeNet: a benchmark for molecular machine learning
【2h】

MoleculeNet: a benchmark for molecular machine learning

机译:MoleculeNet:分子机器学习的基准

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Molecular machine learning has been maturing rapidly over the last few years. Improved methods and the presence of larger datasets have enabled machine learning algorithms to make increasingly accurate predictions about molecular properties. However, algorithmic progress has been limited due to the lack of a standard benchmark to compare the efficacy of proposed methods; most new algorithms are benchmarked on different datasets making it challenging to gauge the quality of proposed methods. This work introduces MoleculeNet, a large scale benchmark for molecular machine learning. MoleculeNet curates multiple public datasets, establishes metrics for evaluation, and offers high quality open-source implementations of multiple previously proposed molecular featurization and learning algorithms (released as part of the DeepChem open source library). MoleculeNet benchmarks demonstrate that learnable representations are powerful tools for molecular machine learning and broadly offer the best performance. However, this result comes with caveats. Learnable representations still struggle to deal with complex tasks under data scarcity and highly imbalanced classification. For quantum mechanical and biophysical datasets, the use of physics-aware featurizations can be more important than choice of particular learning algorithm.
机译:在过去的几年中,分子机器学习已经迅速成熟。改进的方法和更大的数据集的存在使机器学习算法能够对分子性质做出越来越准确的预测。然而,由于缺乏标准的基准来比较所提出的方法的有效性,算法的进展受到了限制。大多数新算法都在不同的数据集上进行基准测试,因此很难衡量所提出方法的质量。这项工作介绍了MoleculeNet,这是分子机器学习的大规模基准。 MoleculeNet策划了多个公共数据集,建立了评估指标,并提供了多个先前提出的分子特征化和学习算法的高质量开源实现(已作为DeepChem开源库的一部分发布)。 MoleculeNet基准测试表明,可学习的表示形式是分子机器学习的强大工具,并且广泛提供了最佳性能。但是,此结果带有警告。在数据缺乏和高度不平衡的分类下,可学习的表示形式仍然难以处理复杂的任务。对于量子力学和生物物理数据集,使用物理感知特征化比选择特定学习算法更为重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号