...
首页> 外文期刊>The Journal of Chemical Physics >Descriptors representing two- and three-body atomic distributions and their effects on the accuracy of machine-learned inter-atomic potentials
【24h】

Descriptors representing two- and three-body atomic distributions and their effects on the accuracy of machine-learned inter-atomic potentials

机译:代表两种和三体原子分布的描述符及其对机器学习际原子潜力的准确性的影响

获取原文
获取原文并翻译 | 示例
           

摘要

When determining machine-learning models for inter-atomic potentials, the potential energy surface is often described as a non-linear function of descriptors representing two- and three-body atomic distribution functions. It is not obvious how the choice of the descriptors affects the efficiency of the training and the accuracy of the final machine-learned model. In this work, we formulate an efficient method to calculate descriptors that can separately represent two- and three-body atomic distribution functions, and we examine the effects of including only two- or three-body descriptors, as well as including both, in the regression model. Our study indicates that non-linear mixing of two- and three-body descriptors is essential for an efficient training and a high accuracy of the final machine-learned model. The efficiency can be further improved by weighting the two-body descriptors more strongly. We furthermore examine a sparsification of the three-body descriptors. The three-body descriptors usually provide redundant representations of the atomistic structure, and the number of descriptors can be significantly reduced without loss of accuracy by applying an automatic sparsification using a principal component analysis. Visualization of the reduced descriptors using three-body distribution functions in real-space indicates that the sparsification automatically removes the components that are less significant for describing the distribution function.
机译:当确定用于原子间电位的机器学习模型时,势能表面通常被描述为表示两个和三体原子分配功能的描述符的非线性函数。描述符的选择是不明显的,如何影响培训的效率和最终机器学习模型的准确性。在这项工作中,我们制定了一种有效的方法来计算可以单独代表两种和三体原子分配功能的描述符,并且我们检查包括两个或三个身体描述符的效果,以及包括两者的效果回归模型。我们的研究表明,两个和三体描述符的非线性混合对于有效的培训和最终机器学习模型的高准确性至关重要。通过更强烈地加权双体描述符可以进一步提高效率。我们进一步检查了三体描述符的稀疏化。三体描述符通常提供原子结构的冗余表示,并且可以通过使用主成分分析应用自动稀疏来显着减小描述符的数量而不会损失精度。使用真实空间中的三体分布函数的可视化描述符指示稀疏自动删除用于描述分布函数不太重要的组件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号