...
首页> 外文期刊>Journal of Computational Chemistry: Organic, Inorganic, Physical, Biological >Task-parallel message passing interface implementation of Autodock4 for docking of very large databases of compounds using high-performance super-computers
【24h】

Task-parallel message passing interface implementation of Autodock4 for docking of very large databases of compounds using high-performance super-computers

机译:Autodock4的任务并行消息传递接口实现,用于使用高性能超级计算机对接非常大的化合物数据库

获取原文
获取原文并翻译 | 示例
           

摘要

A message passing interface (MPI)-based implementation (Autodock4.lga.MPI) of the grid-based docking program Autodock4 has been developed to allow simultaneous and independent docking of multiple compounds on up to thousands of central processing units (CPUs) using the Lamarkian genetic algorithm. The MPI version reads a single binary file containing precalculated grids that represent the protein-ligand interactions, i.e., van der Waals, electrostatic, and desolvation potentials, and needs only two input parameter files for the entire docking run. In comparison, the serial version of Autodock4 reads ASCII grid files and requires one parameter file per compound. The modifications performed result in significantly reduced input/output activity compared with the serial version. Autodock4.lga.MPI scales up to 8192 CPUs with a maximal overhead of 16.3%, of which two thirds is due to input/output operations and one third originates from MPI operations. The optimal docking strategy, which minimizes docking CPU time without lowering the quality of the database enrichments, comprises the docking of ligands preordered from the most to the least flexible and the assignment of the number of energy evaluations as a function of the number of rotatable bounds. In 24 h, on 8192 high-performance computing CPUs, the present MPI version would allow docking to a rigid protein of about 300K small flexible compounds or 11 million rigid compounds.
机译:基于网格的对接程序Autodock4的基于消息传递接口(MPI)的实现(Autodock4.lga.MPI)已被开发为允许使用以下命令在多达数千个中央处理器(CPU)上同时且独立地对接多种化合物。拉马克遗传算法。 MPI版本读取单个二进制文件,该二进制文件包含代表蛋白质-配体相互作用(即范德华力,静电势和去溶剂化势能)的预先计算的网格,并且在整个对接过程中仅需要两个输入参数文件。相比之下,Autodock4的串行版本读取ASCII网格文件,并且每个化合物需要一个参数文件。与串行版本相比,所执行的修改导致输入/输出活动大大减少。 Autodock4.lga.MPI最多可扩展至8192个CPU,最大开销为16.3%,其中三分之二归因于输入/输出操作,三分之一来自MPI操作。最佳对接策略可最大程度地减少对接CPU时间而又不降低数据库扩充的质量,该对接策略包括按从最大到最小的顺序对配体进行对接,以及根据可旋转范围的数量分配能量评估的数量。在24小时内,在8192个高性能计算CPU上,当前的MPI版本将允许对接至约300K小型柔性化合物或1100万刚性化合物的刚性蛋白质。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号