首页> 外文会议>2004 computing frontier conference >A Parallel Backtracking Framework(BkFr)for Single and Multiple Clusters
【24h】

A Parallel Backtracking Framework(BkFr)for Single and Multiple Clusters

机译:用于单个和多个集群的并行回溯框架(BkFr)

获取原文
获取原文并翻译 | 示例

摘要

There is a wide class of important problems,typified by the NP- complete SAT problem,for which the best known solutions are obtained using the backtracking strategy and heuristics to bound the search of the associated state-space tree.The backtracking strategy is well suited to parallelization,since different portions of the state-space tree can be assigned to different processes.The resulting strategy is often referred to in the literature as parallel depth-first search or parallel backtracking.In this paper we introduce Backtracking Framework(BkFr)that simplifies implementation of the parallel backtracking paradigm in the single cluster environment,and supports the extension of computations over multiple clusters.BkFr uses two communication methods–MPI for intra-cluster communication and ICI(Inter-Cluster Interface)for inter-cluster communication. What differentiates ICI from MPI-2,Globus and other libraries that support inter-cluster message passing is its lightweight (requiring only Unix socket API and little or no systems administration support),and its fault-tolerant capabilities.The latter capability was one of our principal motivations for developing ICI,since it allows new clusters to join an ongoing computation anytime without affecting the integrity of the whole system.Also,a sudden failure of a cluster that is part of the computation will result in a reassignment of its work to another resource.There are a number of other parallel frameworks for parallel state-space tree searching,such as the ZRAM framework, but they do not support the multi-cluster fault-tolerant option.In addition to its ease of implementation,BkFr offers a simple interface(less than 10 functions)for new compute modules, making it relatively simple for the user to alter existing code to a form suitable for the BkFr framework.To further simplify the utilization of BkFr for solving user-defined problems,we have developed our own Shared Variable Emulation layer,which hides MPI and ICI communication functions,and allows the user to send information(messages)as variables.Because of its lightweight features,our framework does not incur significant additional overhead,and performance data shows good speedup on problems ranging from a simply implemented version of the sum of subsets problem,to complicated implementations of SAT solvers such as SBSAT and zChaff.While the current implementations of ICI and BkFr are developed in the context of MPI,they can readily be adapted to other message passing environments.
机译:存在大量重要问题,以NP完全SAT问题为典型,使用回溯策略和启发式方法来绑定相关状态空间树的搜索可获得最著名的解决方案。回溯策略非常适合在并行化中,由于状态空间树的不同部分可以分配给不同的进程。因此,在文献中通常将这种策略称为并行深度优先搜索或并行回溯。在本文中,我们介绍了回溯框架(BkFr) BkFr使用两种通信方法:MPI用于集群内通信和ICI(集群间接口)用于集群间通信,从而简化了在单个集群环境中并行回溯范式的实现,并支持在多个集群上的计算扩展。 ICI与MPI-2,Globus和其他支持群集间消息传递的库的不同之处在于它的轻量级(仅需要Unix socket API且几乎不需要系统管理支持),以及其容错功能。后一种功能是其中之一。我们开发ICI的主要动机是因为它允许新群集随时随地加入正在进行的计算而不会影响整个系统的完整性。此外,作为计算一部分的群集突然发生故障也将导致其工作重新分配给另一种资源。还有许多其他并行框架用于并行状态空间树搜索,例如ZRAM框架,但它们不支持多集群容错选项。除了易于实现之外,BkFr还提供了一个用于新计算模块的简单界面(少于10个功能),使用户相对容易地将现有代码更改为适合BkFr框架的形式。利用BkFr解决用户定义的问题,我们开发了自己的共享变量仿真层,它隐藏了MPI和ICI通信功能,并允许用户将信息(消息)作为变量发送。由于其轻量级的功能,我们的框架能够不会产生明显的额外开销,性能数据显示,从简单实现的子集和问题到复杂的SAT求解器(如SBSAT和zChaff)实现,问题的处理速度都得到了很好的加速。虽然ICI和BkFr的当前实现是在在MPI的上下文中,它们可以很容易地适应其他消息传递环境。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号