首页> 外国专利> Parallel processing of large data files on distributed file systems with dynamic workload balancing

Parallel processing of large data files on distributed file systems with dynamic workload balancing

机译:动态工作负载平衡在分布式文件系统上并行处理大型数据文件

摘要

Parallel processing files on a distributed file system, responds to a request to process the file, by partitioning the file into a set of predetermined sized blocks by a single coordinator and assigns blocks to a reader in a set of readers. Block assignments are recorded to a control table accessible to all readers. The set of readers search the control table for a row assigned to a respective reader. Responsive to locating the row, the respective reader changes a state of the row to in progress updating an associated timestamp. The respective reader scans the control table for an available set of the blocks, when all blocks currently assigned are exhausted and dynamically re-assigns an available block from another owning reader to itself and changes ownership to itself. When no rows are available, the respective reader completes processing and returns an end of file indication to the single coordinator.
机译:分布式文件系统上的并行处理文件,通过单个协调器将文件划分为一组预定大小的块,然后将块分配给一组读取器中的读取器,从而响应处理文件的请求。块分配记录在控制表中,所有读者均可访问。该组阅读器在控制表中搜索分配给各个阅读器的行。响应于定位行,各个读取器改变行的状态以进行更新相关的时间戳。当当前分配的所有块都用完时,相应的读取器将在控制表中扫描可用的块集合,并从另一个拥有的读取器向其自身动态重新分配可用的块并将其所有权更改为自身。当没有可用的行时,相应的阅读器将完成处理,并将文件结尾指示返回给单个协调器。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号