【24h】

Parallel CNV detection algorithm based on cloud computing

机译:基于云计算的并行CNV检测算法

获取原文

摘要

Recently, the cost of whole-genome sequencing has decreased dramatically due to the development of next-generation sequencing technology, and a huge amount of sequencing data has been generated and released by research laboratories worldwide. However, it is difficult to develop mature genome analysis software and high-performance computing resources which are available to assay genome data in real time. In this paper, we propose a parallel and robust CNV detection algorithm to run on a cloud computing environment. The proposed method, which we call CNV shape was developed using a shape-based CNV detection algorithm with Map/Reduce framework. This method finds regions above a certain length with continuously increased or decreased read coverage generated by mapping sequencing data onto the human reference genome. In order to maintain load balancing of each node in the cloud computing environment, we use a partitioning method. Also, we demonstrate the efficiency of the proposed method for CNV detection using publicly available sequencing data.
机译:最近,由于下一代测序技术的发展,全基因组测序的成本已大大降低,并且全世界的研究实验室已经生成并发布了大量的测序数据。但是,难以开发可用于实时分析基因组数据的成熟的基因组分析软件和高性能计算资源。在本文中,我们提出了一种可在云计算环境上运行的并行且健壮的CNV检测算法。提出的方法,我们称为CNV形状,是使用具有Map / Reduce框架的基于形状的CNV检测算法开发的。该方法发现一定长度以上的区域,该区域具有通过将测序数据映射到人类参考基因组上而产生的连续不断增加或减少的阅读覆盖率。为了在云计算环境中维持每个节点的负载平衡,我们使用了分区方法。此外,我们展示了使用公开可用的测序数据进行CNV检测的拟议方法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号