首页> 美国卫生研究院文献>Journal of Computational Biology >Pathset Graphs: A Novel Approach for Comprehensive Utilization of Paired Reads in Genome Assembly
【2h】

Pathset Graphs: A Novel Approach for Comprehensive Utilization of Paired Reads in Genome Assembly

机译:路径集图:基因组装配中配对读取的综合利用的新方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>One of the key advances in genome assembly that has led to a significant improvement in contig lengths has been improved algorithms for utilization of paired reads (mate-pairs). While in most assemblers, mate-pair information is used in a post-processing step, the recently proposed Paired de Bruijn Graph (PDBG) approach incorporates the mate-pair information directly in the assembly graph structure. However, the PDBG approach faces difficulties when the variation in the insert sizes is high. To address this problem, we first transform mate-pairs into edge-pair histograms that allow one to better estimate the distance between edges in the assembly graph that represent regions linked by multiple mate-pairs. Further, we combine the ideas of mate-pair transformation and PDBGs to construct new data structures for genome assembly: pathsets and pathset graphs.
机译:>导致重叠群长度显着改善的基因组组装的关键进展之一是改进了利用配对读段(配对)的算法。虽然在大多数汇编程序中,配对对信息都用于后期处理步骤,但最近提出的成对的Bruijn图(PDBG)方法将配对对信息直接合并到装配图结构中。但是,当刀片尺寸变化很大时,PDBG方法会遇到困难。为了解决这个问题,我们首先将配对对转换为边缘对直方图,以使人们可以更好地估计装配图中代表多个匹配对链接的​​区域的边缘之间的距离。此外,我们结合了配对配对转换和PDBG的思想,以构建用于基因组组装的新数据结构:路径集和路径集图

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号