...
首页> 外文期刊>BMC Genomics >HGA: de novo genome assembly method for bacterial genomes using high coverage short sequencing reads
【24h】

HGA: de novo genome assembly method for bacterial genomes using high coverage short sequencing reads

机译:HGA:从头开始的细菌基因组的从头基因组组装方法,使用高覆盖范围的短测序读取

获取原文
           

摘要

Background Current high-throughput sequencing technologies generate large numbers of relatively short and error-prone reads, making the de novo assembly problem challenging. Although high quality assemblies can be obtained by assembling multiple paired-end libraries with both short and long insert sizes, the latter are costly to generate. Recently, GAGE-B study showed that a remarkably good assembly quality can be obtained for bacterial genomes by state-of-the-art assemblers run on a single short-insert library with very high coverage. Results In this paper, we introduce a novel hierarchical genome assembly (HGA) methodology that takes further advantage of such very high coverage by independently assembling disjoint subsets of reads, combining assemblies of the subsets, and finally re-assembling the combined contigs along with the original reads. Conclusions We empirically evaluated this methodology for 8 leading assemblers using 7 GAGE-B bacterial datasets consisting of 100 bp Illumina HiSeq and 250 bp Illumina MiSeq reads, with coverage ranging from 100x– ~ 200x. The results show that for all evaluated datasets and using most evaluated assemblers (that were used to assemble the disjoint subsets), HGA leads to a significant improvement in the quality of the assembly based on N50 and corrected N50 metrics.
机译:背景技术当前的高通量测序技术会产生大量相对较短且容易出错的读数,这使得从头组装问题变得具有挑战性。尽管可以通过组装具有短和长插入物大小的多个配对末端库来获得高质量的装配,但是后者的生成成本很高。最近,GAGE-B研究表明,通过在单个短插入文库上具有很高覆盖率的最先进汇编程序,可以为细菌基因组获得非常好的汇编质量。结果在本文中,我们介绍了一种新颖的层次基因组组装(HGA)方法,该方法可通过独立组装读段的不连贯子集,组合这些子集的组装,最后重新组装合并的重叠群与序列集来进一步利用如此高的覆盖率。原始读物。结论我们使用7个GAGE-B细菌数据集(包括100 bp Illumina HiSeq和250 bp Illumina MiSeq读数)对8个主要装配商进行了经验评估,覆盖范围为100x–〜200x。结果表明,对于所有评估的数据集并使用大多数评估的汇编程序(用于汇编不交集的子集),基于N50和更正的N50度量标准,HGA显着提高了汇编质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号