首页> 外文会议>Annual International Conference on Research in Computational Molecular Biology >MosaicFlye: Resolving Long Mosaic Repeats Using Long Reads
【24h】

MosaicFlye: Resolving Long Mosaic Repeats Using Long Reads

机译:MosaicFlye:使用长读来解决长马赛克重复问题

获取原文

摘要

Although long-read sequencing technologies opened a new era in genome assembly, the problem of resolving unbridged repeats (i.e., repeats that are not spanned by any reads) such as long segmental duplications in the human genome remains largely unsolved, making it a major obstacle towards achieving the goal of complete genome assemblies. Thus, long-read assemblers currently face the same repeat-resolution challenge that short-read assemblers faced a decade ago (albeit at a different scale of repeat lengths). Long error-prone reads and short accurate reads have their strengths and weaknesses, e.g., short reads may resolve some long repeats that are difficult to resolve with long reads. For example, diverged copies of a long repeat (e.g., copies differing by 3%) often don't share k-mers (for a typical k-mer size used in short-read assemblers) and thus are automatically resolved by the de Bruijn graph-based assemblers. In contrast, long-read assemblers face difficulties resolving such repeats since repeat copies with a 3% divergence are difficult to distinguish using the error-prone reads with error rates exceeding 10%. Thus, long-read assemblers trade the ability to resolve the unbridged but divergent repeat copies for the ability to resolve bridged repeats.
机译:尽管长读测序技术开启了基因组装配的新纪元,但解决未桥连的重复序列(例如,任何读段均未覆盖的重复序列)的问题(例如人类基因组中的长片段重复)仍未解决,这使其成为主要障碍实现完整的基因组组装的目标。因此,长读汇编器目前面临与十年前短读汇编器相同的重复分辨率挑战(尽管重复长度的比例不同)。容易出错的长读取和准确的短读取都有其优点和缺点,例如,短读取可能会解析一些长重复,而长读取则很难解决。例如,长重复的不同副本(例如,副本相差3%)通常不共享k-mers(对于短读汇编器中使用的典型k-mer大小),因此由de Bruijn自动解决基于图的汇编程序。相反,长时间读取的汇编程序面临解决此类重复的困难,因为使用错误率超过10%的易错读取很难区分差异为3%的重复副本。因此,长时间阅读的汇编程序将解析未桥接但发散的重复副本的能力换为解析桥接重复的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号