首页> 外文会议>Annual ACM symposium on applied computing;ACM symposium on applied computing;SAC 2010 >FANGS: High Speed Sequence Mapping for Next generation Sequencers
【24h】

FANGS: High Speed Sequence Mapping for Next generation Sequencers

机译:FANGS:下一代音序器的高速音序映射

获取原文

摘要

Next Generation Sequencing machines are generating millions of short DNA sequences (reads) everyday. There is a need for efficient algorithms to map these sequences to the reference genome to identify SNPs or rare transcripts and to fulfill the dream of personalized medicine. We present a Fast Algorithm for Next Generation Sequencers (FANGS), which dynamically reduces the search space by using q-gram filtering and pigeon hole principle to rapidly map 454-Roche reads onto a reference genome. FANGS is a sequential algorithm designed to find all the matches of a query sequence in the reference genome tolerating a large number of mismatches or insertions/deletions. Using FANGS, we mapped 50000 reads with a total of 25 million nucleotides to the human genome in as little as 23.3 minutes on a typical desktop computer. Through our experiments, we found that FANGS is upto an order of magnitude faster than the state-of-the-art techniques for queries of length 500 allowing 5 mismatches or insertion/deletions.
机译:下一代测序仪每天产生数百万条短DNA序列(读段)。需要有效的算法来将这些序列映射到参考基因组,以识别SNP或稀有转录本,并实现个性化医学的梦想。我们提出了一种下一代测序仪的快速算法(FANGS),该算法通过使用q-gram过滤和信鸽原理将454-Roche读图快速映射到参考基因组上,从而动态减少了搜索空间。 FANGS是一种顺序算法,旨在在参考基因组中查找查询序列的所有匹配项,从而容许大量错配或插入/缺失。使用FANGS,我们在典型的台式计算机上,只需23.3分钟即可将50000个读数(共2500万个核苷酸)映射到人类基因组。通过我们的实验,我们发现FANGS的查询速度比最新技术快了一个数量级,而长度为500的查询允许5个不匹配或插入/删除。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号