Separating metagenomic short reads into genomes via clustering

机译：通过聚类将宏基因组短读分为基因组

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

BackgroundThe metagenomics approach allows the simultaneous sequencing of all genomes in an environmental sample. This results in high complexity datasets, where in addition to repeats and sequencing errors, the number of genomes and their abundance ratios are unknown. Recently developed next-generation sequencing (NGS) technologies significantly improve the sequencing efficiency and cost. On the other hand, they result in shorter reads, which makes the separation of reads from different species harder. Among the existing computational tools for metagenomic analysis, there are similarity-based methods that use reference databases to align reads and composition-based methods that use composition patterns (i.e., frequencies of short words or l-mers) to cluster reads. Similarity-based methods are unable to classify reads from unknown species without close references (which constitute the majority of reads). Since composition patterns are preserved only in significantly large fragments, composition-based tools cannot be used for very short reads, which becomes a significant limitation with the development of NGS. A recently proposed algorithm, AbundanceBin, introduced another method that bins reads based on predicted abundances of the genomes sequenced. However, it does not separate reads from genomes of similar abundance levels.

机译：背景技术宏基因组学方法允许对环境样品中的所有基因组进行同时测序。这导致了高复杂度的数据集，除了重复和测序错误外，基因组的数量及其丰度比还未知。最近开发的下一代测序（NGS）技术显着提高了测序效率和成本。另一方面，它们导致较短的读取，这使得从不同物种读取的分离更加困难。在用于宏基因组分析的现有计算工具中，有基于相似度的方法使用参考数据库来对齐读段，以及基于组合物的方法使用组成模式（即短单词或l-mers的频率）来聚类阅读。基于相似度的方法无法在没有密切参考的情况下对未知物种的读物进行分类（这构成了大部分读物）。由于构图模式仅保留在很大的片段中，因此基于构图的工具无法用于非常短的读取，这随着NGS的发展而成为重大限制。最近提出的算法AbundanceBin引入了另一种方法，该方法可根据测序的基因组的预测丰度对装箱读数进行分类。但是，它没有将读物与相似丰度水平的基因组分开。

著录项

期刊名称 Algorithms for Molecular Biology : AMB
作者
Olga Tanaseichuk; James Borneman; Tao Jiang;
展开▼
作者单位

展开▼
年(卷),期 2019(7),
年度 2019
页码 27
总页数 15
原文格式 PDF
正文语种
中图分类应用微生物学;生化遗传学;生化药理学;
关键词
Metagenomics, NGS short reads, Genome separation, Clustering;

机译：元基因组学;NGS短读;基因组分离;聚类;

相似文献

外文文献
中文文献
专利

1. Separating metagenomic short reads into genomes via clustering [J] . Olga Tanaseichuk, James Borneman, Tao Jiang Algorithms for Molecular Biology . 2012,第1期

机译：通过聚类将宏基因组短读片段分离到基因组中
2. Complete 4.55-Megabase-Pair Genome of “Candidatus Fluviicola riflensis,” Curated from Short-Read Metagenomic Sequences [J] . Jillian F. Banfield, Karthik Anantharaman, Kenneth H. Williams, Genome Announcements . 2017,第47期

机译：从短读的元基因组序列中筛选出的“ Candidatus Fluviicola riflensis”完整的4.55碱基对基因组。
3. Individual genome assembly from complex community short-read metagenomic datasets [J] . Luo C., Tsementzi D., Kyrpides N.C., The ISME journal emultidisciplinary journal of microbial ecology . 2012,第4期

机译：来自复杂社区的短基因组学数据集的个体基因组组装
4. Separating Metagenomic Short Reads into Genomes via Clustering (Extended Abstract) [C] . Olga Tanaseichuk, James Borneman, Tao Jiang Algorithms in bioinformatics . 2011

机译：通过聚类将超基因组短片段读入基因组中（扩展摘要）
5. Scaling short read de novo DNA sequence assembly to gigabase genomes. [D] . Cook, Jeffrey J. 2011

机译：将短读从头DNA序列组装扩展到gigabase基因组。
6. MTR: taxonomic annotation of short metagenomic reads using clustering at multiple taxonomic ranks [O] . Fabio Gori, Gianluigi Folino, Mike S. M. Jetten, -1

机译：MTR：在多个分类学等级上使用聚类对短宏基因组读物进行分类注释
7. Separating metagenomic short reads into genomes via clustering [O] . 2012

机译：通过聚类将宏基因组短读分为基因组

Separating metagenomic short reads into genomes via clustering

摘要

著录项

相似文献

相关主题

期刊订阅