首页> 外文OA文献 >Within-species contamination of bacterial whole-genome sequence data has a greater influence on clustering analyses than between-species contamination
【2h】

Within-species contamination of bacterial whole-genome sequence data has a greater influence on clustering analyses than between-species contamination

机译:在物种内,细菌全基因组序列数据的污染对聚类分析具有更大的影响而不是物种污染物之间的聚类分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Abstract Although it is assumed that contamination in bacterial whole-genome sequencing causes errors, the influences of contamination on clustering analyses, such as single-nucleotide polymorphism discovery, phylogenetics, and multi-locus sequencing typing, have not been quantified. By developing and analyzing 720 Listeria monocytogenes, Salmonella enterica, and Escherichia coli short-read datasets, we demonstrate that within-species contamination causes errors that confound clustering analyses, while between-species contamination generally does not. Contaminant reads mapping to references or becoming incorporated into chimeric sequences during assembly are the sources of those errors. Contamination sufficient to influence clustering analyses is present in public sequence databases.
机译:摘要虽然假设细菌全基因组测序中的污染导致误差,但污染对聚类分析的影响,例如单核苷酸多态性发现,系统发育和多基因座测序键入尚未被定量。通过开发和分析720个李斯特菌单核细胞增生,沙门氏菌肠和大肠杆菌短读数据集,我们证明了物种内污染导致困扰聚类分析的误差,而物种污染通常不会。污染物读取到参考或在组装期间掺入嵌合序列中的嵌入序列是这些误差的来源。足以影响聚类分析的污染存在于公共序列数据库中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号