...
首页> 外文期刊>Genome Biology >Within-species contamination of bacterial whole-genome sequence data has a greater influence on clustering analyses than between-species contamination
【24h】

Within-species contamination of bacterial whole-genome sequence data has a greater influence on clustering analyses than between-species contamination

机译:在物种内,细菌全基因组序列数据的污染对聚类分析具有更大的影响而不是物种污染物之间的聚类分析

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Abstract Although it is assumed that contamination in bacterial whole-genome sequencing causes errors, the influences of contamination on clustering analyses, such as single-nucleotide polymorphism discovery, phylogenetics, and multi-locus sequencing typing, have not been quantified. By developing and analyzing 720 Listeria monocytogenes , Salmonella enterica , and Escherichia coli short-read datasets, we demonstrate that within-species contamination causes errors that confound clustering analyses, while between-species contamination generally does not. Contaminant reads mapping to references or becoming incorporated into chimeric sequences during assembly are the sources of those errors. Contamination sufficient to influence clustering analyses is present in public sequence databases.
机译:摘要虽然假设细菌全基因组测序中的污染导致误差,但污染对聚类分析的影响,例如单核苷酸多态性发现,系统发育和多基因座测序键入尚未被定量。通过开发和分析720个李斯特菌单核细胞增生,沙门氏菌肠和大肠杆菌短读数据集,我们证明了物种内污染导致困扰聚类分析的误差,而物种污染通常不会。污染物读取到参考或在组装期间掺入嵌合序列中的嵌入序列是这些误差的来源。足以影响聚类分析的污染存在于公共序列数据库中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号