首页> 外文期刊>Journal of Bioinformatics and Computational Biology >HUMAN TRASH ESTs — SEQUENCES FROM cDNA COLLECTION THAT ARE NOT ALIGNED TO GENOME ASSEMBLY
【24h】

HUMAN TRASH ESTs — SEQUENCES FROM cDNA COLLECTION THAT ARE NOT ALIGNED TO GENOME ASSEMBLY

机译:人体耗材EST-与基因组装配没有关联的cDNA收集序列

获取原文
获取原文并翻译 | 示例
           

摘要

Expressed sequence tags (ESTs) represent 500–1000-bp-long sequences corresponding to mRNAs derived from different sources (cell lines, tissues, etc.). The human EST database contains over 8,000,000 sequences, with over 4,000,000,000 total nucleotides. RNA molecules are transcribed from a genomic DNA template; therefore, all ESTs should match corresponding genomes. Nevertheless, we have found in the human EST database approximately 11,000 ESTs not matching sequences in the human genome database. The presence of "trash" ESTs (TESTs) in the EST database could result from DNA or RNA contamination of the laboratory equipment, tissues, or cell lines. TESTs could also represent sequences from unidentified human genes or from species inhabiting the human body. Here, we attempt to identify the sources of human EST database contaminations. In particular, we discuss systematic contamination of the mammalian EST databases with sequences of plants.
机译:表达的序列标签(EST)代表500-1000 bp长的序列,对应于源自不同来源(细胞系,组织等)的mRNA。人类EST数据库包含超过8,000,000个序列,总核苷酸超过4,000,000,000。 RNA分子从基因组DNA模板转录;因此,所有的EST应该匹配相应的基因组。然而,我们在人类EST数据库中发现了约11,000个与人类基因组数据库中的序列不匹配的EST。 EST数据库中“垃圾” EST(TEST)的存在可能是由实验室设备,组织或细胞系的DNA或RNA污染引起的。 TESTs还可以代表来自未鉴定的人类基因或人类居住物种的序列。在这里,我们尝试确定人类EST数据库污染的来源。特别是,我们讨论了植物序列对哺乳动物EST数据库的系统污染。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号