首页> 外文会议>Asia-Pacific Bioinformatics Conference >GOChase-ll: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products
【24h】

GOChase-ll: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products

机译:gochase-ll:校正基于基因本体的基因本体的语义不一致的基因产品

获取原文

摘要

Background: The Gene Ontology (GO) provides a controlled vocabulary for describing genes and gene products. In spite of the undoubted importance of GO, several drawbacks associated with GO and GO-based annotations have been introduced. We identified three types of semantic inconsistencies in GO-based annotations; semantically redundant, biological-domain inconsistent and taxonomy inconsistent annotations. Methods: To determine the semantic inconsistencies in GO annotation, we used the hierarchical structure of GO graph and tree structure of NCBI taxonomy. Twenty seven biological databases were collected for finding semantic inconsistent annotation. Results: The distributions and possible causes of the semantic inconsistencies were investigated usingtwenty seven biological databases with GO-based annotations. We found that some evidence codes of annotation were associated with the inconsistencies. The numbers of gene products and species in a database that are related to the complexity of databasemanagement are also in correlation with the inconsistencies. Consequently, numerous annotation errors arise and are propagated throughout biological databases and GO-based high-level analyses. GOChase-ll is developed to detect and correct both syntacticand semantic errors in GO-based annotations. Conclusions: We identified some inconsistencies in GO-based annotation and provided software, GOChase-ll, for correcting these semantic inconsistencies in addition to the previous corrections for the syntacticerrors by GOChase-l.
机译:背景:基因本体(GO)提供了用于描述基因和基因产物的受控词汇。尽管Go的重要意义,但介绍了与Go和Go-Go的注释相关的几个缺点。我们确定了基于Go的注释中的三种语义不一致;语义冗余,生物结构域不一致和分类不一致的注释。方法:要确定GO注释中的语义不一致,我们使用了NCBI分类的GO图和树结构的层次结构。收集了二十七个生物数据库,以寻找语义不一致的注释。结果:使用基于GO-Sep的注释,调查了语义不一致的分布和可能的原因。我们发现一些有证据的注释代码与不一致相关。与数据库中的数据库中的基因产品和物种的数量也与不一致性相关。因此,出现了许多注释误差,并在整个生物数据库中传播并基于Go的高级分析。 GoChase-LL是开发的,以检测并纠正基于Go的注释中的SyntacticAnd语义错误。结论:我们在基于Go的注释和软件,Gochase-LL中确定了一些不一致,以纠正这些语义不一致,除了Gochase-L的先前对语法识别的校正。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号