首页> 外文期刊>Journal of Cheminformatics >Chemical Entity Semantic Specification: Knowledge representation for efficient semantic cheminformatics and facile data integration
【24h】

Chemical Entity Semantic Specification: Knowledge representation for efficient semantic cheminformatics and facile data integration

机译:化学实体语义规范:有效的语义化学信息学和便捷的数据集成的知识表示

获取原文
           

摘要

Background Over the past several centuries, chemistry has permeated virtually every facet of human lifestyle, enriching fields as diverse as medicine, agriculture, manufacturing, warfare, and electronics, among numerous others. Unfortunately, application-specific, incompatible chemical information formats and representation strategies have emerged as a result of such diverse adoption of chemistry. Although a number of efforts have been dedicated to unifying the computational representation of chemical information, disparities between the various chemical databases still persist and stand in the way of cross-domain, interdisciplinary investigations. Through a common syntax and formal semantics, Semantic Web technology offers the ability to accurately represent, integrate, reason about and query across diverse chemical information. Results Here we specify and implement the Chemical Entity Semantic Specification (CHESS) for the representation of polyatomic chemical entities, their substructures, bonds, atoms, and reactions using Semantic Web technologies. CHESS provides means to capture aspects of their corresponding chemical descriptors, connectivity, functional composition, and geometric structure while specifying mechanisms for data provenance. We demonstrate that using our readily extensible specification, it is possible to efficiently integrate multiple disparate chemical data sources, while retaining appropriate correspondence of chemical descriptors, with very little additional effort. We demonstrate the impact of some of our representational decisions on the performance of chemically-aware knowledgebase searching and rudimentary reaction candidate selection. Finally, we provide access to the tools necessary to carry out chemical entity encoding in CHESS, along with a sample knowledgebase. Conclusions By harnessing the power of Semantic Web technologies with CHESS, it is possible to provide a means of facile cross-domain chemical knowledge integration with full preservation of data correspondence and provenance. Our representation builds on existing cheminformatics technologies and, by the virtue of RDF specification, remains flexible and amenable to application- and domain-specific annotations without compromising chemical data integration. We conclude that the adoption of a consistent and semantically-enabled chemical specification is imperative for surviving the coming chemical data deluge and supporting systems science research.
机译:背景技术在过去的几个世纪中,化学几乎渗透到人类生活方式的方方面面,丰富了医学,农业,制造业,战争和电子学等众多领域。不幸的是,由于这种化学的广泛采用,出现了针对特定用途的,不兼容的化学信息格式和表示策略。尽管为统一化学信息的计算表示付出了许多努力,但各种化学数据库之间的差异仍然存在,并阻碍了跨领域,跨学科的研究。通过通用的语法和形式语义,语义Web技术提供了跨各种化学信息准确表示,集成,推理和查询的功能。结果在这里,我们指定并实现了化学实体语义规范(CHESS),用于使用语义Web技术表示多原子化学实体,其子结构,键,原子和反应。 CHESS提供了在指定数据来源机制的同时捕获其相应化学描述符,连通性,功能组成和几何结构等方面的方法。我们证明,使用我们易于扩展的规范,可以有效地集成多个不同的化学数据源,同时只需很少的额外努力即可保留化学描述符的适当对应关系。我们展示了一些代表性决策对化学感知型知识库搜索和基本反应候选者选择的性能的影响。最后,我们提供了在CHESS中进行化学实体编码所必需的工具以及示例知识库。结论通过利用CHESS来利用语义Web技术的强大功能,可以提供一种简便的跨域化学知识集成方法,并完全保留数据对应性和出处。我们的表示法建立在现有的化学信息学技术的基础之上,并且凭借RDF规范,在不影响化学数据集成的前提下,仍可灵活且适用于特定于应用程序和特定领域的注释。我们得出结论,采用一致且具有语义支持的化学规范对于生存即将到来的化学数据泛滥和支持系统科学研究至关重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号