An Identity Crisis in the Life Sciences

Abstract

myGrid is an e-Science project that helps life scientists build workflows that gather data from distributed, autonomous, replicated and heterogeneous resources. The provenance logs of workflow executions are recorded as RDF graphs. The log of a single workflow run traces the history of that run's execution. By aggregating the provenance logs of many workflow runs, however, one can also gather the provenance of a data product that is shared across multiple derivation paths. Successful aggregation relies on accurate and universal identification of each data product, which the nature of bioinformatics data and services makes difficult. We describe the identity problem in bioinformatics data and present a protocol for managing identity co-references and allocating identities to gathered and computed data products. Overcoming this problem means that the provenance of workflows in bioinformatics and other domains can be exploited to enhance the practice of e-Science.
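
The abstract does not spell out the protocol itself, so the following is only an illustrative sketch of the general idea: when two identifiers are asserted to name the same data product, provenance statements from different runs can be merged under one canonical identifier. It uses a plain union-find structure and simple (subject, predicate, object) tuples rather than a real RDF library. The class and function names, the `derivedFrom` predicate, and the `urn:run-…` identifiers are all hypothetical and are not taken from the paper.

```python
class CoReferences:
    """Union-find over identifiers asserted to name the same data product."""

    def __init__(self):
        self.parent = {}

    def find(self, ident):
        """Return the canonical identifier for ident."""
        self.parent.setdefault(ident, ident)
        root = ident
        while self.parent[root] != root:
            root = self.parent[root]
        # Path compression: point every node on the path directly at the root.
        while self.parent[ident] != root:
            self.parent[ident], ident = root, self.parent[ident]
        return root

    def same_as(self, a, b):
        """Record that a and b refer to the same data product."""
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            # Keep the lexicographically smaller root so the result is deterministic.
            keep, drop = sorted((ra, rb))
            self.parent[drop] = keep


def aggregate(runs, corefs):
    """Merge triples from many runs, rewriting identifiers to canonical ones."""
    merged = set()
    for triples in runs:
        for s, p, o in triples:
            merged.add((corefs.find(s), p, corefs.find(o)))
    return merged


def derivations_of(merged, product, corefs):
    """List everything the given product was derived from, across all runs."""
    target = corefs.find(product)
    return sorted(o for s, p, o in merged if s == target and p == "derivedFrom")


# Hypothetical provenance logs from two separate workflow runs.
run_a = [("urn:run-a:out1", "derivedFrom", "urn:run-a:blast-report")]
run_b = [("urn:run-b:out7", "derivedFrom", "urn:run-b:alignment")]

corefs = CoReferences()
# Assumed co-reference assertion: both runs produced the same data product.
corefs.same_as("urn:run-a:out1", "urn:run-b:out7")

merged = aggregate([run_a, run_b], corefs)
paths = derivations_of(merged, "urn:run-b:out7", corefs)
```

Without the co-reference assertion, each run's output would keep its own identifier and the two derivation paths would never be linked, which is the aggregation failure the abstract attributes to inconsistent identification.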
