Journal of Biomolecular Techniques : JBT

P19-S Managing Proteomics Data from Data Generation and Data Warehousing to Central Data Repository and Journal Reviewing Processes

Abstract

Today's proteomics research relies on a variety of techniques and instrumentation, and bioinformatics tools are necessary to manage the resulting large amount of heterogeneous data with automatic quality control so that results are reliable and comparable. A data-processing pipeline is therefore mandatory for data validation and comparison in a data-warehousing system. The proteome bioinformatics platform ProteinScape has been proven to cover these needs. The MS data of the HUPO BPP participants were reprocessed within ProteinScape, and the reprocessed information was transferred into the global data repository PRIDE.

As a data-warehousing system, ProteinScape covers two main aspects: archiving the relevant data of the proteomics workflow, and information extraction (protein identification, quantification, and generation of biological knowledge). As a strategy for automatic data validation, different protein search engines are integrated. Result analysis uses a decoy database search strategy, which allows the false-positive identification rate to be measured. Peptide identifications across different workflows, different MS techniques, and different search engines are merged to obtain a quality-controlled protein list.

The proteomics identifications database (PRIDE), as a public data repository, is an archiving system where data are stored in final form and are no longer changed by further processing steps. Data submission to PRIDE is open to proteomics laboratories generating protein and peptide identifications. An export tool has been developed for transferring all relevant HUPO BPP data from ProteinScape into PRIDE using the PRIDE.xml format.

The EU-funded ProDac project will coordinate the development of software tools covering international standards for the representation of proteomics data. The implementation of data submission pipelines and systematic data collection in public standards-compliant repositories will cover all aspects, from the generation of MS data in each laboratory to the conversion of all annotating information and identifications to a standardized format. Such datasets can be used in the course of publishing in scientific journals.
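
The abstract states that a decoy database search strategy is used to measure the false-positive identification rate, but does not say how the rate is calculated. The following is a minimal sketch of one common way to estimate it, not ProteinScape's actual method. It assumes each peptide-spectrum match carries a score where higher is better and a flag marking hits against the decoy database, and that the decoy database is the same size as the target database. The `PSM` type, the `estimate_fdr` function, and the example peptides and scores are all hypothetical and made up for illustration.

```python
from typing import Iterable, NamedTuple


class PSM(NamedTuple):
    """One peptide-spectrum match (hypothetical structure for illustration)."""
    peptide: str
    score: float     # assumption: higher score means a better match
    is_decoy: bool   # True if the hit came from the decoy database


def estimate_fdr(psms: Iterable[PSM], threshold: float) -> float:
    """Estimate the false-positive rate among target hits scoring >= threshold.

    Uses the simple ratio (decoy hits) / (target hits), which assumes the
    decoy database is as large as the target database.
    """
    targets = 0
    decoys = 0
    for psm in psms:
        if psm.score >= threshold:
            if psm.is_decoy:
                decoys += 1
            else:
                targets += 1
    if targets == 0:
        return 0.0  # no accepted target hits, so nothing to be wrong about
    return decoys / targets


# Made-up example data: four target hits and two decoy hits.
psms = [
    PSM("PEPTIDEA", 9.1, False),
    PSM("PEPTIDEB", 8.4, False),
    PSM("PEPTIDEC", 7.9, False),
    PSM("PEPTIDED", 7.2, False),
    PSM("XEDITPEP", 7.5, True),
    PSM("YEDITPEP", 3.0, True),
]

print(estimate_fdr(psms, threshold=7.0))  # 1 decoy / 4 targets -> 0.25
print(estimate_fdr(psms, threshold=8.0))  # 0 decoys / 2 targets -> 0.0
```

Raising the threshold trades sensitivity for a lower estimated false-positive rate. A pipeline that merges results from several search engines would typically pick a threshold per engine before combining the accepted identifications.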