...
首页> 外文期刊>Research Disclosure >Techniques for Mass Spectrometry - Cloud Connected Mass Spectrometry Data System
【24h】

Techniques for Mass Spectrometry - Cloud Connected Mass Spectrometry Data System

机译:质谱 - 云连接质谱数据系统的技术

获取原文
获取原文并翻译 | 示例
           

摘要

The 'atomic unit' of data in an liquid chromatography-mass spectrometry(LCMS) experiment is a mass spectrum. While LC apparatus provide compound separation utilities, the detected signal of interest is still the mass spectrum. Historically, because the atomic unit is so information rich it is typically stored as a vendor-specific raw file format Said another way, a collection of mass spectra encapsulating the time domain of the chromatogram is stored in binary format. Post-acquisition processing software typically accesses this data, conducts ETL (extract transform load) operations and runs various signal detection/annotation algorithms. The resulting data is transformed from a raw signal to a biological annotation (e.g molecular formula, peptide sequence etc) Advances in mass spectrometry instrumentation has resulted in a significant increase in data flux and presents a major bioinformatic challenge to traditional desktop applications. Of course, cloud computing strategies can be leveraged to scale algorithms/processing software but the .raw file format limits the ubiquitous application of distributed computing techniques to mass spectrometry specific data. That is, .raw files are the rate-limiting factor of applying massively parallel algorithm techniques such as map-reduce to mass spectrometry data. In other words, the .rawfile is not needed for almost all batch processing algorithms that only require a collection of scans (atomic signal units).
机译:液相色谱 - 质谱(LCMS)实验中的数据的“原子单元”是质谱。虽然LC装置提供复合分离实用程序,但检测到的感兴趣的信号仍然是质谱。从历史上看,因为原子单元是如此的信息,所以富有的信息通常被存储为一种特定于供应商的原始文件格式,所以封装色谱图的时域的质谱集合以二进制格式存储。后收购处理软件通常访问此数据,进行ETL(提取变换负载)操作并运行各种信号检测/注释算法。将得到的数据从原始信号转化为生物注释(例如分子式,肽序列等)质谱仪的进步导致数据通量显着增加,并对传统桌面应用提出了重大的生物信息挑战。当然,可以利用云计算策略来扩展算法/处理软件,但是.RAW文件格式限制了分布式计算技术对质谱特定数据的无处不在的应用。也就是说,.RAW文件是应用大规模并行算法技术的速率限制因子,例如MAP-Refey到质谱数据。换句话说,几乎所有只需要扫描的集合(原子信号单元)的几乎所有批处理算法都不需要.rawfile。

著录项

  • 来源
    《Research Disclosure》 |2020年第676期|1358-1359|共2页
  • 作者

  • 作者单位
  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号