首页> 外文期刊>Talanta: The International Journal of Pure and Applied Analytical Chemistry >Machine learning combined with non-targeted LC-HRMS analysis for a risk warning system of chemical hazards in drinking water: A proof of concept
【24h】

Machine learning combined with non-targeted LC-HRMS analysis for a risk warning system of chemical hazards in drinking water: A proof of concept

机译:机器学习结合非针对性LC-HRMS分析饮用水中的化学危害风险警示系统:概念证明

获取原文
获取原文并翻译 | 示例
           

摘要

Guaranteeing clean drinking water to the global population is becoming more challenging, because of the cases of water scarcity across the globe, growing population, and increased chemical footprint of this population. Existing targeted strategies for hazard monitoring in drinking water are not adequate to handle such diverse and multidimensional stressors. In the current study, we have developed, validated, and tested a machine learning algorithm based on the data produced via non-targeted liquid chromatography coupled with high resolution mass spectrometry (LC-HRMS) for the identification of potential chemical hazards in drinking water. The machine learning algorithm consisted of a composite statistical model including an unsupervised component (i.e. principal component analysis PCA) and a supervised one (i.e. partial least square discrimination analysis PLS-DA). This model was trained using a training set of 20 drinking water samples previously tested via conventional suspect screening. The developed model was validated using a validation set of 20 drinking water samples of which 4 were spiked with 15 labeled standards at four different concentration levels. The model successfully detected all of the added analytes in the four spiked samples without producing any cases of false detection. The same validation set was processed via conventional trend analysis in order to cross validate the composite model. The results of cross validation showed that even though the conventional trend analysis approach produced a false positive detection rate of = 5% the composite model outperformed that approach by producing zero cases of false detection. Additionally, the validated model went through an additional test with 42 extra drinking water samples from the same source for an unbiased examination of the model. Finally, the potentials and limitations of this approach were further discussed.
机译:保证清洁饮用水的全球人口正在变得更具挑战性,因为在全球水资源短缺的情况下,人口增长和提高这一人群的化学足迹。对于危害饮用水监测现有针对性的策略不足以处理这种多样化和多层面的压力源。在目前的研究中,我们已经开发,验证和测试基于经由加上高分辨率质谱(LC-HRMS),用于饮用水的识别潜在的化学危害的非靶向液相色谱法所产生的数据的机器学习算法。机器学习算法包括一个复合统计模型包括无监督分量(即主成分分析)和监督一个(即偏最小二乘判别分析PLS-DA)。这个模型是使用以前通过传统的犯罪嫌疑人的筛选测试20的饮用水样品的训练集训练。使用验证集的20个饮用水样品,其中4例在四个不同的浓度水平掺入15个标记的标准开发的模型进行了验证。该模型成功地检测到所有的四个加标样品中加入分析物,而不产生错误检测的任何情况下。相同的验证组是为了越过验证复合模型经由常规趋势分析处理。交叉验证的结果表明,即使传统的趋势分析方法产生℃的假阳性检测率; = 5%的复合模型优于这种方法通过产生零例的错误检测的。此外,验证模型经历了从同一来源42个额外的饮用水样品的额外测试模型的公正审查。最后,进一步讨论了这种方法的潜力和局限性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号