【24h】

Assessment of robustness and transferability of classification models built for cancer diagnostics using Raman spectroscopy

机译:使用拉曼光谱法评估用于癌症诊断的分类模型的鲁棒性和可移植性

获取原文
获取原文并翻译 | 示例
           

摘要

Over recent years, Raman spectroscopy has been demonstrated as a prospective tool for application in cancer diagnostics. The use of Raman spectroscopy for this purpose relies on pattern recognition methods that have been developed to perform well on data achieved under laboratory conditions. However, the application of Raman spectroscopy as a routine clinical tool is likely to result in imperfect data due to instrument-to-instrument variation. Such corruption to the pure tissue spectral data is expected to negatively impact the classification performance of the diagnostic model. In this paper, we present a thorough assessment of the robustness of the Raman approach. This was achieved by perturbing a set of spectra in different ways, including various linear shifts, nonlinear shifts and random noise and using previously optimised classification models to predict the class membership of each spectrum in a testing set. The loss of predictive power with increased corruption was used to calculate a score, which allows an easy comparison of the model robustness. For this approach, three different types of classification models, including linear discriminant analysis (LDA), partial least square discriminant analysis (PLS-DA) and support vector machine (SVM), built for lymph node diagnostics were the subject of the robustness testing. The results showed that a linear perturbation had the highest impact on the performance of all classification models. Among all linear corruption methods, a gradient y-shift resulted in the highest performance loss. Thus, the factor most likely to affect the predictive outcome of models when using different systems is a gradient y-shift.
机译:近年来,拉曼光谱已被证明是用于癌症诊断的前瞻性工具。为此,拉曼光谱法的使用依赖于模式识别方法,该方法已被开发为在实验室条件下获得的数据上表现良好。但是,由于仪器之间的差异,拉曼光谱作为常规临床工具的应用可能会导致数据不完整。预期对纯组织光谱数据的这种破坏会对诊断模型的分类性能产生负面影响。在本文中,我们对拉曼方法的鲁棒性进行了全面评估。这是通过以各种方式(包括各种线性位移,非线性位移和随机噪声)干扰一组光谱并使用先前优化的分类模型来预测测试集中每个光谱的类成员身份来实现的。预测能力的损失随着腐败程度的增加而被用于计算得分,从而可以轻松比较模型的鲁棒性。对于这种方法,为进行淋巴结诊断而构建的三种不同类型的分类模型,包括线性判别分析(LDA),偏最小二乘判别分析(PLS-DA)和支持向量机(SVM),是鲁棒性测试的主题。结果表明,线性扰动对所有分类模型的性能影响最大。在所有线性破坏方法中,梯度y移位导致最高的性能损失。因此,使用不同系统时最有可能影响模型预测结果的因素是梯度y位移。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号