首页> 外文期刊>Engineering Applications of Artificial Intelligence >Time series classification for the prediction of dialysis in critically ill patients using echo state networks
【24h】

Time series classification for the prediction of dialysis in critically ill patients using echo state networks

机译:使用回波状态网络的时间序列分类来预测重症患者的透析

获取原文
获取原文并翻译 | 示例
           

摘要

Objective: Time series often appear in medical databases, but only few machine learning methods exist that process this kind of data properly. Most modeling techniques have been designed with a static data model in mind and are not suitable for coping with the dynamic nature of time series. Recurrent neural networks (RNNs) are often used to process time series, but only a few training algorithms exist for RNNs which are complex and often yield poor results. Therefore, researchers often turn to traditional machine learning approaches, such as support vector machines (SVMs), which can easily be set up and trained and combine them with feature extraction (FE) and selection (FS) to process the high-dimensional temporal data. Recently, a new approach, called echo state networks (ESNs), has been developed to simplify the training process of RNNs. This approach allows modeling the dynamics of a system based on time series data in a straightforward way. The objective of this study is to explore the advantages of using ESN instead of other traditional classifiers combined with FE and FS in classification problems in the intensive care unit (ICU) when the input data consists of time series. While ESNs have mostly been used to predict the future course of a time series, we use the ESN model for classification instead. Although time series often appear in medical data, little medical applications of ESNs have been studied yet. Methods and material: ESN is used to predict the need for dialysis between the fifth and tenth day after admission in the ICU. The input time series consist of measured diuresis and creatinine values during the first 3 days after admission. Data about 830 patients was used for the study, of which 82 needed dialysis between the fifth and tenth day after admission. ESN is compared to traditional classifiers, a sophisticated and a simple one, namely support vector machines and the naive Bayes (NB) classifier. Prior to the use of the SVM and NB classifier, FE and FS is required to reduce the number of input features and thus alleviate the curse dimensionality. Extensive feature extraction was applied to capture both the overall properties of the time series and the correlation between the different measurements in the time series. The feature selection method consists of a greedy hybrid filter-wrapper method using a NB classifier, which selects in each iteration the feature that improves prediction the best and shows little multicollinearity with the already selected set Least squares regression with noise was used to train the linear readout function of the ESN to mitigate sensitivity to noise and overfitting. Fisher labeling was used to deal with the unbalanced data set. Parameter sweeps were performed to determine the optimal parameter values for the different classifiers. The area under the curve (AUC) and maximum balanced accuracy are used as performance measures. The required execution time was also measured. Results: The classification performance of the ESN shows significant difference at the 5% level compared to the performance of the SVM or the NB classifier combined with FE and FS. The NB+FE+FS, with an average AUC of 0.874, has the best classification performance. This classifier is followed by the ESN, which has an average AUC of 0.849. The SVM+FE+FS has the worst performance with an average AUC of 0.838. The computation time needed to pre-process the data and to train and test the classifier is significantly less for the ESN compared to the SVM and NB. Conclusion: It can be concluded that the use of ESN has an added value in predicting the need for dialysis through the analysis of time series data. The ESN requires significantly less processing time, needs no domain knowledge, is easy to implement, and can be configured using rules of thumb.
机译:目的:时间序列经常出现在医学数据库中,但是只有很少的机器学习方法可以正确处理此类数据。大多数建模技术在设计时都考虑了静态数据模型,因此不适合应对时间序列的动态性质。经常使用递归神经网络(RNN)来处理时间序列,但是针对RNN的训练算法很少,这些算法很复杂并且通常会产生较差的结果。因此,研究人员经常转向传统的机器学习方法,例如支持向量机(SVM),可以轻松地对其进行设置和培训,并将其与特征提取(FE)和选择(FS)结合起来以处理高维时间数据。最近,开发了一种称为回波状态网络(ESN)的新方法来简化RNN的训练过程。这种方法允许以简单的方式基于时间序列数据对系统的动力学建模。这项研究的目的是探索在输入数据包含时间序列的情况下,在重症监护病房(ICU)的分类问题中使用ESN代替其他传统分类器与FE和FS相结合的优势。尽管ESN主要用于预测时间序列的未来发展,但我们使用ESN模型进行分类。尽管时间序列经常出现在医学数据中,但对ESN的医学应用却很少研究。方法和材料:ESN用于预测入ICU后第五天到第十天之间的透析需求。输入的时间序列包括入院后前三天内测得的利尿和肌酐值。该研究使用了约830名患者的数据,其中82例在入院后第五至第十天需要透析。将ESN与传统分类器(一种复杂而简单的分类器)进行比较,即支持向量机和朴素贝叶斯(NB)分类器。在使用SVM和NB分类器之前,需要使用FE和FS减少输入特征的数量,从而减轻诅咒维数。应用了广泛的特征提取来捕获时间序列的整体属性以及时间序列中不同测量值之间的相关性。特征选择方法由使用NB分类器的贪婪混合滤波器包装器方法组成,该方法在每次迭代中选择可改善预测效果的特征,并与已选择的集合显示几乎没有多重共线性,使用带有噪声的最小二乘回归来训练线性ESN的读取功能可减轻对噪声和过度拟合的敏感性。 Fisher标记用于处理不平衡的数据集。进行参数扫描以确定不同分类器的最佳参数值。曲线下的面积(AUC)和最大平衡精度用作性能指标。还测量了所需的执行时间。结果:与SVM或NB分类器结合FE和FS的性能相比,ESN的分类性能在5%的水平上显示出显着差异。 NB + FE + FS,平均AUC为0.874,具有最佳的分类性能。该分类器后面是ESN,其平均AUC为0.849。 SVM + FE + FS的性能最差,平均AUC为0.838。与SVM和NB相比,ESN预处理数据以及训练和测试分类器所需的计算时间明显更少。结论:可以得出结论,ESN的使用在通过时间序列数据分析来预测透析需求方面具有附加价值。 ESN大大减少了处理时间,不需要领域知识,易于实现,并且可以使用经验法则进行配置。

著录项

  • 来源
  • 作者单位

    Department of Information Technology (INTEC), Ghent University - Interdisciplinary Institute for Broadband Technology (IBBT), Caston Crommenlaan 8 bus 201,B-9050 Ghent, Belgium;

    Department of Environmental Modelling, Flemish Institute for Technological Research (VITO), Boeretang 200, 2400 Mot, Belgium;

    Department of Electronics and Information Systems (ELIS), Ghent University, Sint-Pietersnieuwstraat 41, 9000 Ghent, Belgium;

    Department of Intensive Care Medicine, Ghent University Hospital, De Pintelaan 185 - 2 KI2 IC, B-9000 Ghent, Belgium;

    Department of Intensive Care Medicine, Ghent University Hospital, De Pintelaan 185 - 2 KI2 IC, B-9000 Ghent, Belgium;

    Department of Information Technology (INTEC), Ghent University - Interdisciplinary Institute for Broadband Technology (IBBT), Caston Crommenlaan 8 bus 201,B-9050 Ghent, Belgium;

    Department of Information Technology (INTEC), Ghent University - Interdisciplinary Institute for Broadband Technology (IBBT), Caston Crommenlaan 8 bus 201,B-9050 Ghent, Belgium;

    Department of Electronics and Information Systems (ELIS), Ghent University, Sint-Pietersnieuwstraat 41, 9000 Ghent, Belgium;

    Department of Intensive Care Medicine, Ghent University Hospital, De Pintelaan 185 - 2 KI2 IC, B-9000 Ghent, Belgium;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Time series; Classification; Echo state network; Dialysis; Feature extraction and selection;

    机译:时间序列;分类;回声状态网络;透析;特征提取和选择;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号