首页> 外文会议>International Conference on Machine Learning and Data Engineering >Using Electronic Health Records and Machine Learning to Make Medical-Related Predictions from Non-Medical Data
【24h】

Using Electronic Health Records and Machine Learning to Make Medical-Related Predictions from Non-Medical Data

机译:使用电子病历和机器学习根据非医学数据做出医学相关预测

获取原文

摘要

Objectives: Administrative HIS (Hospital Information System) and EHR (Electronic Health Record) data are characterized by lower privacy sensitivity, thus easier portability and handling, as well as higher information quality. In this paper we test the hypothesis that the application of machine learning techniques on data of this nature can be used to address prediction/forecasting problems in the Health IT domain. The novelty of this approach consists in that medical data (test results, diagnoses, doctors' notes etc.) are not included in the predictors' dataset. Moreover, there is limited need for separation of patient cohorts based on specific health conditions. Methods: We experiment with the prediction of the probability of early readmission at the time of a patient's discharge. We extract real HIS data and perform data processing techniques. We then apply a series of machine learning algorithms (Logistic Regression, Support Vector Machine, Gaussian Naïve Bayes, K-Nearest Neighbors and Deep Multilayer Neural Network) and measure the performance of the emergent models. Results: All applied methods performed well above random guessing, even with minimal hyper-parameter tuning. Conclusions: Given that the experiments provide evidence in favor of the underlying hypothesis, future experimentation on more fine-tuned (thus more robust) models could result in applications suited for productive environments.
机译:目标:行政性HIS(医院信息系统)和EHR(电子健康记录)数据的特征在于较低的隐私敏感性,因此更易于携带和处理,并具有更高的信息质量。在本文中,我们测试了以下假设:机器学习技术在这种性质的数据上的应用可以用于解决Health IT领域中的预测/预测问题。这种方法的新颖之处在于,医学数据(测试结果,诊断,医生记录等)不包含在预测变量的数据集中。此外,基于特定的健康状况来分离患者队列的需求有限。方法:我们对患者出院时提前再次入院的可能性进行了预测。我们提取真实的HIS数据并执行数据处理技术。然后,我们应用一系列机器学习算法(Logistic回归,支持向量机,高斯朴素贝叶斯,K最近邻和深度多层神经网络)并测量新兴模型的性能。结果:所有应用的方法都比随机猜测的效果好,即使最小化超参数调整也是如此。结论:鉴于实验提供了支持基本假设的证据,未来在更精细(因此更可靠)的模型上进行的实验可能会导致适用于生产环境的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号