首页> 外文会议>International Coference of Reliable Information and Communication Technology >Data Mining Techniques for Disease Risk Prediction Model: A Systematic Literature Review
【24h】

Data Mining Techniques for Disease Risk Prediction Model: A Systematic Literature Review

机译:疾病风险预测模型的数据挖掘技术:系统文献综述

获取原文

摘要

Risk prediction model estimates event occurrence based on related data. Conventional statistical metrics that utilized primary data generates simple descriptive analysis that often provide insufficient knowledge for decision making. In contrast, data mining techniques that have the capability to find hidden pattern from the secondary data in large databases and create prediction for de- sired output has become a popular approach to develop any risk prediction model. In healthcare particularly, data mining techniques can be applied in disease risk prediction model to provide reliable prediction on the possibility of acquiring the disease based on individual's clinical and non-clinical data. Due to the increased use of data mining in healthcare, this study aims at identifying the data mining techniques and algorithms that are commonly implemented in studies related to various disease risk prediction model as well as finding the accuracy of the algorithms. The accuracy evaluation consists of various method, but this paper is focusing on overall accuracy which is measured by the total number of correctly predicted output over the total number of prediction. A systematic literature review approach that search across five databases found 170 articles, of which 7 articles were selected in the final process. This review found that most prediction model used classification technique, with a focus on decision tree, neural network, support vector machines, and Naive Bayes algorithms where heart-related disease is commonly studied. Further research can apply similar algorithms to develop risk prediction model for other types of diseases, such as infectious disease prediction.
机译:风险预测模型估计基于相关数据的事件发生。使用主要数据的常规统计指标产生简单的描述性分析,通常提供对决策的不足知识。相反,具有能够从大型数据库中的次要数据找到隐藏模式的数据挖掘技术,并为正在进行的输出创建预测已经成为开发任何风险预测模型的流行方法。在医疗保健中,可以在疾病风险预测模型中应用数据挖掘技术,以提供基于个体临床和非临床数据获取疾病的可能性的可靠预测。由于在医疗保健中使用数据挖掘的增加,本研究旨在识别在与各种疾病风险预测模型相关的研究中常见的数据挖掘技术和算法以及找到算法的准确性。精度评估由各种方法组成,但本文专注于整体精度,该总精度是通过在预测总数上的正确预测输出的总数测量。系统的文献综述方法,在五个数据库中搜索的方法发现了170篇文章,其中在最终过程中选择了7篇文章。该审查发现,大多数预测模型使用的分类技术,专注于决策树,神经网络,支持向量机和通常研究心脏相关疾病的幼稚贝叶斯算法。进一步的研究可以应用类似的算法,以制定其他类型的疾病的风险预测模型,例如传染病预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号