Bagging-Based Logistic Regression With Spark: A Medical Data Mining Method


Abstract

Medical data, in its various organizational forms, is voluminous and heterogeneous, so efficient data mining techniques are important for exploring the patterns by which diverse diseases develop. However, many single-node data analysis tools lack sufficient memory and computing power, so distributed and parallel computing is in great demand. In this paper, we propose a comprehensive medical data mining method consisting of data preprocessing and bagging-based logistic regression with Spark (the BLR algorithm), which is adapted for better compatibility with Spark, a fast parallel computing framework. Experimental results indicated that although the BLR algorithm took slightly longer to run than logistic regression (LR), its accuracy was 2.12% higher than that of LR, and it also outperformed LR on other common evaluation indexes.
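The general idea of bagging applied to logistic regression can be illustrated with a minimal single-machine sketch. This is not the authors' BLR algorithm and does not use Spark: it is plain Python, and every name in it (`fit_logistic`, `fit_bagged`, `predict_bagged`, `make_toy_data`), the synthetic two-cluster data, and the hyperparameters are assumptions chosen for illustration. Each base model is trained on a bootstrap resample of the training set, and the ensemble predicts by majority vote.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, lr=0.1, epochs=200):
    """Fit logistic regression weights (bias first) by batch gradient descent."""
    n = len(X)
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            p = sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi)))
            err = p - yi  # gradient of the log loss w.r.t. the logit
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w


def predict_one(w, x):
    """Predict class 0 or 1 with a single logistic regression model."""
    p = sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], x)))
    return 1 if p >= 0.5 else 0


def fit_bagged(X, y, n_models=11, seed=0):
    """Train n_models base learners, each on a bootstrap resample of (X, y)."""
    rng = random.Random(seed)
    n = len(X)
    models = []
    for _ in range(n_models):
        # Sample n row indices with replacement (the bootstrap step).
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_logistic([X[i] for i in idx], [y[i] for i in idx]))
    return models


def predict_bagged(models, x):
    """Aggregate the base learners by majority vote."""
    votes = sum(predict_one(w, x) for w in models)
    return 1 if 2 * votes > len(models) else 0


def make_toy_data(n_per_class=50, seed=1):
    """Synthetic data (an assumption): two Gaussian clusters in 2-D."""
    rng = random.Random(seed)
    X, y = [], []
    for label, center in ((0, -2.0), (1, 2.0)):
        for _ in range(n_per_class):
            X.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
            y.append(label)
    return X, y


X, y = make_toy_data()
models = fit_bagged(X, y)
```

In a distributed setting the base learners are independent of one another, so they could in principle be trained in parallel; how the paper maps this onto Spark is not described in the abstract.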


Bibliographic details

Source: International Conference on Advances in Mechanical Engineering and Industrial Informatics | 2016 | pp. 831-1658 | 7 pages
Authors: Jian Pan; Yiang Hua; Xingtian Liu; Zhiqiang Chen; Zhaofeng Yan
Original format: PDF
Chinese Library Classification: TH-53
Keywords: Medical Data Mining; Bagging; Logistic Regression; Spark


Similar literature


1. Data mining methods in the prediction of Dementia: A real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests [J]. João Maroco, Dina Silva, Ana Rodrigues. BMC Research Notes. 2011, Issue 1

2. Investigating factors affecting the interval between a burn and the start of treatment using data mining methods and logistic regression [J]. Touraj Ahmadi-Jouybari, Somayeh Najafi-Ghobadi, Reza Karami-Matin. BMC Medical Research Methodology. 2021, Issue 1

3. Comparing data mining methods with logistic regression in childhood obesity prediction [J]. Shaoyan Zhang, Christos Tjortjis, Xiaojun Zeng. Information Systems Frontiers. 2009, Issue 4

4. Bagging-Based Logistic Regression With Spark: A Medical Data Mining Method [C]. Jian Pan, Yiang Hua, Xingtian Liu. International Conference on Advances in Mechanical Engineering and Industrial Informatics. 2016

5. Performance Enhancement of Logistic Regression for Big Data on Spark [D]. Wang, Mengyao. 2018

6. Data mining methods in the prediction of Dementia: A real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests [O]. João Maroco, Dina Silva, Ana Rodrigues. 2011

7. Bagging-Based Logistic Regression With Spark: A Medical Data Mining Method [O]. Jian Pan, Yiang Hua, Xingtian Liu. 2016

