An Improved Naive Bayesian Classification Algorithm for Massive Data

机译：一种改进的大型数据贝叶斯分类算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

For the low speed and accuracy in massive data classification, an improved Naive Bayesian classification algorithm for mass data processing is proposed. Firstly, feature rough clustering is carried out to cluster the features to reduce the computational complexity of feature association. Secondly, the association rules algorithm is used to mine frequent item sets of rough clustering subsets, and the generated frequent item sets are used to filter the features based on the result of classification. And then, the feature set after feature selection is weighted to improve the accuracy. Finally, the improved algorithm is implemented on the MapReduce parallelization platform and tested with five data sets of different sizes. The experimental results show that the improved algorithm in this paper could save a lot of running time when dealing with large-scale data sets, and maintain high accuracy.

机译：对于大规模数据分类的低速和准确性，提出了一种改进的朴素贝叶斯分类算法，用于质量数据处理。首先，进行特征粗群以进行聚类，以降低特征关联的计算复杂性。其次，关联规则算法用于常用的粗群集群集合频繁的粗簇子集，并且生成的频繁项目集用于基于分类结果来过滤这些功能。然后，重量特征选择后的功能设置以提高精度。最后，改进的算法在MapreduceParleastization平台上实现，并用五种不同大小的数据集进行了测试。实验结果表明，在处理大规模数据集时，本文的改进算法可以节省大量运行时间，并保持高精度。

著录项

来源
《IEEE International Conference on Control Science and Systems Engineering》|2018年|563p|共6页
会议地点
作者
Sun Tongjing; Li Ji; Ning Ke;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP2-53;
关键词
Correlation; Classification algorithms; Bayes methods; Data mining; Clustering algorithms; Feature extraction; Correlation coefficient;

机译：相关性;分类算法;贝叶斯方法;数据挖掘;聚类算法;特征提取;相关系数;

相似文献

外文文献
中文文献
专利

1. Application of improved distributed naive Bayesian algorithms in text classification [J] . Gao Hongyi, Zeng Xi, Yao Chunhua Journal of supercomputing . 2019,第9期

机译：改进的分布式朴素贝叶斯算法在文本分类中的应用
2. Improving classification performance using unlabeled data: Naive Bayesian case [J] . Chang-Hwan Lee Knowledge-Based Systems . 2007,第3期

机译：使用未标记的数据提高分类性能：朴素贝叶斯案例
3. Differential Classification Method in Different Teaching Models of Accounting Courses Based on Naive Bayesian Classification Algorithm [J] . Xiuying Ou International Journal of Emerging Technologies in Learning (iJET) . 2019,第8期

机译：基于朴素贝叶斯分类算法的会计课程不同教学模式下的差异分类方法
4. An Improved Naive Bayesian Classification Algorithm for Massive Data [C] . Sun Tongjing, Li Ji, Ning Ke International Conference on Control Science and Systems Engineering . 2018

机译：改进的朴素贝叶斯海量分类算法
5. Identification of secondary and tertiary motifs in DNA sequences through naive Bayesian text classification. [D] . Villalobos, Rodney V. 2007

机译：通过朴素的贝叶斯文本分类识别DNA序列中的二级和三级基序。
6. Development of a clinical decision support system using genetic algorithms and Bayesian classification for improving the personalised management of women attending a colposcopy room [O] . Panagiotis Bountris, Elena Topaka, Abraham Pouliakis, 2016

机译：使用遗传算法和贝叶斯分类法开发临床决策支持系统以改善参加阴道镜检查室的女性的个性化管理
7. Improving the Classification Accuracy Using Unlabeled Data: A Naive Bayesian Case [O] . Chang-Hwan Lee 2006

机译：使用未标记数据提高分类准确性：天真的贝叶斯案
8. Privacy-Preserving Naive Bayesian Classification [R] . Zhan, Z. , Chang, L. , Matwin, S. 2004

机译：隐私保护朴素贝叶斯分类

An Improved Naive Bayesian Classification Algorithm for Massive Data

摘要

著录项

相似文献

相关主题

期刊订阅