首页> 外文期刊>Computers, Materials & Continua >SMK-means: An Improved Mini Batch K-means Algorithm Based on Mapreduce with Big Data
【24h】

SMK-means: An Improved Mini Batch K-means Algorithm Based on Mapreduce with Big Data

机译:SMK-means:基于大数据Mapreduce的改进的迷你批处理K-means算法

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, the rapid development of big data technology has also been favored by more and more scholars. Massive data storage and calculation problems have also been solved. At the same time, outlier detection problems in mass data have also come along with it. Therefore, more research work has been devoted to the problem of outlier detection in big data. However, the existing available methods have high computation time, the improved algorithm of outlier detection is presented, which has higher performance to detect outlier. In this paper, an improved algorithm is proposed. The SMK-means is a fusion algorithm which is achieved by Mini Batch K-means based on simulated annealing algorithm for anomalous detection of massive household electricity data, which can give the number of clusters and reduce the number of iterations and improve the accuracy of clustering. In this paper, several experiments are performed to compare and analyze multiple performances of the algorithm. Through analysis, we know that the proposed algorithm is superior to the existing algorithms.
机译:近年来,大数据技术的飞速发展也受到越来越多学者的青睐。海量数据存储和计算问题也已解决。同时,海量数据中的异常检测问题也随之而来。因此,更多的研究工作致力于大数据的异常检测问题。然而,现有的方法具有较高的计算时间,提出了一种改进的离群值检测算法,具有较高的离群值检测性能。本文提出了一种改进的算法。 SMK-means是一种融合算法,它是基于模拟退火算法的Mini Batch K-means实现的,用于异常检测大量家庭用电数据,它可以提供聚类数量,减少迭代次数,并提高聚类的准确性。在本文中,进行了一些实验以比较和分析该算法的多种性能。通过分析,我们知道该算法优于现有算法。

著录项

  • 来源
    《Computers, Materials & Continua》 |2018年第3期|365-379|共15页
  • 作者单位

    Nanjing Univ Informat Sci & Technol, Jiangsu Key Lab Atmospher Environm Monitoring & P, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Sch Environm Sci & Engn, 219 Ningliu Rd, Nanjing 210044, Jiangsu, Peoples R China;

    Nanjing Univ Informat Sci & Technol, Sch Comp & Software, 219 Ningliu Rd, Nanjing 210044, Jiangsu, Peoples R China;

    Edinburgh Napier Univ, Sch Comp, 10 Colinton Rd, Edinburgh EH10 5DT, Midlothian, Scotland;

    Edinburgh Napier Univ, Sch Comp, 10 Colinton Rd, Edinburgh EH10 5DT, Midlothian, Scotland;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Big data; outlier detection; SMK-means; Mini Batch K-means; simulated annealing;

    机译:大数据;异常检测;SMK均值;小批量K均值;模拟退火;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号