首页> 外文期刊>Journal of electronic imaging >Improved classification approach for use with large-scale scene images in the Hadoop cluster environment
【24h】

Improved classification approach for use with large-scale scene images in the Hadoop cluster environment

机译:改进的分类方法,可用于Hadoop集群环境中的大规模场景图像

获取原文
获取原文并翻译 | 示例
           

摘要

Faced with massive amounts of image data, the performance of classification algorithms based on traditional platforms with single-node architecture drops dramatically. We propose a classification method based on hybrid optimization and combination technology in a cluster environment that is suitable for use with large-scale scene images. Support vector machine (SVM) algorithms are optimized by the artificial bee colony and particle swarm optimization algorithms to produce weak classifiers; then, a strong classifier is constructed by combining the outputs from the 15 weak classifiers using the AdaBoost algorithm. The MapReduce parallel programming model in the Hadoop platform is used to parallelize the algorithm, and a parallel AdaBoost hybrid optimization (PAH)-SVM algorithm is proposed. Finally, a model is constructed for automatic classification of the large-scale scene images. Multiple sets of comparative experiments show that the average classification accuracy of the proposed algorithm when applied to the scene understanding (Caltech-256 and Pascal VOC 2012) database exceeds 85.0%, and its training time is 10 min when 170,000 images are used. Considering the cost of hardware, the execution time and accuracy of this algorithm are superior to those of mainstream classification algorithms, such as P-SVM and CNN. In addition, the speed of the system based on the proposed algorithm increases linearly, and the constructed Hadoop cluster shows good extensibility. The proposed algorithm is suitable for automatic classification and prediction using large-scale scene images. (C) 2018 SPIE and IS&T
机译:面对大量的图像数据,基于具有单节点体系结构的传统平台的分类算法的性能急剧下降。我们提出了一种在群集环境中基于混合优化和组合技术的分类方法,该方法适用于大型场景图像。支持向量机(SVM)算法通过人工蜂群和粒子群优化算法进行优化,以生成弱分类器;然后,通过使用AdaBoost算法将15个弱分类器的输出进行组合来构造一个强分类器。利用Hadoop平台中的MapReduce并行编程模型对该算法进行并行化处理,提出了并行的AdaBoost混合优化(PAH)-SVM算法。最后,构建了一个模型,用于对大型场景图像进行自动分类。多组对比实验表明,该算法应用于场景理解(Caltech-256和Pascal VOC 2012)数据库时,平均分类准确率超过85.0%,使用170,000张图像时,其训练时间小于10分钟。考虑到硬件成本,该算法的执行时间和准确性优于P-SVM和CNN等主流分类算法。此外,基于该算法的系统速度呈线性增长,构建的Hadoop集群具有良好的可扩展性。所提出的算法适用于使用大规模场景图像的自动分类和预测。 (C)2018 SPIE和IS&T

著录项

  • 来源
    《Journal of electronic imaging》 |2018年第6期|063027.1-063027.12|共12页
  • 作者单位

    Xinzhou Teachers Univ, Dept Comp Sci & Technol, Xinzhou, Peoples R China|Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan, Shanxi, Peoples R China;

    Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan, Shanxi, Peoples R China;

    Xinzhou Teachers Univ, Dept Comp Sci & Technol, Xinzhou, Peoples R China;

    Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan, Shanxi, Peoples R China;

    Xinzhou Teachers Univ, Dept Comp Sci & Technol, Xinzhou, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    hybrid optimization; AdaBoost algorithm; cluster environments; MapReduce;

    机译:混合优化;AdaBoost算法;集群环境;MapReduce;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号