首页> 外文会议>International Conference on Data Management, Analytics and Innovation >GRNN++: A Parallel and Distributed Version of GRNN Under Apache Spark for Big Data Regression
【24h】

GRNN++: A Parallel and Distributed Version of GRNN Under Apache Spark for Big Data Regression

机译:GRNN ++:在Apache Spark下的Grnn下的并行和分布式版本,用于大数据回归

获取原文

摘要

Among the neural network architectures for prediction, multi-layer perceptron (MLP), radial basis function (RBF), wavelet neural network (WNN), general regression neural network (GRNN), and group method of data handling (GMDH) are popular. Out of these architectures, GRNN is preferable because it involves single-pass learning and produces reasonably good results. Although GRNN involves single-pass learning, it cannot handle big datasets because a pattern layer is required to store all the cluster centers after clustering all the samples. Therefore, this paper proposes a hybrid architecture, GRNN++, which makes GRNN scalable for big data by invoking a parallel distributed version of K-means++, namely, K-means||, in the pattern layer of GRNN. The whole architecture is implemented in the distributed parallel computational architecture of Apache Spark with HDFS. The performance of the GRNN++ was measured on gas sensor dataset which has 613 MB of data under a ten-fold cross-validation setup. The proposed GRNN++ produces very low mean squared error (MSE). It is worthwhile to mention that the primary motivation of this article is to present a distributed and parallel version of the traditional GRNN.
机译:在用于预测的神经网络架构中,多层的Perceptron(MLP),径向基函数(RBF),小波神经网络(WNN),一般回归神经网络(GRNN)以及数据处理(GMDH)的组方法是流行的。出于这些架构中,GRNN是优选的,因为它涉及单通过学习并产生合理的效果。虽然GRNN涉及单通过学习,但它无法处理大数据集,因为在群集所有样本之后需要图案层来存储所有群集中心。因此,本文提出了一种混合架构GRNN ++,它通过调用GRNN的图案层,通过调用K-means ++的并行分布式版本,即K-means ||,使GRNN释放。整个架构是用HDFS的Apache Spark的分布式并行计算架构实现。在气体传感器数据集上测量Grnn ++的性能,在十倍交叉验证设置下具有613 MB的数据。所提出的Grnn ++产生非常低的平均平方误差(MSE)。值得一提的是,本文的主要动机是呈现传统GRNN的分布式和并行版本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号