...
首页> 外文期刊>Complexity >An Approach to Data Reduction for Learning from Big Datasets: Integrating Stacking, Rotation, and Agent Population Learning Techniques
【24h】

An Approach to Data Reduction for Learning from Big Datasets: Integrating Stacking, Rotation, and Agent Population Learning Techniques

机译:从大数据集学习的数据约简方法:集成堆叠,轮换和座席人口学习技术

获取原文
           

摘要

In the paper, several data reduction techniques for machine learning from big datasets are discussed and evaluated. The discussed approach focuses on combining several techniques including stacking, rotation, and data reduction aimed at improving the performance of the machine classification. Stacking is seen as the technique allowing to take advantage of the multiple classification models. The rotation-based techniques are used to increase the heterogeneity of the stacking ensembles. Data reduction makes it possible to classify instances belonging to big datasets. We propose to use an agent-based population learning algorithm for data reduction in the feature and instance dimensions. For diversification of the classifier ensembles within the rotation also, alternatively, principal component analysis and independent component analysis are used. The research question addressed in the paper is formulated as follows does the performance of a classifier using the reduced dataset be improved by integrating the data reduction mechanism with the rotation-based technique and the stacking?
机译:在本文中,讨论和评估了几种用于从大型数据集进行机器学习的数据约简技术。讨论的方法着重于结合多种技术,包括堆叠,旋转和数据精简,旨在提高机器分类的性能。堆叠被视为允许利用多个分类模型的技术。基于旋转的技术用于增加堆叠集成的异质性。数据减少使对大型数据集的实例进行分类成为可能。我们建议使用基于代理的种群学习算法来减少特征和实例维度中的数据。为了使旋转中的分类器集合多样化,也可以使用主成分分析和独立成分分析。本文解决的研究问题的公式如下:通过将数据约简机制与基于旋转的技术和堆叠相结合,是否可以使用简化后的数据集提高分类器的性能?

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号