...
首页> 外文期刊>Big Data Research >Efficient Machine Learning for Big Data: A Review
【24h】

Efficient Machine Learning for Big Data: A Review

机译:大数据的高效机器学习:回顾

获取原文
获取原文并翻译 | 示例
           

摘要

With the emerging technologies and all associated devices, it is predicted that massive amount of data will be created in the next few years – in fact, as much as 90% of current data were created in the last couple of years – a trend that will continue for the foreseeable future. Sustainable computing studies the process by which computer engineer/scientist designs computers and associated subsystems efficiently and effectively with minimal impact on the environment. However, current intelligent machine-learning systems are performance driven – the focus is on the predictive/classification accuracy, based on known properties learned from the training samples. For instance, most machine-learning-based nonparametric models are known to require high computational cost in order to find the global optima. With the learning task in a large dataset, the number of hidden nodes within the network will therefore increase significantly, which eventually leads to an exponential rise in computational complexity. This paper thus reviews the theoretical and experimental data-modelingliterature, in large-scale data-intensive fields, relating to: (1) model efficiency, including computational requirements in learning, and data-intensive areas’ structure and design, and introduces (2) new algorithmic approaches with the least memory requirements and processing to minimize computational cost, while maintaining/improving its predictive/classification accuracy and stability.
机译:预计随着新兴技术和所有相关设备的出现,未来几年将创建大量数据-实际上,最近两年中多达90%的当前数据已创建-这种趋势将在可预见的未来继续前进。可持续计算研究计算机工程师/科学家在不影响环境的情况下高效,有效地设计计算机和相关子系统的过程。但是,当前的智能机器学习系统是性能驱动的-基于从训练样本中学到的已知属性,重点是预测/分类准确性。例如,已知大多数基于机器学习的非参数模型都需要很高的计算成本才能找到全局最优值。在大型数据集中进行学习任务后,网络中隐藏节点的数量将因此显着增加,最终导致计算复杂度呈指数级增长。因此,本文回顾了大规模数据密集型领域的理论和实验数据建模文学,涉及以下方面:(1)模型效率,包括学习中的计算要求以及数据密集型领域的结构和设计,并介绍(2 )具有最少内存需求和处理的新算法,以最大程度地减少计算成本,同时保持/提高其预测/分类的准确性和稳定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号