Annual Conference on Information Sciences and Systems

MSE-Optimal Neural Network Initialization via Layer Fusion



Abstract

Deep neural networks achieve state-of-the-art performance for a range of classification and inference tasks. However, the use of stochastic gradient descent combined with the nonconvexity of the underlying optimization problems renders parameter learning susceptible to initialization. To address this issue, a variety of methods that rely on random parameter initialization or knowledge distillation have been proposed in the past. In this paper, we propose FuseInit, a novel method to initialize shallower networks by fusing neighboring layers of deeper networks that are trained with random initialization. We develop theoretical results and efficient algorithms for mean-square error (MSE)-optimal fusion of neighboring dense-dense, convolutional-dense, and convolutional-convolutional layers. We show experiments for a range of classification and regression datasets, which suggest that deeper neural networks are less sensitive to initialization and that shallower networks can perform better (sometimes as well as their deeper counterparts) if initialized with FuseInit.
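To illustrate the idea of MSE-optimal layer fusion for the dense-dense case, here is a minimal sketch (not the paper's actual algorithm): two trained dense layers are collapsed into one by least-squares regression of the pair's output onto the input, which is MSE-optimal among single linear layers for the sampled input distribution. All layer sizes, weights, and sample data below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "deep" pair of dense layers: x -> relu(x W1 + b1) -> (.) W2 + b2.
d_in, d_hid, d_out = 8, 16, 4
W1 = rng.normal(size=(d_in, d_hid)) * 0.5
b1 = rng.normal(size=d_hid) * 0.1
W2 = rng.normal(size=(d_hid, d_out)) * 0.5
b2 = rng.normal(size=d_out) * 0.1

def two_layer(x):
    """Output of the neighboring dense-dense pair to be fused."""
    return np.maximum(x @ W1 + b1, 0.0) @ W2 + b2

# Sample inputs, then fit a single dense layer y ≈ x Wf + bf by least
# squares: the MSE-optimal linear fusion for this empirical distribution.
X = rng.normal(size=(2048, d_in))
Y = two_layer(X)
X_aug = np.hstack([X, np.ones((X.shape[0], 1))])   # append bias column
coef, *_ = np.linalg.lstsq(X_aug, Y, rcond=None)
Wf, bf = coef[:-1], coef[-1]

# The fused weights (Wf, bf) would initialize one layer of a shallower net.
Y_hat = X @ Wf + bf
mse = np.mean((Y_hat - Y) ** 2)
baseline = np.mean((Y - Y.mean(axis=0)) ** 2)      # predict-the-mean baseline
print(f"fusion MSE {mse:.4f} vs mean-predictor {baseline:.4f}")
```

Because the bias column makes the constant predictor a special case of the regression, the fused layer's MSE can never exceed the mean-predictor baseline; the residual gap reflects how much of the ReLU nonlinearity a single linear layer cannot capture.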


