首页> 外文期刊>Pattern recognition letters >Deep generative video prediction
【24h】

Deep generative video prediction

机译:深度生成视频预测

获取原文
获取原文并翻译 | 示例
       

摘要

Video prediction plays a fundamental role in video analysis and pattern recognition. However, the generated future frames are often blurred, which are not sufficient for further research. To overcome this obstacle, this paper proposes a new deep generative video prediction network under the framework of generative adversarial nets. The network consists of three components: a motion encoder, a frame generator and a frame discriminator. The motion encoder receives multiple frame differences (also known as Eulenan motion) as input and outputs a global video motion representation. The frame generator is a pseudo-reverse two-stream network to generate the future frame. The frame discriminator is a discriminative 3D convolution network to determine whether the given frame is derived from the true future frame distribution or not. The frame generator and frame discriminator train jointly in an adversarial manner until a Nash equilibrium. Motivated by theories on color filter array, this paper also designs a novel cross channel color gradient (3CG) loss as a guidance of deblurring. Experiments on two state-of-the-art data sets demonstrate that the proposed network is promising. (C) 2018 Elsevier B.V. All rights reserved.
机译:视频预测在视频分析和模式识别中起着基本作用。但是,生成的未来框架通常是模糊的,不足以进一步研究。为了克服这一障碍,本文在生成对抗网络的框架下提出了一种新的深度生成视频预测网络。该网络由三部分组成:运动编码器,帧发生器和帧鉴别器。运动编码器接收多个帧差异(也称为Eulenan运动)作为输入,并输出全局视频运动表示。帧生成器是伪反向两流网络,用于生成将来的帧。帧鉴别器是一个判别性3D卷积网络,用于确定给定帧是否来自真实的未来帧分布。帧生成器和帧鉴别器以对抗的方式联合训练,直到达到纳什均衡。受滤色器阵列理论的启发,本文还设计了一种新颖的跨通道色梯度(3CG)损耗作为去模糊的指导。在两个最先进的数据集上进行的实验表明,提出的网络很有希望。 (C)2018 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Pattern recognition letters》 |2018年第15期|58-65|共8页
  • 作者单位

    Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

    Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

    Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

    Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

    Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Video prediction; Two stream; Adversarial training; Convlstm;

    机译:视频预测;两流;专业训练;Convlstm;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号