...
首页> 外文期刊>Future Internet >Multimodal Deep Learning for Group Activity Recognition in Smart Office Environments
【24h】

Multimodal Deep Learning for Group Activity Recognition in Smart Office Environments

机译:智能办公环境中群体活动识别的多模式深度学习

获取原文
           

摘要

Deep learning (DL) models have emerged in recent years as the state-of-the-art technique across numerous machine learning application domains. In particular, image processing-related tasks have seen a significant improvement in terms of performance due to increased availability of large datasets and extensive growth of computing power. In this paper we investigate the problem of group activity recognition in office environments using a multimodal deep learning approach, by fusing audio and visual data from video. Group activity recognition is a complex classification task, given that it extends beyond identifying the activities of individuals, by focusing on the combinations of activities and the interactions between them. The proposed fusion network was trained based on the audio–visual stream from the AMI Corpus dataset. The procedure consists of two steps. First, we extract a joint audio–visual feature representation for activity recognition, and second, we account for the temporal dependencies in the video in order to complete the classification task. We provide a comprehensive set of experimental results showing that our proposed multimodal deep network architecture outperforms previous approaches, which have been designed for unimodal analysis, on the aforementioned AMI dataset.
机译:近年来,深度学习(DL)模型是众多机器学习应用领域的最先进技术。特别地,由于大量数据集的可用性和计算能力的广泛增长,图像处理相关任务在性能方面已经显着改进。在本文中,我们通过从视频中融合音频和视觉数据,调查办公室环境中的群体活动识别问题。小组活动识别是一个复杂的分类任务,鉴于它超出了识别个人的活动,专注于活动的组合和它们之间的相互作用。基于来自AMI语料库数据集的视听流培训所提出的融合网络。该过程由两个步骤组成。首先,我们提取用于活动识别的联合视听功能表示,而第二个,我们考虑了视频中的时间依赖性,以完成分类任务。我们提供了一整套实验结果,表明我们所提出的多模式深网络架构优于前面的方法,这些方法在上述AMI数据集上被设计用于单峰分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号