首页> 外文会议>2011 IEEE International Conference on Multimedia and Expo >Just-in-time multimodal association and fusion from home entertainment
【24h】

Just-in-time multimodal association and fusion from home entertainment

机译:即时多模式关联和家庭娱乐融合

获取原文

摘要

In this paper, we describe a real-time multimodal analysis system with just-in-time multimodal association and fusion for a living room environment, where multiple people may enter, interact and leave the observable world with no constraints. It comprises detection and tracking of up to 4 faces, detection and localisation of verbal and paralinguistic events, their association and fusion. The system is designed to be used in open, unconstrained environments like in next generation video conferencing systems that automatically “orchestrate” the transmitted video streams to improve the overall experience of interaction between spatially separated families and friends. Performance levels achieved to date on hand-labelled dataset have shown sufficient reliability at the same time as fulfilling real-time processing requirements.
机译:在本文中,我们描述了一种实时多模式分析系统,该系统具有实时多模式关联和融合功能,适用于客厅环境,其中多个人可以不受限制地进入,交互和离开可观察的世界。它包括检测和跟踪多达4个面部,检测和定位语言和副语言事件,它们的关联和融合。该系统旨在在开放,不受限制的环境中使用,例如在下一代视频会议系统中,该系统会自动“编排”所传输的视频流,以改善空间上分离的家人和朋友之间互动的整体体验。迄今为止,在手动标记的数据集上达到的性能水平已显示出足够的可靠性,同时满足了实时处理要求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号