Real-Time Audio-Visual Analysis for Multiperson Videoconferencing

Petr Motlicek; Stefan Duffner; Danil Korchagin; Hervé Bourlard; Carl Scheffler; Jean-Marc Odobez; Giovanni Del Galdo; Markus Kallinger; Oliver Thiergart

Abstract

We describe the design of a system consisting of several state-of-the-art real-time audio and video processing components that enable multimodal stream manipulation (e.g., automatic online editing for multiparty videoconferencing applications) in open, unconstrained environments. The underlying algorithms are designed to allow multiple people to enter, interact in, and leave the observable scene with no constraints. They comprise continuous localisation of audio objects and its application to spatial audio object coding; detection and tracking of faces; estimation of head poses and visual focus of attention; detection and localisation of verbal and paralinguistic events; and the association and fusion of these different events. Combined, they represent multimodal streams with audio objects and semantic video objects and provide semantic information for stream manipulation systems (such as a virtual director). Various experiments have been performed to evaluate the performance of the system. The results demonstrate the effectiveness of the proposed design and of the various algorithms, as well as the benefit of fusing different modalities in this scenario.
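The association of audio and visual events mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' method: it assumes, purely for illustration, that the audio front end reports a single direction-of-arrival angle and that each tracked face has a horizontal angle in the same coordinate frame. The names `Face`, `angular_distance`, `associate_speaker`, and the 15-degree threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Face:
    """A tracked face (hypothetical structure, not from the paper)."""
    track_id: int
    azimuth_deg: float  # horizontal angle of the face, in degrees


def angular_distance(a_deg: float, b_deg: float) -> float:
    """Smallest absolute difference between two angles, in degrees."""
    diff = (a_deg - b_deg + 180.0) % 360.0 - 180.0
    return abs(diff)


def associate_speaker(
    audio_azimuth_deg: float,
    faces: Sequence[Face],
    max_gap_deg: float = 15.0,  # assumed tolerance, chosen for illustration
) -> Optional[Face]:
    """Return the face closest in angle to the sound source.

    Returns None when no face lies within max_gap_deg, for example when
    the speaker is outside the camera's field of view.
    """
    best: Optional[Face] = None
    best_gap = max_gap_deg
    for face in faces:
        gap = angular_distance(audio_azimuth_deg, face.azimuth_deg)
        if gap <= best_gap:
            best, best_gap = face, gap
    return best


faces = [Face(1, -30.0), Face(2, 10.0), Face(3, 40.0)]
speaker = associate_speaker(12.0, faces)
off_screen = associate_speaker(90.0, faces)
```

Here `speaker` is the face with `track_id` 2 and `off_screen` is `None`. A real system would more likely fuse these cues probabilistically over time rather than with a hard nearest-angle rule.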

Bibliographic details

Source: Advances in Multimedia, 2013, Issue 1, 21 pages
Original format: PDF
Chinese Library Classification: Computing technology, computer technology

