Using Audio, Visual, and Lexical Features in a Multi-modal Virtual Meeting Director

Abstract

Multi-modal recordings of meetings provide the basis for meeting browsing and for remote meetings. However, it is often not useful to store or transmit all visual channels. In this work, we show how a virtual meeting director selects one of seven possible video modes. We then present several audio, visual, and lexical features for a virtual director. In an experimental section, we evaluate the features, their influence on the camera selection, and the properties of the generated video stream. The chosen features all allow real-time or near-real-time processing and can therefore be applied not only to offline browsing but also to a remote meeting assistant.

Bibliographic record

Source: Machine Learning for Multimodal Interaction, 2006, pp. 63-74 (12 pages)
Conference location: Bethesda, MD (US)
Authors: Marc Al-Hames; Benedikt Hörnler; Christoph Scheuermann; Gerhard Rigoll
Author affiliation (all four authors): Institute for Human-Machine-Communication, Technische Universitaet Muenchen, Arcisstr. 21, 80290 Munich, Germany
Original format: PDF
Language: English
Chinese Library Classification: programming languages, algorithmic languages

Similar literature

1. Lexical access versus lexical decision processes for auditory, visual, and audiovisual items: Insights from behavioral and neural measures [J]. Lopez Zunini Rocio A., Baart Martijn, Samuel Arthur G. Neuropsychologia, 2020.
2. Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria [J]. Yuki Denda, Takanobu Nishiura, Yoichi Yamashita. IEICE Transactions on Information and Systems, 2008, No. 3.
3. A Temporal Dependency Based Multi-modal Active Learning Approach for Audiovisual Event Detection [J]. Patrick Thiam, Sascha Meudt, Günther Palm. Neural Processing Letters, 2018, No. 2.
4. Using Audio, Visual, and Lexical Features in a Multi-modal Virtual Meeting Director [C]. Marc Al-Hames, Benedikt Hörnler, Christoph Scheuermann. International Workshop on Machine Learning for Multimodal Interaction, 2006.
5. The Present and Future Functions of the Public School District Media Director: Comparative Perceptions of Professors of Media Education, State Education Media Administrators, Indiana District Media Directors and Their Immediate Administrative Supervisors (Audio-Visual, Libraries) [D]. Held, Frederick William. 1986.
6. Auditory and Visual Lexical Neighborhoods in Audiovisual Speech Perception [O]. Nancy Tye-Murray, Mitchell Sommers, Brent Spehar. 2007.
7. Using audio, visual, and lexical features in a multi-modal virtual meeting director [O]. Marc Al-Hames, Benedikt Hörnler, Christoph Scheuermann. 2013.
