首页> 外文会议>Machine learning for multimodal interaction >Using Audio, Visual, and Lexical Features in a Multi-modal Virtual Meeting Director
【24h】

Using Audio, Visual, and Lexical Features in a Multi-modal Virtual Meeting Director

机译:在多模式虚拟会议主管中使用音频,视觉和词汇功能

获取原文
获取原文并翻译 | 示例

摘要

Multi-modal recordings of meetings provide the basis for meeting browsing and for remote meetings. However it is often not useful to store or transmit all visual channels. In this work we show how a virtual meeting director selects one of seven possible video modes. We then present several audio, visual, and lexical features for a virtual director. In an experimental section we evaluate the features, their influence on the camera selection, and the properties of the generated video stream. The chosen features all allow a real- or near real-time processing and can therefore not only be applied to offline browsing, but also for a remote meeting assistant.
机译:会议的多模式记录为会议浏览和远程会议提供了基础。但是,存储或传输所有可视通道通常没有用。在这项工作中,我们展示了虚拟会议主管如何选择七个可能的视频模式之一。然后,我们介绍虚拟导演的几种音频,视觉和词汇功能。在实验部分,我们评估功能,它们对摄像机选择的影响以及生成的视频流的属性。所选功能均允许实时或近实时处理,因此不仅可以应用于脱机浏览,还可以用于远程会议助手。

著录项

  • 来源
  • 会议地点 Bethesda MD(US);Bethesda MD(US)
  • 作者单位

    Institute for Human-Machine-Communication, Technische Universitaet Muenchen Arcisstr. 21, 80290 Munich, Germany;

    Institute for Human-Machine-Communication, Technische Universitaet Muenchen Arcisstr. 21, 80290 Munich, Germany;

    Institute for Human-Machine-Communication, Technische Universitaet Muenchen Arcisstr. 21, 80290 Munich, Germany;

    Institute for Human-Machine-Communication, Technische Universitaet Muenchen Arcisstr. 21, 80290 Munich, Germany;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 程序语言、算法语言;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号