首页> 外文会议>Molecular Electronic Devices >Multi-modal audio-visual event recognition for football analysis
【24h】

Multi-modal audio-visual event recognition for football analysis

机译:用于足球分析的多模式视听事件识别

获取原文
获取原文并翻译 | 示例

摘要

The recognition of events within multi-modal data is a challenging problem. In this paper we focus on the recognition of events by using both audio and video data. We investigate the use of data fusion techniques in order to recognise these sequences within the framework of hidden Markov models (HMM) used to model audio and video data sequences. Specifically we look at the recognition of play and break sequences in football and the segmentation of football games based on these two events. Recognising relatively simple semantic events such as this is an important step towards full automatic indexing of such video material. These experiments were done using approximately 3 hours of data from two games of the Euro96 competition. We propose that modelling the audio and video streams separately for each sequence and fusing the decisions from each stream should yield an accurate and robust method of segmenting multi-modal data.
机译:多模式数据中事件的识别是一个具有挑战性的问题。在本文中,我们专注于通过使用音频和视频数据来识别事件。我们调查了数据融合技术的使用,以便在用于对音频和视频数据序列建模的隐马尔可夫模型(HMM)的框架内识别这些序列。具体而言,我们着眼于足球中比赛顺序和休息时间的识别以及基于这两个事件的足球比赛的细分。识别诸如此类的相对简单的语义事件是朝着这种视频材料的全自动索引迈出的重要一步。这些实验是使用来自Euro96竞赛两场比赛的大约3个小时的数据完成的。我们建议针对每个序列分别对音频和视频流进行建模,并融合来自每个流的决策,应该会产生一种准确而强大的分割多模式数据的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号