首页> 外文会议>IEEE International Symposium on Multimedia >Detection of Inconsistency Between Subject and Speaker Based on the Co-occurrence of Lip Motion and Voice Towards Speech Scene Extraction from News Videos
【24h】

Detection of Inconsistency Between Subject and Speaker Based on the Co-occurrence of Lip Motion and Voice Towards Speech Scene Extraction from News Videos

机译:根据新闻视频的唇观运动和语音提取的唇观运动和语音的共同发生,检测主题与扬声器的不一致

获取原文

摘要

We propose a method to detect the inconsistency between a subject and the speaker for extracting speech scenes from news videos. Speech scenes in news videos contain a wealth of multimedia information, and are valuable as archived material. In order to extract speech scenes from news videos, there is an approach that uses the position and size of a face region. However, it is difficult to extract them with only such approach, since news videos contain non-speech scenes where the speaker is not the subject, such as narrated scenes. To solve this problem, we propose a method to discriminate between speech scenes and narrated scenes based on the co-occurrence between a subject's lip motion and the speaker's voice. The proposed method uses lip shape and degree of lip opening as visual features representing a subject's lip motion, and uses voice volume and phoneme as audio feature representing a speaker's voice. Then, the proposed method discriminates between speech scenes and narrated scenes based on the correlations of these features. We report the results of experiments on videos captured in a laboratory condition and also on actual broadcast news videos. Their results showed the effectiveness of our method and the feasibility of our research goal.
机译:我们提出了一种检测主题与扬声器之间不一致的方法,用于从新闻视频中提取语音场景。新闻视频中的语音场景包含丰富的多媒体信息,并且有价值作为归档材料。为了从新闻视频中提取语音场景,存在一种方法,它使用面部区域的位置和大小。然而,只有这样的方法很难提取它们,因为新闻视频包含扬声器不是主题的非语音场景,例如叙述场景。为了解决这个问题,我们提出了一种方法来基于受试者的唇部运动和扬声器的声音之间的共同发生来区分语音场景和叙述场景。该方法使用唇部形状和唇部开口,作为代表受试者的唇部运动的视觉特征,并使用语音卷和音素作为表示扬声器的声音的音频特征。然后,所提出的方法基于这些特征的相关性来判断语音场景和叙述场景。我们报告了在实验室条件中捕获的视频的实验结果以及实际广播新闻视频。他们的结果表明了我们的方法的有效性和我们的研究目标的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号