首页> 外文会议>Advances in multimedia information processing - PCM 2009 >Persian Viseme Classification for Developing Visual Speech Training Application
【24h】

Persian Viseme Classification for Developing Visual Speech Training Application

机译:波斯语视位分类在视觉语音训练中的应用

获取原文
获取原文并翻译 | 示例

摘要

Viseme classification and analysis in every language is among the most important preliminaries for conducting various multimedia researches as talking head, lip reading, lip synchronization and computer assisted pronunciation training applications. Viseme classification and analysis is language dependent. For that reason, in different languages and based on the target applications, visemes of a language are classified. Up to date, there has been no such research in Persian language, in that it makes it rather impossible for researches to be conducted in AVSR system or lip synchronization. In this paper, we propose a novel method adopting an image-based approach for grouping visemes in Persian language considering coarticulation effect. For each phoneme, the central frame is selected in several images representing different positions in various syllables. Having obtained eigenlips of each phoneme, we project each viseme on another viseme's eigenspace. Then the weight value as a result of reconstruction is set as the criterion for comparing viseme similarity. The experimental results indicate an ideal precision and robustness of the proposed algorithm.
机译:Viseme的每种语言分类和分析是进行各种多媒体研究的最重要的前提之一,这些研究包括说话头,嘴唇读取,嘴唇同步和计算机辅助发音训练应用。 Viseme的分类和分析取决于语言。因此,基于目标应用程序,以不同的语言来分类语言的视位素。迄今为止,还没有用波斯语进行过这样的研究,因为这使得不可能在AVSR系统或口型同步中进行研究。在本文中,我们提出了一种新的方法,该方法采用基于图像的方法来考虑波斯语的协同发音效果,从而对波斯语中的语音素进行分组。对于每个音素,在代表各个音节中不同位置的几幅图像中选择中心框架。获得每个音素的特征唇之后,我们将每个视素投影到另一个视位的特征空间上。然后,将作为重构结果的权重值设置为比较视位素相似度的标准。实验结果表明该算法具有理想的精度和鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号