首页> 外国专利> METHOD OF VISUAL VOICE RECOGNITION BY FOLLOWING-UP THE LOCAL DEFORMATIONS OF A SET OF POINTS OF INTEREST OF THE SPEAKER'S MOUTH

METHOD OF VISUAL VOICE RECOGNITION BY FOLLOWING-UP THE LOCAL DEFORMATIONS OF A SET OF POINTS OF INTEREST OF THE SPEAKER'S MOUTH

机译：跟随说话人口的兴趣点局部变形的视觉语音识别方法

页面导航

摘要
著录项
相似文献

摘要

The method comprises steps of: a) for each point of interest of each image, calculating a local gradient descriptor and a local movement descriptor; b) forming microstructures of n points of interest, each defined by a tuple of order n, with n≧1; c) determining, for each tuple of a vector of structured visual characteristics (d₀. . . d₃. . . ) based on the local descriptors; d) for each tuple, mapping this vector by a classification algorithm selecting a single codeword among a set of codewords forming a codebook (CB); e) generating an ordered time series of the codewords (a₀. . . a₃. . . ) for the successive images of the video sequence; and f) measuring, by means of a function of the String Kernel type, the similarity of the time series of codewords with another time series of codewords coming from another speaker.

机译：该方法包括以下步骤：a）对于每个图像的每个兴趣点，计算局部梯度描述符和局部运动描述符; b）形成n个关注点的微结构，每个关注点由n阶元组定义，其中n≥1; c）根据局部描述符为每个元组确定结构化视觉特征矢量（d _{0 .. d _{3 ....）; d）对于每个元组，通过分类算法将该向量映射，从而在形成码本（CB）的一组码字中选择单个码字; e）为视频序列的连续图像生成码字的有序时间序列（a _{0 .. a _{3 ..）; f）通过String Kernel类型的函数，测量码字的时间序列与来自另一个说话者的另一个码字的时间序列的相似性。}}}}

著录项

公开/公告号US2014343945A1

专利类型
公开/公告日2014-11-20

原文格式PDF
申请/专利权人 PARROT;
展开▼

申请/专利号US201414273273
发明设计人 ERIC BENHAIM;HICHEM SAHBI;
展开▼

申请日2014-05-08
分类号G10L15/18;G10L21/10;
国家 US
入库时间 2022-08-21 15:23:39

相似文献

专利
外文文献
中文文献