首页> 外国专利> Using machine-learning models to determine movements of a mouth corresponding to live speech

Using machine-learning models to determine movements of a mouth corresponding to live speech

机译：使用机器学习模型来确定与实时语音相对应的嘴巴运动

页面导航

摘要
著录项
相似文献

摘要

Disclosed systems and methods predict visemes from an audio sequence. A viseme-generation application accesses a first set of training data that includes a first audio sequence representing a sentence spoken by a first speaker and a sequence of visemes. Each viseme is mapped to a respective audio sample of the first audio sequence. The viseme-generation application creates a second set of training data adjusting a second audio sequence spoken by a second speaker speaking the sentence such that the second and first sequences have the same length and at least one phoneme occurs at the same time stamp in the first sequence and in the second sequence. The viseme-generation application maps the sequence of visemes to the second audio sequence and trains a viseme prediction model to predict a sequence of visemes from an audio sequence.

机译：公开的系统和方法根据音频序列预测视位素。视位生成应用程序访问第一组训练数据，该数据包括代表第一说话者说出的句子的第一音频序列和视位序列。每个视位素都映射到第一音频序列的相应音频样本。视位生成应用程序创建第二组训练数据，以调整第二说话者说出句子的第二音频序列，使得第二序列和第一序列具有相同的长度，并且至少一个音素出现在第一序列的相同时间戳上顺序和第二个顺序。视位生成应用将视位序列映射到第二音频序列，并训练视位预测模型以根据音频序列预测视位序列。

著录项

公开/公告号GB2574920B

专利类型
公开/公告日2020-10-14

原文格式PDF
申请/专利权人 ADOBE INC.;
展开▼

申请/专利号GB20190003967
发明设计人 WILMOT LI;JOVAN POPOVIC;DEEPALI ANEJA;DAVID SIMONS;
展开▼

申请日2019-03-22
分类号G10L21/10;G06T13/40;G10L25/57;
国家 GB
入库时间 2022-08-21 10:59:51

相似文献

专利
外文文献
中文文献