Using machine-learning models to determine movements of a mouth corresponding to live speech

Abstract

Disclosed systems and methods predict visemes from an audio sequence. In an example, a viseme-generation application accesses a first audio sequence that is mapped to a sequence of visemes. The first audio sequence has a first length and represents phonemes. The application adjusts a second length of a second audio sequence such that the second length equals the first length and represents the phonemes. The application adjusts the sequence of visemes to the second audio sequence such that phonemes in the second audio sequence correspond to the phonemes in the first audio sequence. The application trains a machine-learning model with the second audio sequence and the sequence of visemes. The machine-learning model predicts an additional sequence of visemes based on an additional sequence of audio.
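The augmentation step described in the abstract can be sketched as follows. The abstract does not say how the second audio sequence is stretched or compressed, so this sketch assumes dynamic time warping (DTW) over a single per-frame feature value. The function names `dtw_path`, `warp_to_reference`, and `make_training_pairs`, the scalar features, and the viseme labels are all hypothetical, chosen for illustration only.

```python
from typing import List, Sequence, Tuple


def dtw_path(a: Sequence[float], b: Sequence[float]) -> List[Tuple[int, int]]:
    """Return a minimum-cost DTW alignment path between two 1-D feature sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the cheapest cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the final cell to recover the aligned index pairs.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(steps)
    path.reverse()
    return path


def warp_to_reference(reference: Sequence[float], other: Sequence[float]) -> List[float]:
    """Resample `other` so it has one frame per frame of `reference`."""
    warped = [None] * len(reference)
    for i, j in dtw_path(reference, other):
        if warped[i] is None:  # keep the first frame of `other` aligned to frame i
            warped[i] = other[j]
    return warped


def make_training_pairs(reference, visemes, other):
    """Pair each warped frame of `other` with the viseme labelled on `reference`."""
    if len(reference) != len(visemes):
        raise ValueError("reference audio and viseme labels must have equal length")
    return list(zip(warp_to_reference(reference, other), visemes))


# Assumption: one scalar feature per audio frame; real systems would use vectors.
first_audio = [0.0, 0.0, 1.0, 1.0, 0.0]
first_visemes = ["sil", "sil", "Ah", "Ah", "sil"]
second_audio = [0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0]  # same phonemes, different timing

pairs = make_training_pairs(first_audio, first_visemes, second_audio)
```

After warping, the second sequence has the same number of frames as the first, so the viseme labels annotated on the first recording can be reused frame by frame as training targets for the second.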

Bibliographic details

Publication number: US11211060B2

Patent type:
Publication date: 2021-12-28

Original format: PDF
Applicant/Assignee: ADOBE INC.

Application number: US202016887418
Inventors: WILMOT LI; JOVAN POPOVIC; DEEPALI ANEJA; DAVID SIMONS

Filing date: 2020-05-29
Classification: G10L15/197; G06N3/04; G06N3/08; G10L15/02; G10L15/06; G10L21/0316; G10L25/21; G10L25/24
Country: US
Database entry time: 2022-08-24 23:04:10
