Facial expression GAN for voice-driven face generation

Fang Zheng; Liu Zhen; Liu Tingting; Hung Chih-Chieh; Xiao Jiangjian; Feng Guangjin


Abstract

Cross-modal audiovisual generation is an emerging topic in machine learning. Voice-to-face generation, which aims to generate faces from human voice clips, is one of its most popular research branches. Most recent work on voice-to-face generation does not take emotion information into account. However, it is widely observed that expressions are key face attributes for reconstructing sharper and more discriminative faces. In this paper, we propose a novel facial expression GAN (FE-GAN) that takes emotion and expression into account in face generation. To achieve this goal, we use two auxiliary classifiers to learn emotion and identity representations, respectively, across the different modalities. Moreover, we design two discriminators, each focusing on a different aspect of the faces, to measure the semantic relevance of identity and emotion during generation. The triple loss is designed to make FE-GAN robust to voice variety and to keep the two modalities balanced. Extensive experiments on two real datasets demonstrate the effectiveness of FE-GAN both quantitatively and qualitatively. The experimental results show that FE-GAN not only outperforms previous models in terms of FID and IS values, but also generates more realistic face images.
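The abstract reports results in terms of IS (Inception Score). As a reading aid, the following is a minimal sketch of the standard definition, IS = exp(E_x[KL(p(y|x) || p(y))]), assuming the per-image class probabilities from a pretrained classifier are already available. The function name `inception_score` and the toy inputs are illustrative assumptions; this is not the authors' evaluation code.

```python
import math


def inception_score(probs):
    """Inception Score from per-image class probabilities.

    probs: list of rows, each row a distribution p(y|x) over classes.
    Returns exp(mean over images of KL(p(y|x) || p(y))).
    """
    n = len(probs)
    k = len(probs[0])
    # Marginal class distribution p(y), averaged over all images.
    marginal = [sum(row[j] for row in probs) / n for j in range(k)]
    total_kl = 0.0
    for row in probs:
        # KL(p(y|x) || p(y)); zero-probability terms contribute nothing.
        total_kl += sum(
            p * math.log(p / q) for p, q in zip(row, marginal) if p > 0
        )
    return math.exp(total_kl / n)


# Hypothetical toy inputs, not results from the paper.
confident = [[1.0, 0.0], [0.0, 1.0]]      # sharp and diverse predictions
uninformative = [[0.5, 0.5], [0.5, 0.5]]  # every image looks the same
print(inception_score(confident))
print(inception_score(uninformative))
```

In this toy case the confident, diverse predictions score close to 2 (the number of classes) and the uninformative ones score 1, which is why a higher IS is read as sharper and more varied generated images.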

Bibliographic details

Source
The Visual Computer | 2022, Issue 3 | pp. 1151-1164 | 14 pages
Authors
Fang Zheng; Liu Zhen; Liu Tingting; Hung Chih-Chieh; Xiao Jiangjian; Feng Guangjin
Author affiliations

Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo, Peoples R China;

Ningbo Univ, Coll Sci & Technol, Ningbo, Peoples R China;

Natl Chung Hsing Univ, Taichung, Taiwan;

Chinese Acad Sci, Ningbo Inst Mat, Ningbo, Peoples R China;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification:
Keywords
Expression reconstruction; Cross-model generation; Voice-to-face generation; Generative adversarial networks;
