首页> 外国专利> IMAGE MANIPULATION BY TEXT INSTRUCTION

IMAGE MANIPULATION BY TEXT INSTRUCTION

机译：图像操作文本指令

页面导航

摘要
著录项
相似文献

摘要

A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

机译：描述从输入图像生成输出图像的方法和指定使用神经网络应用于应用于输入图像的编辑的位置和修改的输入文本指令。神经网络包括图像编码器，图像解码器和指令注意网络。该方法包括接收输入图像和输入文本指令; 从输入图像中提取输入图像特征，该输入图像特征表示使用图像编码器表示输入图像的特征; 使用指令注意网络从输入文本指令生成空间特征和修改功能; 从输入图像功能，空间特征和修改功能生成编辑的图像特征; 并使用图像解码器从编辑图像特征生成输出图像。

著录项

公开/公告号US2021383584A1

专利类型
公开/公告日2021-12-09

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号US202117340671
发明设计人 TIANHAO ZHANG;WEILONG YANG;HONGLAK LEE;HUNG-YU TSENG;IRFAN AZIZ ESSA;LU JIANG;
展开▼

申请日2021-06-07
分类号G06T11/60;G06T3;G06T3/40;G06N3/08;G06N3/04;
国家 US
入库时间 2022-08-24 22:42:42

相似文献

专利
外文文献
中文文献