【24h】

Aesthetic-Aware Text to Image Synthesis

机译:审美意识文本到图像合成

获取原文

摘要

Synthesizing an image from natural language description is an important task to many applications such as photo-editing, art generation, and computer aided-design. However, to synthesize an appealing image from the text, image aesthetics criteria should be maintained. In this study, we propose a new framework which first generates a set of mask maps from the input text via mask map generator (MG), and then we compute and rank the image aesthetics score for all generated mask maps via Pre-IG Aesthetic Ranking that contains two composition rules, i.e., the rule of thirds along with the rule of formal balance. At the next stage, we feed the subset of the mask maps, which are the highest, lowest, and the average aesthetic scores, to image generator (IG). The photorealistic images are ranked at the second round through, namely Post-IG Aesthetic Ranking, to determine the lowest aesthetic score and return the most appealing generated image. The experiments on COCO-stuff dataset demonstrate that our framework yields better results compared to previous text-to- image models.
机译:从自然语言描述中合成图像是许多应用程序的重要任务,例如照片编辑,艺术创作和计算机辅助设计。但是,为了从文本中合成出吸引人的图像,应该保持图像美观的标准。在这项研究中,我们提出了一个新框架,该框架首先通过蒙版贴图生成器(MG)从输入文本生成一组蒙版贴图,然后我们通过Pre-IG美学排名为所有生成的蒙版贴图计算图像美学评分并将其排名其中包含两个组成规则,即三分之二规则和正式平衡规则。在下一阶段,我们将蒙版贴图的子集(最高,最低和平均审美得分)提供给图像生成器(IG)。在第二轮中将真实感图像排名,即IG后美学排名,以确定最低的美学得分并返回最具吸引力的生成图像。在COCO-stuff数据集上的实验表明,与以前的文本到图像模型相比,我们的框架产生了更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号