Scene Graph to Image Generation with Contextualized Object Layout Refinement

Paper Title

Scene Graph to Image Generation with Contextualized Object Layout Refinement

Paper Authors

Maor Ivgi, Yaniv Benny, Avichai Ben-David, Jonathan Berant, Lior Wolf

Paper Abstract

Generating images from scene graphs is a challenging task that attracted substantial interest recently. Prior works have approached this task by generating an intermediate layout description of the target image. However, the representation of each object in the layout was generated independently, which resulted in high overlap, low coverage, and an overall blurry layout. We propose a novel method that alleviates these issues by generating the entire layout description gradually to improve inter-object dependency. We empirically show on the COCO-STUFF dataset that our approach improves the quality of both the intermediate layout and the final image. Our approach improves the layout coverage by almost 20 points and drops object overlap to negligible amounts.

Paper Title