专辑封面艺术图像生成具有生成对抗网络

论文标题

专辑封面艺术图像生成具有生成对抗网络

Album cover art image generation with Generative Adversarial Networks

论文作者

Stoppa, Felipe Perez, Vidaña-Vila, Ester, Navarro, Joan

论文摘要

Goodfellow在2014年引入了生成的对抗网络（GAN），此后已在构建生成人工智能模型中流行。但是，此类网络的缺点很多，例如较长的训练时间，对超参数调整的敏感性，几种类型的损失和优化功能以及其他困难（例如模式崩溃）。当前的gan应用包括生成光真实的人脸，动物和物体。但是，我想通过使用现有模型并向它们学习，从而更详细地探索甘恩的艺术能力。该论文涵盖了神经网络的基础知识，并致力于gan的特定方面，以及对现有可用模型的实验和修改，从最不复杂到大多数。目的是查看最先进的gan（特别是stylegan2）是否可以产生专辑封面，以及是否可以通过类型来量身定制它们。首先将自己熟悉现有的3个木架体系结构，包括The The Stylegan的状态。 stylegan2代码用于训练一个模型，其中包含80k专辑封面图像的数据集，然后通过选择精选的图像和混合样式来样式图像。

Generative Adversarial Networks (GANs) were introduced by Goodfellow in 2014, and since then have become popular for constructing generative artificial intelligence models. However, the drawbacks of such networks are numerous, like their longer training times, their sensitivity to hyperparameter tuning, several types of loss and optimization functions and other difficulties like mode collapse. Current applications of GANs include generating photo-realistic human faces, animals and objects. However, I wanted to explore the artistic ability of GANs in more detail, by using existing models and learning from them. This dissertation covers the basics of neural networks and works its way up to the particular aspects of GANs, together with experimentation and modification of existing available models, from least complex to most. The intention is to see if state of the art GANs (specifically StyleGAN2) can generate album art covers and if it is possible to tailor them by genre. This was attempted by first familiarizing myself with 3 existing GANs architectures, including the state of the art StyleGAN2. The StyleGAN2 code was used to train a model with a dataset containing 80K album cover images, then used to style images by picking curated images and mixing their styles.

下载PDF全文

下载文献需遵守相关版权规定

论文标题