拼写的标志：dall-e 2，无形图像和特征空间的种族政治

论文标题

拼写的标志：dall-e 2，无形图像和特征空间的种族政治

A Sign That Spells: DALL-E 2, Invisual Images and The Racial Politics of Feature Space

论文作者

Offert, Fabian, Phan, Thao

论文摘要

在本文中，我们研究了生成机器学习系统如何产生新的视觉文化政治。我们将重点放在DALL-E 2和相关模型上，作为通过特征提取和语义压缩的文化技术运作的新兴图像制作方法。我们认为，这些技术是不人道的，无形的和不透明的，但仍然以具有讽刺意味的一切都太人性化的悖论而被捕获：白色的一致繁殖是主要的视觉文化的潜在特征。我们使用开放的AI失败的努力来“ DEBIAS”其系统作为质疑DALL-E 2之类的系统如何解散和重建政治上显着的人类概念等关键开放。这个示例生动地说明了这一转型时刻的利益，当所谓的基础模型重新配置视觉文化的界限以及“做”反种族主义时，意味着采用快速的技术修复来减轻个人不适，或者更重要的是潜在的商业损失。

In this paper, we examine how generative machine learning systems produce a new politics of visual culture. We focus on DALL-E 2 and related models as an emergent approach to image-making that operates through the cultural techniques of feature extraction and semantic compression. These techniques, we argue, are inhuman, invisual, and opaque, yet are still caught in a paradox that is ironically all too human: the consistent reproduction of whiteness as a latent feature of dominant visual culture. We use Open AI's failed efforts to 'debias' their system as a critical opening to interrogate how systems like DALL-E 2 dissolve and reconstitute politically salient human concepts like race. This example vividly illustrates the stakes of this moment of transformation, when so-called foundation models reconfigure the boundaries of visual culture and when 'doing' anti-racism means deploying quick technical fixes to mitigate personal discomfort, or more importantly, potential commercial loss.

下载PDF全文

下载文献需遵守相关版权规定

论文标题