论文标题
拼写的标志:dall-e 2,无形图像和特征空间的种族政治
A Sign That Spells: DALL-E 2, Invisual Images and The Racial Politics of Feature Space
论文作者
论文摘要
在本文中,我们研究了生成机器学习系统如何产生新的视觉文化政治。我们将重点放在DALL-E 2和相关模型上,作为通过特征提取和语义压缩的文化技术运作的新兴图像制作方法。我们认为,这些技术是不人道的,无形的和不透明的,但仍然以具有讽刺意味的一切都太人性化的悖论而被捕获:白色的一致繁殖是主要的视觉文化的潜在特征。我们使用开放的AI失败的努力来“ DEBIAS”其系统作为质疑DALL-E 2之类的系统如何解散和重建政治上显着的人类概念等关键开放。这个示例生动地说明了这一转型时刻的利益,当所谓的基础模型重新配置视觉文化的界限以及“做”反种族主义时,意味着采用快速的技术修复来减轻个人不适,或者更重要的是潜在的商业损失。
In this paper, we examine how generative machine learning systems produce a new politics of visual culture. We focus on DALL-E 2 and related models as an emergent approach to image-making that operates through the cultural techniques of feature extraction and semantic compression. These techniques, we argue, are inhuman, invisual, and opaque, yet are still caught in a paradox that is ironically all too human: the consistent reproduction of whiteness as a latent feature of dominant visual culture. We use Open AI's failed efforts to 'debias' their system as a critical opening to interrogate how systems like DALL-E 2 dissolve and reconstitute politically salient human concepts like race. This example vividly illustrates the stakes of this moment of transformation, when so-called foundation models reconfigure the boundaries of visual culture and when 'doing' anti-racism means deploying quick technical fixes to mitigate personal discomfort, or more importantly, potential commercial loss.