Datamodels：从培训数据中预测预测

论文标题

Datamodels：从培训数据中预测预测

Datamodels: Predicting Predictions from Training Data

论文作者

Ilyas, Andrew, Park, Sung Min, Engstrom, Logan, Leclerc, Guillaume, Madry, Aleksander

论文摘要

我们提出了一个概念框架，即数据编码，用于分析培训数据的模型类的行为。 For any fixed "target" example $x$, training set $S$, and learning algorithm, a datamodel is a parameterized function $2^S \to \mathbb{R}$ that for any subset of $S' \subset S$ -- using only information about which examples of $S$ are contained in $S'$ -- predicts the outcome of training a model on $S'$ and evaluating on $x$.尽管近似基础过程的潜在复杂性（例如，对深神经网络的端到端训练和评估），但我们表明，即使是简单的线性数据磁模型也可以成功预测模型输出。然后，我们证明数据模型会产生各种应用程序，例如：准确预测数据集反事实的效果；确定脆弱的预测；找到语义上相似的例子；量化火车测试泄漏；并将数据嵌入一个富有特征且功能丰富的表示空间中。本文的数据（包括预先计算的数据模型以及来自400万培训的深神经网络的原始预测），请访问https://github.com/madrylab/datamodels-data。

We present a conceptual framework, datamodeling, for analyzing the behavior of a model class in terms of the training data. For any fixed "target" example $x$, training set $S$, and learning algorithm, a datamodel is a parameterized function $2^S \to \mathbb{R}$ that for any subset of $S' \subset S$ -- using only information about which examples of $S$ are contained in $S'$ -- predicts the outcome of training a model on $S'$ and evaluating on $x$. Despite the potential complexity of the underlying process being approximated (e.g., end-to-end training and evaluation of deep neural networks), we show that even simple linear datamodels can successfully predict model outputs. We then demonstrate that datamodels give rise to a variety of applications, such as: accurately predicting the effect of dataset counterfactuals; identifying brittle predictions; finding semantically similar examples; quantifying train-test leakage; and embedding data into a well-behaved and feature-rich representation space. Data for this paper (including pre-computed datamodels as well as raw predictions from four million trained deep neural networks) is available at https://github.com/MadryLab/datamodels-data .

下载PDF全文

下载文献需遵守相关版权规定

论文标题