论文标题
神经网络中的概括:广泛的调查
Generalization in Neural Networks: A Broad Survey
论文作者
论文摘要
本文回顾了概念,建模方法和最新发现,沿着神经网络模型的不同级别的抽象范围,包括跨(1)个样本的概括,(2)分布,(3)域,(4)任务,(5)模态和(6)范围。讨论了(1)从培训到测试数据的样本概括的策略,并提出了暗示性的证据,至少对于Imagenet数据集,流行的分类模型表现出很大的过度拟合。统计数据的经验例子和观点突出了模型的概括如何受益于因果关系和反事实场景。 (3)域概括的转移学习方法和结果总结了,可用的域泛化基准数据集的财富也是如此。 (4)任务概括中调查的最新突破包括很少的元学习方法以及基于变压器的基础模型(例如用于语言处理的基础模型)的出现。综述了执行(5)模式概括的研究,包括整合图像和文本数据的研究,并在嗅觉,视觉和听觉方式上应用生物学启发的网络。调查了更高级别(6)范围的概括结果,包括基于图的方法来表示网络中的象征性知识以及提高网络解释性的归因策略。此外,讨论了关于大脑的模块化结构以及多巴胺驱动的条件导致抽象思维的概念。
This paper reviews concepts, modeling approaches, and recent findings along a spectrum of different levels of abstraction of neural network models including generalization across (1) Samples, (2) Distributions, (3) Domains, (4) Tasks, (5) Modalities, and (6) Scopes. Strategies for (1) sample generalization from training to test data are discussed, with suggestive evidence presented that, at least for the ImageNet dataset, popular classification models show substantial overfitting. An empirical example and perspectives from statistics highlight how models' (2) distribution generalization can benefit from consideration of causal relationships and counterfactual scenarios. Transfer learning approaches and results for (3) domain generalization are summarized, as is the wealth of domain generalization benchmark datasets available. Recent breakthroughs surveyed in (4) task generalization include few-shot meta-learning approaches and the emergence of transformer-based foundation models such as those used for language processing. Studies performing (5) modality generalization are reviewed, including those that integrate image and text data and that apply a biologically-inspired network across olfactory, visual, and auditory modalities. Higher-level (6) scope generalization results are surveyed, including graph-based approaches to represent symbolic knowledge in networks and attribution strategies for improving networks' explainability. Additionally, concepts from neuroscience are discussed on the modular architecture of brains and the steps by which dopamine-driven conditioning leads to abstract thinking.