论文标题
了解在常见腐败下多exit模型的鲁棒性
Understanding the Robustness of Multi-Exit Models under Common Corruptions
论文作者
论文摘要
多exit模型(MEMS)使用早期外观策略来提高深神经网络(DNN)的准确性和效率,通过允许样品在最后一层之前退出网络。但是,在存在分布变化的情况下,MEM的有效性在很大程度上尚未探索。我们的工作研究了共同图像腐败产生的分布变化如何影响MEM的准确性/效率。我们发现,在常见的腐败下,早期出现在第一个正确的出口下降低了推理成本,并显着提高了准确性(10%),而不是在最后一层退出。但是,凭借现实的早期策略(不假定正确的出口知识),MEMS仍会降低推理成本,但与在最后一层退出相比,准确性(1%)具有边际提高。此外,与分布数据的差距相比,分布偏移的存在扩大了MEM的最大分类精度和现实的早期外出策略之间的差距5%。我们的经验分析表明,由于分配变化而缺乏校准会增加此类早期远期策略早期退出并提高错误分类率的敏感性。此外,与分布数据的评估相比,缺乏校准会增加跨退出模型的预测的不一致,从而导致推理效率低下和更多的错误分类。最后,我们提出了两个指标,即思考和过度思考,这些指标量化了分配转移的实际早期远期策略的不同行为,并提供了改善MEMS实际实用性的见解。
Multi-Exit models (MEMs) use an early-exit strategy to improve the accuracy and efficiency of deep neural networks (DNNs) by allowing samples to exit the network before the last layer. However, the effectiveness of MEMs in the presence of distribution shifts remains largely unexplored. Our work examines how distribution shifts generated by common image corruptions affect the accuracy/efficiency of MEMs. We find that under common corruptions, early-exiting at the first correct exit reduces the inference cost and provides a significant boost in accuracy ( 10%) over exiting at the last layer. However, with realistic early-exit strategies, which do not assume knowledge about the correct exits, MEMs still reduce inference cost but provide a marginal improvement in accuracy (1%) compared to exiting at the last layer. Moreover, the presence of distribution shift widens the gap between an MEM's maximum classification accuracy and realistic early-exit strategies by 5% on average compared with the gap on in-distribution data. Our empirical analysis shows that the lack of calibration due to a distribution shift increases the susceptibility of such early-exit strategies to exit early and increases misclassification rates. Furthermore, the lack of calibration increases the inconsistency in the predictions of the model across exits, leading to both inefficient inference and more misclassifications compared with evaluation on in-distribution data. Finally, we propose two metrics, underthinking and overthinking, that quantify the different behavior of practical early-exit strategy under distribution shifts, and provide insights into improving the practical utility of MEMs.