Paper Title

Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot

Authors

Jingtong Su, Yihang Chen, Tianle Cai, Tianhao Wu, Ruiqi Gao, Liwei Wang, Jason D. Lee

Abstract

Network pruning is a method for reducing test-time computational resource requirements with minimal performance degradation. Conventional wisdom about pruning algorithms suggests that: (1) pruning methods exploit information from training data to find good subnetworks; (2) the architecture of the pruned network is crucial for good performance. In this paper, we conduct sanity checks of these beliefs on several recent unstructured pruning methods and surprisingly find that: (1) a set of methods that aim to find good subnetworks of the randomly initialized network (which we call "initial tickets") hardly exploit any information from the training data; (2) for the pruned networks obtained by these methods, randomly changing which weights are preserved in each layer, while keeping the number of preserved weights per layer unchanged, does not affect the final performance. These findings inspire us to choose a series of simple data-independent prune ratios for each layer, and to randomly prune each layer accordingly to get a subnetwork (which we call "random tickets"). Experimental results show that our zero-shot random tickets outperform or attain similar performance to existing "initial tickets". In addition, we identify one existing pruning method that passes our sanity checks. We hybridize the ratios in our random tickets with this method and propose a new method called "hybrid tickets", which achieves further improvement. (Our code is publicly available at https://github.com/JingtongSu/sanity-checking-pruning)
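The two operations the abstract describes (randomly pruning each layer to a fixed ratio, and randomly changing which weights are preserved while keeping the per-layer count) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the layer sizes, and the keep ratios below are hypothetical placeholders, and the paper's actual data-independent ratio schedule is not reproduced here.

```python
import random


def random_ticket_masks(layer_sizes, keep_ratios, seed=0):
    """Build one 0/1 mask per layer, keeping round(ratio * size) weights
    chosen uniformly at random. No training data is consulted."""
    rng = random.Random(seed)
    masks = []
    for size, ratio in zip(layer_sizes, keep_ratios):
        n_keep = round(ratio * size)
        kept = set(rng.sample(range(size), n_keep))
        masks.append([1 if i in kept else 0 for i in range(size)])
    return masks


def shuffle_within_layers(masks, seed=1):
    """Permute each layer's mask independently, so the positions of the
    preserved weights change but the per-layer count does not."""
    rng = random.Random(seed)
    shuffled = []
    for mask in masks:
        permuted = list(mask)
        rng.shuffle(permuted)
        shuffled.append(permuted)
    return shuffled


# Hypothetical flattened layer sizes and keep ratios (assumptions for
# illustration only, not values from the paper).
layer_sizes = [100, 200, 50]
keep_ratios = [0.5, 0.25, 0.1]

masks = random_ticket_masks(layer_sizes, keep_ratios)
shuffled = shuffle_within_layers(masks)

kept_counts = [sum(m) for m in masks]
shuffled_counts = [sum(m) for m in shuffled]
```

In a real network each mask would be reshaped to its layer's weight tensor and multiplied elementwise with the weights before training; that step is left out to keep the sketch free of framework dependencies.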
