论文标题
改善机器学习研究中的可重复性(Neurips 2019可重复性计划的报告)
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
论文作者
论文摘要
机器学习研究的挑战之一是确保出现和发布的结果是合理而可靠的。可重复性,即使用相同的代码和数据(如果有的话)获得了与论文或谈话中提出的相似结果,这是验证研究结果可靠性的必要步骤。可重复性也是促进开放且可访问的研究的重要步骤,从而使科学界能够快速整合新发现并将思想转化为实践。可重复性还促进了强大的实验工作流的使用,这可能会减少无意的错误。 2019年,神经信息处理系统(Neurips)会议是机器学习研究国际研究会议,引入了可重复性计划,旨在提高社区的标准,以如何进行,交流和评估机器学习研究。该计划包含三个组成部分:代码提交策略,社区范围的可重复性挑战以及机器学习可重复性清单作为论文提交过程的一部分。在本文中,我们描述了这些组件中的每一个,如何部署以及我们能够从该计划中学习的内容。
One of the challenges in machine learning research is to ensure that presented and published results are sound and reliable. Reproducibility, that is obtaining similar results as presented in a paper or talk, using the same code and data (when available), is a necessary step to verify the reliability of research findings. Reproducibility is also an important step to promote open and accessible research, thereby allowing the scientific community to quickly integrate new findings and convert ideas to practice. Reproducibility also promotes the use of robust experimental workflows, which potentially reduce unintentional errors. In 2019, the Neural Information Processing Systems (NeurIPS) conference, the premier international conference for research in machine learning, introduced a reproducibility program, designed to improve the standards across the community for how we conduct, communicate, and evaluate machine learning research. The program contained three components: a code submission policy, a community-wide reproducibility challenge, and the inclusion of the Machine Learning Reproducibility checklist as part of the paper submission process. In this paper, we describe each of these components, how it was deployed, as well as what we were able to learn from this initiative.