Paper Title

The Ghost of Performance Reproducibility Past

Authors

Srinivasan Ramesh, Mikhail Titov, Matteo Turilli, Shantenu Jha, Allen Malony

Abstract

The importance of ensemble computing is well established. However, executing ensembles at scale introduces interesting performance fluctuations that have not been well investigated. In this paper, we trace our experience uncovering performance fluctuations of ensemble applications (primarily constituting a workflow of GROMACS tasks), and unsuccessful attempts, so far, at trying to discern the underlying cause(s) of performance fluctuations. Is the failure to discern the causative or contributing factors a failure of capability? Or imagination? Do the fluctuations have their genesis in some inscrutable aspect of the system or software? Does it warrant a fundamental reassessment and rethinking of how we assume and conceptualize performance reproducibility? Answers to these questions are not straightforward, nor are they immediate or obvious. We conclude with a discussion about the performance of ensemble applications and ruminate over the implications for how we define and measure application performance.
