Paper Title
The LBNL Superfacility Project Report
Paper Authors
Paper Abstract
The Superfacility model is designed to leverage HPC for experimental science. It is more than simply a model of connected experiment, network, and HPC facilities; it encompasses the full ecosystem of infrastructure, software, tools, and expertise needed to make connected facilities easy to use. The three-year Lawrence Berkeley National Laboratory (LBNL) Superfacility project was initiated in 2019 to coordinate work being performed at LBNL to support this model, and to provide a coherent and comprehensive set of science requirements to drive existing and new work. A key component of the project was in-depth engagement with eight science teams that represent challenging use cases across the DOE Office of Science. By the close of the project, we had met our project goal: our science application engagements demonstrated automated pipelines that analyze data from remote facilities at large scale, without routine human intervention. In several cases, we have gone beyond demonstrations and now provide production-level services. To achieve this goal, the Superfacility team developed tools, infrastructure, and policies for near-real-time computing support, dynamic high-performance networking, data management and movement tools, API-driven automation, HPC-scale notebooks via Jupyter, authentication using Federated Identity, and container-based edge services. The lessons we learned during this project provide a valuable model for future large, complex, cross-disciplinary collaborations. There is a pressing need for a coherent computing infrastructure across national facilities, and LBNL's Superfacility project is a unique model for success in tackling the challenges that will be faced in hardware, software, policies, and services across multiple science domains.