论文标题

HALO 1.0:一个硬件敏捷的加速器编排框架,用于启用硬件 - 敏捷编程,具有真实的性能可移植性,用于异质HPC

HALO 1.0: A Hardware-agnostic Accelerator Orchestration Framework for Enabling Hardware-agnostic Programming with True Performance Portability for Heterogeneous HPC

论文作者

Riera, Michael, Tavakoli, Erfan Bank, Quraishi, Masudul Hassan, Ren, Fengbo

论文摘要

本文介绍了Halo 1.0,这是一种开放式的可扩展多代理软件框架,该框架实现了一组建议的硬件 - 敏捷加速器编排(HALO)原理。 Halo实现了一个新颖的以计算为中心的消息传递接口(C^2MPI)规范,以启用跨异构加速器的硬件 - 敏捷主机应用程序的性能便携式执行。 The experiment results of evaluating eight widely used HPC subroutines based on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 allows for a unified control flow for host programs to run across all the computing devices with a consistently top performance portability score, which is up to five orders of magnitude higher than the OpenCL-based 解决方案。

This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implements a set of proposed hardware-agnostic accelerator orchestration (HALO) principles. HALO implements a novel compute-centric message passing interface (C^2MPI) specification for enabling the performance portable execution of a hardware-agnostic host application across heterogeneous accelerators. The experiment results of evaluating eight widely used HPC subroutines based on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 allows for a unified control flow for host programs to run across all the computing devices with a consistently top performance portability score, which is up to five orders of magnitude higher than the OpenCL-based solution.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源