论文标题

流量:与HPC的跨繁殖云

StreamFlow: cross-breeding cloud with HPC

论文作者

Colonnelli, Iacopo, Cantalupo, Barbara, Merelli, Ivan, Aldinucci, Marco

论文摘要

工作流是各种执行环境中最常用的工具之一。他们中的许多人针对特定的环境;他们中很少有人能够在不同环境中执行整个工作流程,例如kubernetes和批处集群。我们提出了一种用于工作流执行的新颖方法,称为流量流,该方法与潜在复杂的执行环境的声明性描述补充了工作流程图,这使得可以在不共享共同数据空间的多个站点上执行。然后,在新型的生物信息学管道上为单细胞转录组数据分析工作流进行了示例。

Workflows are among the most commonly used tools in a variety of execution environments. Many of them target a specific environment; few of them make it possible to execute an entire workflow in different environments, e.g. Kubernetes and batch clusters. We present a novel approach to workflow execution, called StreamFlow, that complements the workflow graph with the declarative description of potentially complex execution environments, and that makes it possible the execution onto multiple sites not sharing a common data space. StreamFlow is then exemplified on a novel bioinformatics pipeline for single-cell transcriptomic data analysis workflow.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源