论文标题
可视化与somnus的数据争吵的脚本
Visualizing the Scripts of Data Wrangling with SOMNUS
论文作者
论文摘要
数据工作者使用各种脚本语言进行数据转换,例如SAS,R和Python。但是,了解复杂的代码需要高级编程技能,这会阻碍数据工作者轻松了解数据转换的想法。程序可视化对调试和教育是有益的,并且有可能在直觉上和互动上说明转型。在本文中,我们探讨了可视化设计,以在数据转换的背景下演示代码片段的语义。首先,为了描绘单个数据转换,我们通过两个主要维度构建了一个设计空间,即要编码的关键参数以及要映射的可能的视觉通道。然后,我们得出了23个字形的集合,这些字形可视化转换语义。接下来,我们设计了一条名为Somnus的管道,该管道使用出处图提供了数据表的创建和演变的概述。同时,它可以详细研究单个转换。用户对SOMNUS的反馈是积极的。我们的研究参与者在使用SOMNUS的时间更少的时间内实现了更好的准确性,而不是精心制作的文本描述。此外,我们提供了两个示例应用程序,以证明somnus的效用和多功能性。
Data workers use various scripting languages for data transformation, such as SAS, R, and Python. However, understanding intricate code pieces requires advanced programming skills, which hinders data workers from grasping the idea of data transformation at ease. Program visualization is beneficial for debugging and education and has the potential to illustrate transformations intuitively and interactively. In this paper, we explore visualization design for demonstrating the semantics of code pieces in the context of data transformation. First, to depict individual data transformations, we structure a design space by two primary dimensions, i.e., key parameters to encode and possible visual channels to be mapped. Then, we derive a collection of 23 glyphs that visualize the semantics of transformations. Next, we design a pipeline, named Somnus, that provides an overview of the creation and evolution of data tables using a provenance graph. At the same time, it allows detailed investigation of individual transformations. User feedback on Somnus is positive. Our study participants achieved better accuracy with less time using Somnus, and preferred it over carefully-crafted textual description. Further, we provide two example applications to demonstrate the utility and versatility of Somnus.