论文标题

世界上所有的(超级)图:数据戏

All the World's a (Hyper)Graph: A Data Drama

论文作者

Coupette, Corinna, Vreeken, Jilles, Rieck, Bastian

论文摘要

我们介绍了夸张,这是一个来自莎士比亚戏剧的不同关系数据表示的数据集。我们的表示范围从单个场景中捕获字符共发生的简单图表到编码复杂的通信设置和角色贡献的超图像具有边缘特异性节点权重的Hyperedges。通过使多个直观表示形式容易可用于实验,我们促进了严格的表示图,图形挖掘和网络分析中的鲁棒性检查,突出了特定表示的优点和缺点。利用高音中发布的数据,我们证明了许多流行图挖掘问题的解决方案高度取决于表示的选择,从而使当前的图形策划实践提出了疑问。作为对我们的数据源的敬意,并断言科学也可以是艺术,我们以戏剧的形式介绍了所有观点。

We introduce Hyperbard, a dataset of diverse relational data representations derived from Shakespeare's plays. Our representations range from simple graphs capturing character co-occurrence in single scenes to hypergraphs encoding complex communication settings and character contributions as hyperedges with edge-specific node weights. By making multiple intuitive representations readily available for experimentation, we facilitate rigorous representation robustness checks in graph learning, graph mining, and network analysis, highlighting the advantages and drawbacks of specific representations. Leveraging the data released in Hyperbard, we demonstrate that many solutions to popular graph mining problems are highly dependent on the representation choice, thus calling current graph curation practices into question. As an homage to our data source, and asserting that science can also be art, we present all our points in the form of a play.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源