论文标题
符号图形的司额群集甲骨文
Sublinear-Time Clustering Oracle for Signed Graphs
论文作者
论文摘要
社交网络通常是使用签名图对社交网络进行建模的,其中顶点与用户相对应,并且边缘的符号表明用户之间的相互作用是正面还是负面。出现的符号图通常包含一个清晰的社区结构,因为该图可以分配到少量的极化社区中,每个群落都定义了稀疏的切割,并且不可分割地分为较小的极化子共同体。我们为具有如此清晰的社区结构的签名图提供了当地的聚类甲骨文,可以回答会员资格查询,即“给定一个顶点$ v $,哪个社区属于哪个社区?正式地,当图形具有最高度且社区数量最多为$ o(\ log n)$时,然后使用$ \ tilde {o}(\ sqrt {n} \ sqrt {n} \ operatorname {poly}(poly}(1/\ varepsilon)) $ \ tilde {o}(\ sqrt {n} \ operatorAtorName {poly}(1/\ varepsilon))$ time,并且它正确地对$(1- \ varepsilon)$ - Vertices W.R.T.的分数进行了分类。一组隐藏的种植地面真实社区。我们的甲骨文在仅需要少数顶点需要的聚类信息的应用中是可取的。以前,这样的局部聚类甲壳仅因无符号图而闻名。我们对签名图的概括需要许多新的想法,并对随机步行的行为进行了新的光谱分析。我们评估了我们的算法,用于在合成和现实世界数据集上构建这种甲骨文并回答会员资格查询,从而在实践中验证其性能。
Social networks are often modeled using signed graphs, where vertices correspond to users and edges have a sign that indicates whether an interaction between users was positive or negative. The arising signed graphs typically contain a clear community structure in the sense that the graph can be partitioned into a small number of polarized communities, each defining a sparse cut and indivisible into smaller polarized sub-communities. We provide a local clustering oracle for signed graphs with such a clear community structure, that can answer membership queries, i.e., "Given a vertex $v$, which community does $v$ belong to?", in sublinear time by reading only a small portion of the graph. Formally, when the graph has bounded maximum degree and the number of communities is at most $O(\log n)$, then with $\tilde{O}(\sqrt{n}\operatorname{poly}(1/\varepsilon))$ preprocessing time, our oracle can answer each membership query in $\tilde{O}(\sqrt{n}\operatorname{poly}(1/\varepsilon))$ time, and it correctly classifies a $(1-\varepsilon)$-fraction of vertices w.r.t. a set of hidden planted ground-truth communities. Our oracle is desirable in applications where the clustering information is needed for only a small number of vertices. Previously, such local clustering oracles were only known for unsigned graphs; our generalization to signed graphs requires a number of new ideas and gives a novel spectral analysis of the behavior of random walks with signs. We evaluate our algorithm for constructing such an oracle and answering membership queries on both synthetic and real-world datasets, validating its performance in practice.