论文标题
关于稳健性和当地差异隐私
On robustness and local differential privacy
论文作者
论文摘要
开发统计分析工具的需求飙升,这些工具可抵抗污染以及保存单个数据所有者的隐私。尽管这两个主题都拥有丰富的文献,但据我们所知,我们是第一个系统地研究Huber污染模型和当地差异隐私(LDP)约束之间的最佳联系的人。 在本文中,我们从一般的minimax下限结果开始,这消除了与Huber的污染和保留LDP的强大成本。我们进一步研究了四个具体示例:一个两点测试问题,一个潜在的平均估计问题,非参数密度估计问题和单变量的中位数估计问题。对于每个问题,我们演示了在污染和最不发达国公司约束的情况下最佳的过程,对与最新方法的联系进行评论,这些方法仅在污染或隐私约束下进行的最新方法,并通过部分地回答LDP是否有能力私密的程序来揭示稳健性和LDP之间的联系。总体而言,我们的工作展示了对鲁棒性和当地差异隐私共同研究的有希望的前景。
It is of soaring demand to develop statistical analysis tools that are robust against contamination as well as preserving individual data owners' privacy. In spite of the fact that both topics host a rich body of literature, to the best of our knowledge, we are the first to systematically study the connections between the optimality under Huber's contamination model and the local differential privacy (LDP) constraints. In this paper, we start with a general minimax lower bound result, which disentangles the costs of being robust against Huber's contamination and preserving LDP. We further study four concrete examples: a two-point testing problem, a potentially-diverging mean estimation problem, a nonparametric density estimation problem and a univariate median estimation problem. For each problem, we demonstrate procedures that are optimal in the presence of both contamination and LDP constraints, comment on the connections with the state-of-the-art methods that are only studied under either contamination or privacy constraints, and unveil the connections between robustness and LDP via partially answering whether LDP procedures are robust and whether robust procedures can be efficiently privatised. Overall, our work showcases a promising prospect of joint study for robustness and local differential privacy.