论文标题

用于聚类数据分析的灵活和简约的建模策略

A Flexible and Parsimonious Modelling Strategy for Clustered Data Analysis

论文作者

Huang, Tao, Pei, Youquan, You, Jinhong, Zhang, Wenyang

论文摘要

统计建模策略是数据分析成功的关键。灵活性和简约之间的权衡在统计建模中起着至关重要的作用。在群集数据分析中,为了说明簇之间的异质性,在建模中需要进行一定的灵活性,但是还需要简短的灵活性,以防止群集之间的复杂性并说明均匀性。在本文中,我们提出了一种灵活的,简约的建模策略,用于群集数据分析。该策略在灵活性和简约性之间取得了很好的平衡,并且在群集之间很好地说明了异质性和同质性,这通常具有很强的实际含义。实际上,它的有用性超出了群集数据分析,它还为转移学习提供了有希望的灯光。为结果模型中的未知数开发了一个估计程序,并建立了估计器的渐近性能。进行了密集的仿真研究,以证明所提出的方法的工作状况,还提供了真实的数据分析,以说明如何应用建模策略和相关估计程序,以回答由现实生活引起的一些实际问题。

Statistical modelling strategy is the key for success in data analysis. The trade-off between flexibility and parsimony plays a vital role in statistical modelling. In clustered data analysis, in order to account for the heterogeneity between the clusters, certain flexibility is necessary in the modelling, yet parsimony is also needed to guard against the complexity and account for the homogeneity among the clusters. In this paper, we propose a flexible and parsimonious modelling strategy for clustered data analysis. The strategy strikes a nice balance between flexibility and parsimony, and accounts for both heterogeneity and homogeneity well among the clusters, which often come with strong practical meanings. In fact, its usefulness has gone beyond clustered data analysis, it also sheds promising lights on transfer learning. An estimation procedure is developed for the unknowns in the resulting model, and asymptotic properties of the estimators are established. Intensive simulation studies are conducted to demonstrate how well the proposed methods work, and a real data analysis is also presented to illustrate how to apply the modelling strategy and associated estimation procedure to answer some real problems arising from real life.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源