论文标题
在Covid-19大流行期间,中国在线谣言的传播机制和影响力衡量
Spread Mechanism and Influence Measurement of Online Rumors in China During the COVID-19 Pandemic
论文作者
论文摘要
在2020年初,2019年的电晕病毒疾病(Covid-19)大流行席卷了世界。在中国,库维德19造成了严重的后果。此外,在19009年大流行期间的在线谣言增加了人们对公共卫生和社会稳定的恐慌。目前,了解和遏制在线谣言的传播是一项紧迫的任务。因此,我们分析了谣言传播机制,并提出了一种通过新内部人士的速度来量化谣言影响的方法。谣言的搜索频率用作新内部人士的观察变量。计算峰值系数和衰减系数的搜索频率,这符合指数分布。我们设计了几种谣言特征,并将上述两个系数用作可预测的标签。使用均方根误差(MSE)作为损耗函数的5倍交叉验证实验表明,决策树适合预测峰值系数,而线性回归模型是预测衰减系数的理想选择。我们的功能分析表明,前体特征对于爆发系数最为重要,而位置信息和谣言实体信息对于衰减系数最为重要。同时,有利于爆发的特征通常会对谣言的持续传播有害。同时,焦虑是导致因素的关键谣言。最后,我们讨论如何使用深度学习技术通过使用Transformers(BERT)模型的双向编码器表示来减少预测损失。
In early 2020, the Corona Virus Disease 2019 (COVID-19) pandemic swept the world.In China, COVID-19 has caused severe consequences. Moreover, online rumors during the COVID-19 pandemic increased people's panic about public health and social stability. At present, understanding and curbing the spread of online rumors is an urgent task. Therefore, we analyzed the rumor spreading mechanism and propose a method to quantify a rumors' influence by the speed of new insiders. The search frequency of the rumor is used as an observation variable of new insiders. The peak coefficient and the attenuation coefficient are calculated for the search frequency, which conforms to the exponential distribution. We designed several rumor features and used the above two coefficients as predictable labels. A 5-fold cross-validation experiment using the mean square error (MSE) as the loss function showed that the decision tree was suitable for predicting the peak coefficient, and the linear regression model was ideal for predicting the attenuation coefficient. Our feature analysis showed that precursor features were the most important for the outbreak coefficient, while location information and rumor entity information were the most important for the attenuation coefficient. Meanwhile, features that were conducive to the outbreak were usually harmful to the continued spread of rumors. At the same time, anxiety was a crucial rumor causing factor. Finally, we discuss how to use deep learning technology to reduce the forecast loss by using the Bidirectional Encoder Representations from Transformers (BERT) model.