论文标题
通过社交媒体的实时知识汇总,快速流行信息传播中的挑战和机遇
Challenges and Opportunities in Rapid Epidemic Information Propagation with Live Knowledge Aggregation from Social Media
论文作者
论文摘要
由于其不可预测性,诸如Covid-19大流行之类的快速发展的情况是AI/ML模型的重大挑战。 %最可靠的大流行指标是测试阳性病例的数量。但是,这些测试既不完整(由于未经测试的无症状病例)和迟到(由于初始接触事件的滞后,症状恶化和测试结果)。社交媒体可以通过更快和更高的覆盖范围来补充物理测试数据,但它们提出了不同的挑战:大量的噪音,错误信息和虚假信息。我们认为,只要满足两个条件,社交媒体就可以成为大流行的良好指标。第一个(真正的新颖性)是从不可预测的情况下捕获了新的,未知的信息。第二个(事实与小说)是可验证的事实与错误信息和虚假信息的区别。满足这两个条件的社交媒体信息称为实时知识。我们采用基于证据的知识获取方法(EBKA)方法来通过将社交媒体来源与权威来源整合在一起来收集,过滤和更新实时知识。尽管数量有限,但来自权威来源的可靠培训数据却可以过滤错误信息以及捕获真正的新信息。我们描述了实施EBKA的EDNA/LITMUS工具,将Twitter和Facebook等社交媒体与WHO和CDC等权威来源集成在一起,创建和更新有关COVID-19-19的大流行知识。
A rapidly evolving situation such as the COVID-19 pandemic is a significant challenge for AI/ML models because of its unpredictability. %The most reliable indicator of the pandemic spreading has been the number of test positive cases. However, the tests are both incomplete (due to untested asymptomatic cases) and late (due the lag from the initial contact event, worsening symptoms, and test results). Social media can complement physical test data due to faster and higher coverage, but they present a different challenge: significant amounts of noise, misinformation and disinformation. We believe that social media can become good indicators of pandemic, provided two conditions are met. The first (True Novelty) is the capture of new, previously unknown, information from unpredictably evolving situations. The second (Fact vs. Fiction) is the distinction of verifiable facts from misinformation and disinformation. Social media information that satisfy those two conditions are called live knowledge. We apply evidence-based knowledge acquisition (EBKA) approach to collect, filter, and update live knowledge through the integration of social media sources with authoritative sources. Although limited in quantity, the reliable training data from authoritative sources enable the filtering of misinformation as well as capturing truly new information. We describe the EDNA/LITMUS tools that implement EBKA, integrating social media such as Twitter and Facebook with authoritative sources such as WHO and CDC, creating and updating live knowledge on the COVID-19 pandemic.