论文标题

部分可观测时空混沌系统的无模型预测

Treating Crowdsourcing as Examination: How to Score Tasks and Online Workers?

论文作者

Han, Guangyang, Li, Sufang, Wang, Runmin, Wu, Chunming

论文摘要

众包是一种在线外包模式,可以解决当前机器学习算法对大量标记数据的敦促。请求者在众包平台上发布任务,这些平台通过Internet雇用在线工人来完成任务,然后汇总并将结果返回给请求者。如何建模不同类型的工人和任务之间的相互作用是一个热点。在本文中,我们试图根据他们的能力将工人建模为四种类型:专家,普通工人,草率的工人和垃圾邮件发送者,并根据他们的困难将任务分为硬,中等和容易的任务。我们认为,即使是专家在艰巨的任务中挣扎,而草率的工人也可以正确完成任务,并且垃圾邮件发送者总是故意给出错误的答案。因此,良好的检查任务应具有适度的难度和可区分性,以更具客观地评分工人。因此,我们首先在中等困难的任务上为工人的能力评分,然后在推断任务的地面真相时减少了草率工人的答案重量并修改垃圾邮件发送者的答案。采用概率图模型来模拟任务执行过程,并采用了一种迭代方法来计算和更新地面真相,工人的能力以及任务的难度。我们在模拟和真实的众包场景中验证算法的正确性和有效性。

Crowdsourcing is an online outsourcing mode which can solve the current machine learning algorithm's urge need for massive labeled data. Requester posts tasks on crowdsourcing platforms, which employ online workers over the Internet to complete tasks, then aggregate and return results to requester. How to model the interaction between different types of workers and tasks is a hot spot. In this paper, we try to model workers as four types based on their ability: expert, normal worker, sloppy worker and spammer, and divide tasks into hard, medium and easy task according to their difficulty. We believe that even experts struggle with difficult tasks while sloppy workers can get easy tasks right, and spammers always give out wrong answers deliberately. So, good examination tasks should have moderate degree of difficulty and discriminability to score workers more objectively. Thus, we first score workers' ability mainly on the medium difficult tasks, then reducing the weight of answers from sloppy workers and modifying the answers from spammers when inferring the tasks' ground truth. A probability graph model is adopted to simulate the task execution process, and an iterative method is adopted to calculate and update the ground truth, the ability of workers and the difficulty of the task successively. We verify the rightness and effectiveness of our algorithm both in simulated and real crowdsourcing scenes.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源