论文标题
挖掘重复的堆栈溢出问题
Mining Duplicate Questions of Stack Overflow
论文作者
论文摘要
在过去的十年中,社区问答站点(CQA)的使用的使用率显着增加,这主要是由于他们利用人群的智慧的能力。重复的问题对这些网站的质量产生了残酷的影响。因此,解决重复的问题是提高CQA质量的重要一步。在这方面,我们提出了两个基于神经网络的架构,以在堆栈溢出上进行重复的问题检测。我们还建议对问题中存在的代码进行明确建模,以达到超过最新状态的结果。
There has a been a significant rise in the use of Community Question Answering sites (CQAs) over the last decade owing primarily to their ability to leverage the wisdom of the crowd. Duplicate questions have a crippling effect on the quality of these sites. Tackling duplicate questions is therefore an important step towards improving quality of CQAs. In this regard, we propose two neural network based architectures for duplicate question detection on Stack Overflow. We also propose explicitly modeling the code present in questions to achieve results that surpass the state of the art.