Paper title

Challenges in Docker Development: A Large-scale Study Using Stack Overflow

Authors

Mubin Ul Haque, Leonardo Horn Iwaya, M. Ali Babar

Abstract

Docker technology is increasingly used by software developers across a multitude of projects. This growing interest arises because Docker supports a convenient process for creating and building containers, promotes close cooperation between development and operations teams, and enables continuous software delivery. Because the technology is growing fast, it is important to identify which Docker-related topics are most popular and which challenges and difficulties developers face. This paper presents a large-scale empirical study that identifies practitioners' perspectives on Docker technology by mining posts from the Stack Overflow (SoF) community. We built a dataset of 113,922 Docker-related posts, selected by a set of relevant tags and by post content, and then cleaned and prepared it. We performed topic modelling with Latent Dirichlet Allocation (LDA) to identify the dominant topics in the domain. Our results show that most developers use SoF to ask about a broad spectrum of Docker topics, including framework development, application deployment, continuous integration, web-server configuration, and many more. The 30 topics that developers discuss can be grouped into 13 main categories. Most posts belong to the categories of application development, configuration, and networking. In contrast, posts on monitoring status, transferring data, and authenticating users are more popular among developers than posts on the other topics. Specifically, developers face challenges with web browser issues, networking errors, and memory management. There is also a lack of experts in this domain. Our findings will guide future work on new tools and techniques, helping the community to focus its efforts and understand existing trade-offs on Docker topics.
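To make the topic-modelling step more concrete, the following is a minimal sketch of LDA fitted by collapsed Gibbs sampling on a handful of toy posts. It is not the authors' pipeline: the abstract does not state which LDA implementation, hyperparameters, or preprocessing rules were used, so the sample posts, the stopword list, the values of `alpha` and `beta`, the number of topics, and every function name here are assumptions chosen for illustration only.

```python
import random
import re

# Hypothetical stopword list; the study's actual cleaning rules are not given.
STOPWORDS = {"a", "an", "the", "to", "in", "on", "my", "how", "do", "i",
             "is", "of", "for", "and", "with", "from", "not"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, and drop stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def fit_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA with collapsed Gibbs sampling on tokenized documents.

    Returns (doc_topic, topic_word, vocab), where doc_topic[d][k] counts the
    tokens of document d assigned to topic k, and topic_word[k][w] counts how
    often vocabulary word w is assigned to topic k.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Resample its topic from the collapsed conditional.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1

    return doc_topic, topic_word, vocab


def dominant_topic(topic_counts):
    """Return the index of the topic with the most tokens in one document."""
    return max(range(len(topic_counts)), key=lambda k: topic_counts[k])


def top_words(topic_word, vocab, k, n=5):
    """Return the n most frequent vocabulary words assigned to topic k."""
    order = sorted(range(len(vocab)), key=lambda w: topic_word[k][w], reverse=True)
    return [vocab[w] for w in order[:n]]


# Toy posts invented for this sketch; they are not taken from the dataset.
posts = [
    "How do I connect a Docker container to the host network?",
    "Container cannot reach another container on the bridge network",
    "Port mapping is not working with the docker network",
    "Docker build fails in my Jenkins continuous integration pipeline",
    "How to cache image layers in a continuous integration build",
    "Jenkins pipeline cannot push the image after build",
]
docs = [tokenize(p) for p in posts]
doc_topic, topic_word, vocab = fit_lda(docs, num_topics=2)

for k in range(2):
    print("topic", k, top_words(topic_word, vocab, k))
for post, counts in zip(posts, doc_topic):
    print(dominant_topic(counts), post)
```

On a corpus of over a hundred thousand posts, an established library implementation would be the practical choice; the hand-written sampler above only shows how each post ends up with a dominant topic that can then be grouped into broader categories.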
