论文标题
Procdata:用于过程数据分析的R软件包
ProcData: An R Package for Process Data Analysis
论文作者
论文摘要
流程数据是指基于计算机项目的日志文件中记录的数据。这些数据(表示为时间戳的动作序列)跟踪受访者解决项目的响应过程。流程数据分析旨在通过利用响应过程中包含的丰富信息来提高教育评估准确性并提供其他评估目的。本文介绍的R Packagdata旨在提供用于处理,描述和分析过程数据的工具。我们为组织过程数据定义了一个S3类“ PROC”,并扩展了“ Proc”类的通用方法摘要和打印。在软件包中实现了两种用于过程数据的特征提取方法,以将不规则响应过程中的信息压缩为常规数字向量。 Procdata还提供了拟合和从基于神经网络的序列模型进行预测的功能。这些功能称为keras中的相关功能,用于构建和培训神经网络。此外,包含在2012年国际学生评估计划中的几个响应过程生成器和气候控制项目的响应过程的真实数据集。
Process data refer to data recorded in the log files of computer-based items. These data, represented as timestamped action sequences, keep track of respondents' response processes of solving the items. Process data analysis aims at enhancing educational assessment accuracy and serving other assessment purposes by utilizing the rich information contained in response processes. The R package ProcData presented in this article is designed to provide tools for processing, describing, and analyzing process data. We define an S3 class "proc" for organizing process data and extend generic methods summary and print for class "proc". Two feature extraction methods for process data are implemented in the package for compressing information in the irregular response processes into regular numeric vectors. ProcData also provides functions for fitting and making predictions from a neural-network-based sequence model. These functions call relevant functions in package keras for constructing and training neural networks. In addition, several response process generators and a real dataset of response processes of the climate control item in the 2012 Programme for International Student Assessment are included in the package.