时空动作的多人视频数据集注释方法

论文标题

时空动作的多人视频数据集注释方法

A Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions

论文作者

Yang, Fan

论文摘要

时空动作检测是视频理解中的一个重要且具有挑战性的问题。 However, the application of the existing large-scale spatio-temporal action datasets in specific fields is limited, and there is currently no public tool for making spatio-temporal action datasets, it takes a lot of time and effort for researchers to customize the spatio-temporal action datasets, so we propose a multi-Person video dataset Annotation Method of spatio-temporally actions.First, we use ffmpeg to crop the视频和框架视频；然后使用Yolov5在视频框架中检测人，然后使用深层排序来检测视频框架中人的ID。通过处理Yolov5和Deep Sort的检测结果，我们可以获取时空操作数据集的注释文件，以完成自定义时空动作数据集的工作。 https://github.com/whiffe/custom-ava-dataset_custom-patio-patio-patio-tormally-action-video-dataset

Spatio-temporal action detection is an important and challenging problem in video understanding. However, the application of the existing large-scale spatio-temporal action datasets in specific fields is limited, and there is currently no public tool for making spatio-temporal action datasets, it takes a lot of time and effort for researchers to customize the spatio-temporal action datasets, so we propose a multi-Person video dataset Annotation Method of spatio-temporally actions.First, we use ffmpeg to crop the videos and frame the videos; then use yolov5 to detect human in the video frame, and then use deep sort to detect the ID of the human in the video frame. By processing the detection results of yolov5 and deep sort, we can get the annotation file of the spatio-temporal action dataset to complete the work of customizing the spatio-temporal action dataset. https://github.com/Whiffe/Custom-ava-dataset_Custom-Spatio-Temporally-Action-Video-Dataset

下载PDF全文

下载文献需遵守相关版权规定

论文标题