论文标题
多模式视频法医平台,用于调查后恐怖袭击场景
Multi-Modal Video Forensic Platform for Investigating Post-Terrorist Attack Scenarios
论文作者
论文摘要
对恐怖袭击的法医调查对调查当局构成了重大挑战,因此经常必须观看数千小时的录像。大型视频分析平台(VAP)协助执法机构(LEA)确定嫌疑犯并确保证据。当前的平台主要关注不同的计算机视觉方法的集成,因此仅限于单一模态。我们提出了一个视频分析平台,该平台集成了视觉和音频分析模块,并融合了来自目击者的监视摄像机和视频上传的信息。视频根据其声学和视觉内容进行分析。具体而言,音频事件检测用于根据攻击特异性的声学概念索引内容。音频相似性搜索用于识别从不同角度录制的相似视频序列。视觉对象检测和跟踪用于根据相关概念索引内容。引入了创新的用户界面概念,以利用分析模块的异质结果的全部潜力,从而使研究人员能够更快地对铅和目击者报告进行更快的跟进。
The forensic investigation of a terrorist attack poses a significant challenge to the investigative authorities, as often several thousand hours of video footage must be viewed. Large scale Video Analytic Platforms (VAP) assist law enforcement agencies (LEA) in identifying suspects and securing evidence. Current platforms focus primarily on the integration of different computer vision methods and thus are restricted to a single modality. We present a video analytic platform that integrates visual and audio analytic modules and fuses information from surveillance cameras and video uploads from eyewitnesses. Videos are analyzed according their acoustic and visual content. Specifically, Audio Event Detection is applied to index the content according to attack-specific acoustic concepts. Audio similarity search is utilized to identify similar video sequences recorded from different perspectives. Visual object detection and tracking are used to index the content according to relevant concepts. Innovative user-interface concepts are introduced to harness the full potential of the heterogeneous results of the analytical modules, allowing investigators to more quickly follow-up on leads and eyewitness reports.