论文标题
使用输入过滤神经ODES的简短事件相机流的实时分类
Real-time Classification from Short Event-Camera Streams using Input-filtering Neural ODEs
论文作者
论文摘要
基于事件的相机是受人体视觉系统启发的新颖,有效的传感器,产生了异步的,像素的数据流。从这些数据中学习通常是通过大量预处理和事件集成到图像中进行的。这需要缓冲可能的长序列,并且可以限制推理系统的响应时间。在这项工作中,我们建议直接使用DVS摄像头,强度变化及其空间坐标的事件。该序列用作新型\ emph {异步} rnn样架构的输入,即输入过滤神经odes(inode)。这是受动力学系统和过滤文献的启发。 Inode是神经odes(节点)的扩展,它允许输入信号连续馈送到网络,例如过滤。该方法通过实现批量前向Euler求解器来自然处理时间序列的批次序列。 Inode像标准RNN一样受过训练,它学会了区分短期事件序列并在线推论进行事件。我们在一系列分类任务上演示了我们的方法,并与一组LSTM基准进行了比较。我们表明,与摄像机分辨率无关,Inode可以在ASL任务上大幅度优于基准,并且与NCALTECH任务的LSTM相当。最后,我们证明即使提供了很少的事件,Inode也是准确的。
Event-based cameras are novel, efficient sensors inspired by the human vision system, generating an asynchronous, pixel-wise stream of data. Learning from such data is generally performed through heavy preprocessing and event integration into images. This requires buffering of possibly long sequences and can limit the response time of the inference system. In this work, we instead propose to directly use events from a DVS camera, a stream of intensity changes and their spatial coordinates. This sequence is used as the input for a novel \emph{asynchronous} RNN-like architecture, the Input-filtering Neural ODEs (INODE). This is inspired by the dynamical systems and filtering literature. INODE is an extension of Neural ODEs (NODE) that allows for input signals to be continuously fed to the network, like in filtering. The approach naturally handles batches of time series with irregular time-stamps by implementing a batch forward Euler solver. INODE is trained like a standard RNN, it learns to discriminate short event sequences and to perform event-by-event online inference. We demonstrate our approach on a series of classification tasks, comparing against a set of LSTM baselines. We show that, independently of the camera resolution, INODE can outperform the baselines by a large margin on the ASL task and it's on par with a much larger LSTM for the NCALTECH task. Finally, we show that INODE is accurate even when provided with very few events.