SuperDeconfuse：一个受监督的深卷卷动转换基于金融交易系统的融合框架

论文标题

SuperDeconfuse：一个受监督的深卷卷动转换基于金融交易系统的融合框架

SuperDeConFuse: A Supervised Deep Convolutional Transform based Fusion Framework for Financial Trading Systems

论文作者

Gupta, Pooja, Majumdar, Angshul, Chouzenoux, Emilie, Chierchia, Giovanni

论文摘要

这项工作提出了一个有监督的多渠道时间序列学习框架，用于金融股票交易。尽管最近在该域中提出了许多深度学习模型，但其中大多数将股票交易时间序列数据视为2-D图像数据，而其真实性质是1-D时间序列数据。由于股票交易系统是多通道数据，因此许多现有技术将其视为1D时间序列数据，并不是任何技术来有效融合多个渠道所携带的信息。为了为这两个缺点做出贡献，我们提出了一个受到先前建立（无监督的）卷积转换学习框架的启发的端到端监督学习框架。我们的方法包括通过单独的1-D卷积层处理数据通道，然后将输出与一系列完全连接的层融合，最后应用软马克斯分类层。我们的框架的特殊性 - SuperDeconFuse（SDCF），是我们删除位于多渠道卷积层和完全连接的层之间的非线性激活，以及位于后者和输出层之间的非线性激活。我们通过在训练阶段对上述层输出和过滤器引入合适的正则化来弥补这种去除。具体而言，我们在层过滤器上应用对数决定因素正规化，以破坏对称转换中的对称性和力多样性，而我们对层输出的非阴性约束强制限制以减轻死神经元的问题。这导致有关标准卷积神经网络的有效学习一组更丰富的功能和过滤器。数值实验证实，所提出的模型的结果比真实世界交易问题的最先进的深度学习技术要好得多。

This work proposes a supervised multi-channel time-series learning framework for financial stock trading. Although many deep learning models have recently been proposed in this domain, most of them treat the stock trading time-series data as 2-D image data, whereas its true nature is 1-D time-series data. Since the stock trading systems are multi-channel data, many existing techniques treating them as 1-D time-series data are not suggestive of any technique to effectively fusion the information carried by the multiple channels. To contribute towards both of these shortcomings, we propose an end-to-end supervised learning framework inspired by the previously established (unsupervised) convolution transform learning framework. Our approach consists of processing the data channels through separate 1-D convolution layers, then fusing the outputs with a series of fully-connected layers, and finally applying a softmax classification layer. The peculiarity of our framework - SuperDeConFuse (SDCF), is that we remove the nonlinear activation located between the multi-channel convolution layers and the fully-connected layers, as well as the one located between the latter and the output layer. We compensate for this removal by introducing a suitable regularization on the aforementioned layer outputs and filters during the training phase. Specifically, we apply a logarithm determinant regularization on the layer filters to break symmetry and force diversity in the learnt transforms, whereas we enforce the non-negativity constraint on the layer outputs to mitigate the issue of dead neurons. This results in the effective learning of a richer set of features and filters with respect to a standard convolutional neural network. Numerical experiments confirm that the proposed model yields considerably better results than state-of-the-art deep learning techniques for real-world problem of stock trading.

下载PDF全文

下载文献需遵守相关版权规定

论文标题