通过稀疏的高斯近似值进行信用欺诈检测

论文标题

通过稀疏的高斯近似值进行信用欺诈检测

Towards Credit-Fraud Detection via Sparsely Varying Gaussian Approximations

论文作者

Sharma, Harshit, Gandhi, Harsh K., Jain, Apoorv

论文摘要

对于许多金融机构来说，欺诈活动是一个昂贵的问题，每年耗资数十亿美元。在这方面，更常见的活动是信用卡欺诈。在这种情况下，信用卡欺诈检测概念是在将不确定性纳入我们的预测系统中以确保在如此关键任务中更好地判断的界限开发的。我们建议使用稀疏的高斯分类方法与大数据集合使用，并使用伪或诱导输入的概念。我们使用不同的内核组执行相同的操作，并且通过选择具有更高数量诱导点的RBF内核获得了不同数量的诱导数据点以显示最佳精度。鉴于我们采用的方法的随机性质，并且在预测方面的差异较低，表明我们模型中的信心和鲁棒性，我们的方法能够在大量财务数据上工作。使用贝叶斯学习技术的方法论，与掺入的诱导点现象相关，可以成功获得健康的准确性和较高的置信度评分。

Fraudulent activities are an expensive problem for many financial institutions, costing billions of dollars to corporations annually. More commonly occurring activities in this regard are credit card frauds. In this context, the credit card fraud detection concept has been developed over the lines of incorporating the uncertainty in our prediction system to ensure better judgment in such a crucial task. We propose to use a sparse Gaussian classification method to work with the large data-set and use the concept of pseudo or inducing inputs. We perform the same with different sets of kernels and the different number of inducing data points to show the best accuracy was obtained with the selection of RBF kernel with a higher number of inducing points. Our approach was able to work over large financial data given the stochastic nature of our method employed and also good test accuracy with low variance over the prediction suggesting confidence and robustness in our model. Using the methodologies of Bayesian learning techniques with the incorporated inducing points phenomenon, are successfully able to obtain a healthy accuracy and a high confidence score.

下载PDF全文

下载文献需遵守相关版权规定

论文标题