Document Type : Applied Article

Authors

School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran.

Abstract

In recent years, new technologies have brought new innovations into the financial and commercial world, giving fraudsters many ways to commit fraud and cost companies big time. We can build systems that detect fraudulent patterns and prevent future incidents using advanced technologies. Machine learning algorithms are being used more for fraud detection in financial data. But the common challenge is the imbalance of the dataset which hinders traditional machine learning methods. Finding the best approach towards these imbalance datasets is the problem many of the researchers are facing when trying to use machine learning methods. In this paper, we propose the method called FinFD-GCN that use Graph Convolutional Networks (GCNs) for fraud detection in credit card transaction datasets. FinFD-GCN represents transactions as graph in which each node represents a transaction and each edge represents similarity between transactions. By using this graph representation FinFD-GCN can capture complex relationships and anomalies that may have been overlooked by traditional methods or were even impossible to detect with conventional approaches, thus enhancing the accuracy and robustness of fraud detection in financial data. We use common evaluation metrics and confusion matrices to evaluate the proposed method. FinFD-GCN achieves significant improvements in recall and AUC compared to traditional methods such as logistic regression, support vector machines, and random forests, making it a robust solution for credit card fraud detection. By using the GCN model for fraud detection in this credit card dataset we outperformed base models 5% and 10%, with respect to F1 and AUC, respectively.

Keywords

Main Subjects

[1] S. prakash. "Online Fraud Is Bad For Business And Customers." Forbes. https://www.forbes.com/sites/forbestechcouncil/2020/07/21/online-fraud-is-bad-for-business-and-customers (accessed).
 
[2] K. G. Al-Hashedi and P. Magalingam, "Financial fraud detection applying data mining techniques: A comprehensive review from 2009 to 2019," Computer Science Review, vol. 40, p. 100402, 2021/05/01/ 2021, doi: https://doi.org/10.1016/j.cosrev.2021.100402.
[3] S. Barman, U. Pal, M. Sarfaraj, B. Biswas, A. Mahata, and P. Mandal, "A complete literature review on financial fraud detection applying data mining techniques," International Journal of Trust Management in Computing and Communications, vol. 3, p. 336, 01/01 2016, doi: 10.1504/IJTMCC.2016.084561.
 
[4] N. Boutaher, A. Elomri, N. Abghour, K. Moussaid, and M. Rida, A Review of Credit Card Fraud Detection Using Machine Learning Techniques. 2020, pp. 1-5.
 
[5] W. Hilal, S. Gadsden, and J. Yawney, "Financial Fraud: A Review of Anomaly Detection Techniques and Recent Advances," Expert Systems with Applications, vol. 193, p. 116429, 12/01 2021, doi: 10.1016/j.eswa.2021.116429.
 
[6] A. Trozze et al., "Cryptocurrencies and future financial crime," Crime Science, vol. 11, 01/05 2022, doi: 10.1186/s40163-021-00163-8.
 
[7] M. Nelsen. "Outsmarting Fraudsters with Advanced Analytics." visa. https://usa.visa.com/visa-everywhere/security/outsmarting-fraudsters-with-advanced-analytics.html (accessed).
 
[8] T. Kipf and M. Welling, "Semi-Supervised Classification with Graph Convolutional Networks," 09/09 2016.
 
[9] A. Mahajan, V. S. Baghel, and R. Jayaraman, "Credit Card Fraud Detection using Logistic Regression with Imbalanced Dataset," in 2023 10th International Conference on Computing for Sustainable Global Development (INDIACom), 15-17 March 2023 2023, pp. 339-342.
 
[10] T. Yiu. "Understanding Random Forest." Towards Data Science. https://towardsdatascience.com/understanding-random-forest-58381e0602d2 (accessed).
 
[11] N. K. Gyamfi and J. D. Abdulai, "Bank Fraud Detection Using Support Vector Machine," in 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), 1-3 Nov. 2018 2018, pp. 37-41, doi: 10.1109/IEMCON.2018.8614994.
 
[12] S. Beigi and M. R. Amin Naseri, "Credit Card Fraud Detection using Data mining and Statistical Methods," (in en), Journal of AI and Data Mining, vol. 8, no. 2, pp. 149-160, 2020, doi: 10.22044/jadm.2019.7506.1894.
 
[13] P. W. Battaglia et al., "Relational inductive biases, deep learning, and graph networks," ArXiv, vol. abs/1806.01261, 2018.
 
[14] J. Bruna, W. Zaremba, A. Szlam, and Y. LeCun, "Spectral Networks and Locally Connected Networks on Graphs," CoRR, vol. abs/1312.6203, 2013.
 
[15] H. Cai, V. W. Zheng, and K. C.-C. Chang, "A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications," IEEE Transactions on Knowledge and Data Engineering, vol. 30, pp. 1616-1637, 2017.
 
[16] M. Henaff, J. Bruna, and Y. LeCun, "Deep Convolutional Networks on Graph-Structured Data," ArXiv, vol. abs/1506.05163, 2015.
 
[17] X. Ma et al., "A Comprehensive Survey on Graph Anomaly Detection With Deep Learning," IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 12, pp. 12012-12038, 2023, doi: 10.1109/TKDE.2021.3118815.
 
[18] Y. Dou, Z. Liu, L. Sun, Y. Deng, H. Peng, and P. S. Yu, "Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters," presented at the Proceedings of the 29th ACM International Conference on Information & Knowledge Management, Virtual Event, Ireland, 2020. [Online]. Available: https://doi.org/10.1145/3340531.3411903.
 
[19] P. Li, H. Yu, X. Luo, and J. Wu, "LGM-GNN: A Local and Global Aware Memory-Based Graph Neural Network for Fraud Detection," IEEE Transactions on Big Data, vol. 9, no. 4, pp. 1116-1127, 2023, doi: 10.1109/TBDATA.2023.3234529.
 
[20] Q. Zheng and Y. Zhang, "DSTAGCN: Dynamic Spatial-Temporal Adjacent Graph Convolutional Network for Traffic Forecasting," IEEE Transactions on Big Data, vol. 9, no. 1, pp. 241-253, 2023, doi: 10.1109/TBDATA.2022.3156366.
 
[21] Y. Liu et al., "Pick and Choose: A GNN-based Imbalanced Learning Approach for Fraud Detection," presented at the Proceedings of the Web Conference 2021, Ljubljana, Slovenia, 2021. [Online]. Available: https://doi.org/10.1145/3442381.3449989.
 
[22] M. L. G.-. ULB. "Credit Card Fraud Detection." https://www.kaggle.com/datasets/mlgulb/ creditcardfraud (accessed).
 
[23] H. Zhou, L. Wei, G. Chen, P. Lin, and Y. Lin, Credit Card Fraud Identification Based on Principal Component Analysis and Improved Adaboost Algorithm. 2019, pp. 507-510.
 
[24] WDong, M. Charikar, and K. Li, "Efficient k-nearestnighbor graph construction for generic similarity measures," in The Web Conference, 2011.
 
[25] "Pyhon machine learning library based on torch." https://pytorch.org/ (accessed).
 
[26] S.Ioffe and C. Szegedy, "Batch normalization: acceleratng deep network training by reducing internal covariate shift," presented at the Proceedings of the 32nd International Conference on International Conference on Machine Learning - Volume 37, Lille, France, 2015.
 
[27] E. ichardson, R. Trevizani, J. A. Greenbaum, H. Carter, . Nielsen, and B. Peters, "The receiver operating characteristic curve accurately assesses imbalanced datasets," Patterns, vol. 5, no. 6, p. 100994, 2024/06/14/ 2024, doi: https://doi.org/10.1016/j.patter.2024.100994.