Machine Learning for Fraud Detection in Insurance Claims using Time-Series Anomaly Detection
DOI:
https://doi.org/10.63282/3050-922X.IJERET-V5I4P112Keywords:
Fraud Detection, Insurance Claims, Machine Learning, Time-Series Analysis, Anomaly Detection, LSTM Autoencoder, Isolation Forest, Predictive Modeling, Financial Crime, Unsupervised LearningAbstract
Insurance fraud is a significant challenge that continues to grow before the eyes of the insurers, causing disruptions in the scale of billions of dollars a year. Rule-based traditional systems are unable to detect customized evolving and rather complex fraudulent patterns, especially with a massive dataset. This paper aims at giving attention to the machine learning (ML) approach to investigating fraud detection in insurance claims from the viewpoint of time-series anomalies. By operating on the temporal dimension of claim filing, time-series ML models build the intervening link that leads from normal claim behavior to temporary extensions in deviation, which mark the spotlight of improbability_cont_<LSTM Auto encoders, Isolation Forests, Temporal Convolutional Networks>. We have used real and synthetic datasets to compare algorithms for anomaly detection in various criteria: accuracy, precision, recall, and AUC_ROC. Results suggest that time-series-based models are uniquely suited to detecting dynamic and complex forms of fraud, especially in the form of late reporting, charge inflation, and claim bursts, whereas traditional classifiers fail. We also look at how unsupervised learning can be a serious contender in instances where the label data is scant or non-existent. With respect to the existing literature, this work proposes an end-to-end framework to integrate the ML anomaly detection into workflows for insurance fraud detection. The work ends with a discussion of interpretability issues, deployment challenges, and ethical considerations for algorithmic decision-making in financial services
References
[1] Ahmed, M., Mahmood, A. N., & Hu, J. (2016). A survey of network anomaly detection techniques. Journal of Network and Computer Applications, 60, 19–31. https://doi.org/10.1016/j.jnca.2015.11.016
[2] Bauder, R. A., & Khoshgoftaar, T. M. (2018). A survey of data sampling and class imbalance in fraud detection. Journal of Big Data, 5(1), 1–22. https://doi.org/10.1186/s40537-018-0141-6
[3] Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly Detection: A Survey. ACM Computing Surveys.
[4] Coalition Against Insurance Fraud. (2022). Annual fraud report: Fraud stats and prevention trends. https://insurancefraud.org
[5] Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
[6] Sakurada, M., & Yairi, T. (2014). Anomaly Detection Using Autoencoders with Nonlinear Dimensionality Reduction. Proceedings of the MLSDA 2014 Workshop.
[7] Rautaray, S., & Tayagi, D. (2023). Artificial Intelligence in Telecommunications: Applications, Risks, and Governance in the 5G and Beyond Era. Artificial Intelligence
[8] Lo, A. W. (2017). Adaptive markets: Financial evolution at the speed of thought. Princeton University Press.
[9] Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), 1–35. https://doi.org/10.1145/3457607
[10] CT Aghaunor. (2023). From Data to Decisions: Harnessing AI and Analytics. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(3), 76-84. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I3P109
[11] Malhotra, P., Ramakrishnan, A., Anand, G., Vig, L., Agarwal, P., & Shroff, G. (2016). LSTM-based Encoder–Decoder for Multi-sensor Anomaly Detection. arXiv:1607.00148.
[12] T Anthony. (2021). AI Models for Real Time Risk Assessment in Decentralized Finance. Annals of Applied Sciences, 2(1). Retrieved from https://annalsofappliedsciences.com/index.php/aas/article/view/30
[13] Wu, H., Xu, J., Wang, J., & Long, M. (2021). Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Advances in Neural Information Processing Systems, 34, 22419–22430.
[14] S Mishra, and A Jain, “Leveraging IoT-Driven Customer Intelligence for Adaptive Financial Services”, IJAIDSML, vol. 4, no. 3, pp. 60–71, Oct. 2023, doi: 10.63282/3050-9262.IJAIDSML-V4I3P107
[15] Zerveas, G., Jayaraman, S., Patel, D., Bhamidipaty, A., & Eickhoff, C. (2021). A transformer-based framework for multivariate time series representation learning. Proceedings of the ACM SIGKDD, 2114–2124. https://doi.org/10.1145/3447548.3467401
[16] Zhang, C., Song, D., Chen, Y., Feng, X., Lumezanu, C., Cheng, W., Ni, J., & Chawla, N. V. (2020). A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. AAAI Conference on Artificial Intelligence, 34(4), 443–450.
[17] Laxman doddipatla, & Sai Teja Sharma R.(2023). The Role of AI and Machine Learning in Strengthening Digital Wallet Security against Fraud. Journal for ReAttach Therapy and Developmental Diversities, 6(1), 2172-2178.
[18] Maiano, L., et al. (2023). A deep-learning–based antifraud system for car-insurance claims. Expert Systems with Applications.
[19] Asgarian, A., Saha, R., Jakubovitz, D., & Peyre, J. (2023). AutoFraudNet: A Multimodal Network to Detect Fraud in the Auto Insurance Industry. arXiv:2301.07526.
[20] B Naticchia, “Unified Framework of Blockchain and AI for Business Intelligence in Modern Banking ”, IJERET, vol. 3, no. 4, pp. 32–42, Dec. 2022, doi: 10.63282/3050-922X.IJERET-V3I4P105
[21] Viaene, S., Derrig, R. A., Baesens, B., & Dedene, G. (2002). A Comparison of State-of-the-Art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection. Journal of Risk and Insurance, 69(3), 373–421
[22] Vajiram, J., Senthil, N., & Adhith, P. N. (2023). Correlating Medi-Claim Service by Deep Learning Neural Networks. arXiv:2308.04469.
[23] R. R. Yerram, "Risk management in foreign exchange for crossborder payments:Strategies for minimizing exposure," Turkish Online Journal of Qualitative Inquiry, pp. 892-900, 2020.
[24] JB Lowe, Financial Security and Transparency with Blockchain Solutions (May 01, 2021). Turkish Online Journal of Qualitative Inquiry, 2021[10.53555/w60q8320], Available at SSRN: https://ssrn.com/abstract=5339013 or http://dx.doi.org/10.53555/w60q8320http://dx.doi.org/10.53555/w60q8320
[25] Breunig, M. M., Kriegel, H. P., Ng, R. T., & Sander, J. (2000). LOF: Identifying density-based local outliers. ACM SIGMOD Record, 29(2), 93–104. https://doi.org/10.1145/335191.335388
[26] D Alexander.(2022). EMERGING TRENDS IN FINTECH: HOW TECHNOLOGY IS RESHAPING THE GLOBAL FINANCIAL LANDSCAPE. Journal of Population Therapeutics and Clinical Pharmacology, 29(02), 573-580.
[27] Scholkopf, B., Platt, J., Shawe-Taylor, J., Smola, A. J., & Williamson, R. C. (2001). Estimating the support of a high-dimensional distribution. Neural Computation, 13(7), 1443–1471. https://doi.org/10.1162/089976601750264965
[28] Tian, Y., & Liu, G. (2023). Transaction Fraud Detection via Spatial-Temporal-Aware Graph Transformer (STA-GT). arXiv:2307.05121.
[29] AB Dorothy. GREEN FINTECH AND ITS INFLUENCE ON SUSTAINABLE FINANCIAL PRACTICES. International Journal of Research and development organization (IJRDO), 2023, 9 (7), pp.1-9. ⟨10.53555/bm.v9i7.6393⟩. ⟨hal-05215332⟩
[30] Yang, C., et al. (2023). DDMT: Denoising Diffusion Mask Transformer Models for Multivariate Time Series Anomaly Detection. arXiv:2310.08800.
[31] Badawi, S. A., Guessoum, D., Elbadawi, I., & Albadawi, A. (2022). A novel time-series transformation and machine-learning-based method for NTL fraud detection in utility companies. Mathematics, 10(12), 2051. https://doi.org/10.3390/math10122051
[32] K Peter. (2022). Multi-Modal GANs for Real-Time Anomaly Detection in Machine and Financial Activity Streams. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(1), 39-48. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I1P105
[33] Fursov, I., Kovtun, E., Rivera-Castro, R., Zaytsev, A., Zaytsev, M., & Nikolenko, S. (2022). Sequence embeddings help detect insurance fraud. In 2022 IEEE International Conference on Big Data (Big Data) (pp. 5277–5284). IEEE. https://doi.org/10.1109/BigData55660.2022.10021116
[34] Hemalatha Naga Himabindu, Gurajada. (2022). Unlocking Insights: The Power of Data Science and AI in Data Visualization. International Journal of Computer Science and Information Technology Research (IJCSITR), 3(1), 154-179. https://doi.org/10.63530/IJCSITR_2022_03_01_016
[35] Lu, J., Fung, B. C. M., & Cheung, W. K. (2020). Embedding for anomaly detection on health insurance claims. In 2020 IEEE International Conference on Data Science and Advanced Analytics (DSAA) (pp. 537–546). IEEE. https://doi.org/10.1109/DSAA49011.2020.00073