Machine learning approach to identity resolution for criminal profiling

Kazemian, Hassan and Shrestha, Subeksha (2025) Machine learning approach to identity resolution for criminal profiling. Journal of Cyber Security Technology. pp. 1-29. ISSN 2374-2917

Abstract

A common dilemma when working on criminal data is that often people manipulate their details to disguise themselves and hide their identities which leads to creating ambiguous and false identities. Deep Neural Network (DNN) is applied to work well on fraudulent and imbalanced data. Two subcategories of DNN, the Multilayer Perceptron (MLP) and Long Short-Term Memory (LSTM) with Python libraries such as TensorFlow and Keras are applied to the dataset for the detection of suspects with false identities to assist the process of analytical investigation by law enforcement agencies. Upon application of this approach to the anonymized policing dataset from SPIRIT Project funded by European Union’s Horizon, 5 main suspects with false identities were identified out of 23 targets on 39 million records. As working on criminal data is quite sensitive so to avoid data leakage while training and testing the data, K-fold validation has been applied. Furthermore, cascading both MLP and LSTM models on the policing dataset resulted in improved model prediction and accuracy compared to using each model individually. The cascaded model notably reduced false predictions for the recurring criminal patterns such as age-specific trends and most prevalent crime activities.

Documents
10274:52165
[thumbnail of ML-Approach-to-ID-Resolution-for-Criminal-Profiling_March2025.pdf]
ML-Approach-to-ID-Resolution-for-Criminal-Profiling_March2025.pdf - Accepted Version
Restricted to Repository staff only until 24 March 2026.
Available under License Creative Commons Attribution Non-commercial 4.0.

Download (1MB) | Request a copy
Details
Record
View Item View Item