Comparative study of data pre-processing techniques for enhancing fake review detection: a novel pipeline approach

Quyyam, Tayybaha; Yu, Qicheng

London Met Repository

Lists

Tools

Quyyam, Tayybaha and Yu, Qicheng (2025) Comparative study of data pre-processing techniques for enhancing fake review detection: a novel pipeline approach. In: 13th International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA-2025) June 06 - 07, 2025, 6-7 June 2025, London Metropolitan University, London (UK) / Online. (In Press)

Abstract
Documents
Details
Record

[+][-]

Abstract

The rise of online reviews has significantly influenced consumer purchasing decisions, but it has also led to an increase in fraudulent reviews that can artificially boost or tarnish a business's reputation, particularly affecting small businesses. To combat this, we propose a novel pipeline for fake review detection that integrates text data, handcrafted features, and rating-category features to enhance robustness. Our pipeline includes innovative components such as a Context Aware Preprocessor, Fake Review Feature Adder, Category Rating Embedding, Tokenizer Padding, Adaptive Fusion Layer, and Trust Aware Berta Classifier. Applied to the Amazon dataset, our BERT model with the Adaptive Fusion Layer achieves an AUC-ROC score of 0.96, demonstrating its effectiveness in detecting fake reviews. This research underscores the potential of advanced NLP techniques to maintain the authenticity of online reviews, thereby protecting both businesses and consumers from the negative impacts of fraudulent reviews.

Documents

10377:52625

[+][-]

10377:52625

[thumbnail of Comparative Study of Data Pre-Processing Techniques for Enhancing Fake Review Detection.pdf]

Comparative Study of Data Pre-Processing Techniques for Enhancing Fake Review Detection.pdf - Accepted Version
Restricted to Repository staff only until 1 May 2026.

Download (409kB) | Request a copy

Details

Title:

Comparative study of data pre-processing techniques for enhancing fake review detection: a novel pipeline approach

Creators:

Quyyam, Tayybaha and Yu, Qicheng

Date:

21 April 2025

Subjects:

000 Computer science, information & general works
000 Computer science, information & general works > 020 Library & information sciences

Department:

School of Computing and Digital Media

Uncontrolled Keywords:

Natural language processing, Fake reviews, Text pre-processing, Amazon, Machine learning, Deep Learning, Feature extraction

Record

URI:

https://repository.londonmet.ac.uk/id/eprint/10377

Item Type:

Conference or Workshop Item

Presentation Type:

Paper

Depositing User:

Qicheng Yu

Date Deposited:

01 May 2025 08:33

Revision:

Last Modified:

01 May 2025 08:33

View Item

CORE (COnnecting REpositories)

London Met Repository London Met Repository London Met Repository

Comparative study of data pre-processing techniques for enhancing fake review detection: a novel pipeline approach

London Met Repository