Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures

Dataset: https://s3.amazonaws.com/fast-ai-nlp/yelp_review_polarity_csv.tgz
https://www.kaggle.com/rtatman/deceptive-opinion-spam-corpus
The data includes 1,569,264 samples from the Yelp Dataset Challenge 2015. This subset has 280,000 training samples and 19,000 test samples in each polarity.

Also, if you happen to refer my work, a citation would do wonders for me. Thanks!

Salunkhe, Ashish. "Attention-based Bidirectional LSTM for Deceptive Opinion Spam Classification." arXiv preprint arXiv:2112.14789 (2021).

The following implementations are done:

Bidirectional LSTM with GLoVE 50D word embeddings
LSTM with GLoVE 100D word embeddings
LSTM with GLoVE 300D word embeddings
CNN-LSTM with Doc2Vec and TF-IDF
Attention mechanism with GLoVe 100D word embeddings
Logistic Regression
Multinomial Naive Bayes
Support Vector Machine - Stochastic Gradient Descent (SGD)

The results obtained were as follows:

Sr. No.	Model Accuracy (%)	Precision Score	Recall Score	F1 Score
1	MultinomialNB	90.25	0.9325	0.8601
2	Stochastic Gradient Descent (SGD)	87.75	0.8913	0.8497
3	Logistic Regression	87.00	0.8691	0.8601
4	Support Vector Machine	56.25	0.525	0.9792
5	Gaussian Naive Bayes	63.5	0.6424	0.6169
6	K-Nearest Neighbour	57.5	0.8604	0.1840
7	Decision tree	68.5	0.6681	0.7412

Model	Training accuracy(%)	Testing accuracy(%)
Bidirectional LSTM + GLoVe(50D)	92.17	88.13
LSTM + GLoVe(100D)	99.18	85.75
CNN + LSTM + Doc2Vec +TF-IDF	96.23	92.19
CNN + Attention + GLoVe(100D)	99.00	90.25
BiLSTM + Attention + GLoVe(100D)	99.18	89.27
CNN + BiLSTM + Attention + GLoVe(100D)	99.75	81.25
LogisticRegression + TF-IDF	99.11	87.21

Future scope includes improvement in the attention layer to increase testing accuracy. BERT and XLNet can be implemented to improve the performance further.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.github		.github
test		test
CNN-LSTM with Doc2Vec plus TF-IDF.ipynb		CNN-LSTM with Doc2Vec plus TF-IDF.ipynb
EDAonYelpReviews.ipynb		EDAonYelpReviews.ipynb
GLoVe-LSTM-Conv1D-DecepOpSpam.ipynb		GLoVe-LSTM-Conv1D-DecepOpSpam.ipynb
NLP using Glove and Spacy .ipynb		NLP using Glove and Spacy .ipynb
NLP_using_Glove_and_Spacy_.ipynb		NLP_using_Glove_and_Spacy_.ipynb
README.md		README.md
attention_biLstm_glove.ipynb		attention_biLstm_glove.ipynb
basicMLmodelsDecepOpSpam (1).ipynb		basicMLmodelsDecepOpSpam (1).ipynb
basicMLmodelsDecepOpSpam.ipynb		basicMLmodelsDecepOpSpam.ipynb
bidirectional-lstm-spacy-on-yelp-reviews.ipynb		bidirectional-lstm-spacy-on-yelp-reviews.ipynb
cnn-lstm-with-doc2vec-plus-tf-idf.ipynb		cnn-lstm-with-doc2vec-plus-tf-idf.ipynb
deceptive_true_review.ipynb		deceptive_true_review.ipynb
model_lstm.ipynb		model_lstm.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures

About

Releases

Sponsor this project

Packages

Languages

ashishsalunkhe/DeepSpamReview-Detection-of-Fake-Reviews-on-Online-Review-Platforms-using-DeepLearning-Architectures

Folders and files

Latest commit

History

Repository files navigation

Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures

About

Topics

Resources

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages