GitHub - danort92/Social-Media-Toxic-Comment-Filter: Analysis of several Deep Learning algorithms performance to define the best one as anti hater filter for social media

Social Media Toxic Comment Filter

The model can filter user comments based on the degree of harmfulness of the language through these steps:

Preprocess the text by eliminating the set of tokens that do not contribute significantly to the semantic level;
Transform the text corpus into sequences;
Build a deep learning model including recurring layers for a multilabel classification task;
In prediction time, the model must return a vector containing a 1 or a 0 in correspondence with each label present in the dataset (toxic, severe_toxic, obscene, threat, insult, identity_hate).

The used dataset is heavily unbalanced (the vast majority of comments is non-toxic, so different cases are studied to rebalance and boost.

For each case the Deep Learning model hyperparameters do not substancially change (just small adjustments) and is characterized by: Embedding, Bidirectional, TimeDistributed, Flatten Dense Droput and Dense layers.

The analyzed cases are the following:

Downsampling non-toxic train dataset;
Oversampling toxic train dataset;
Label sensitive oversampling toxic train dataset;
Words Embedding;
Oversampling oxic train dataset plus Words Embedding;
Label sensitive oversampling toxic train dataset plus Words Embedding.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
Toxic_Comments_Filter.ipynb		Toxic_Comments_Filter.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Social Media Toxic Comment Filter

About

Releases

Packages

Languages

danort92/Social-Media-Toxic-Comment-Filter

Folders and files

Latest commit

History

Repository files navigation

Social Media Toxic Comment Filter

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages