Natural-Language-Processing

Toxic Comments Classification Challenge

Big Data and Brews

July 7, 2018

In the social media era, many businesses are building user interaction and discussion into their platforms. However, a minority of users abuse these features to threaten, insult, or otherwise create a toxic atmosphere.

We aim to identify these toxic comments without censoring the user population as a whole.

We investigate ~160,000 comments: about 89.8% are normal user interaction and 10.2% are toxic. The comments are labeled with the following 6 categories:

Toxic
Severely toxic
Obscene
Insult
Threat
Identity hate

Many toxic comments carry more than one label. The labels were assigned by human raters.
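
The label structure described above (six binary flags per comment, with several allowed at once) can be checked with a few lines of Python. This is a minimal sketch rather than the notebook's own code: it assumes the Kaggle training file uses the column names `toxic`, `severe_toxic`, `obscene`, `threat`, `insult`, and `identity_hate`, and the helper `summarize_labels` and the four sample rows are made up for illustration.

```python
import csv
import io

# Column names assumed to match the header of the Kaggle training file.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


def summarize_labels(rows):
    """Count total, flagged, and multi-label comments in an iterable of row dicts."""
    total = any_label = multi_label = 0
    per_label = {label: 0 for label in LABELS}
    for row in rows:
        flags = [int(row[label]) for label in LABELS]
        total += 1
        if sum(flags) >= 1:
            any_label += 1      # comment has at least one toxic label
        if sum(flags) >= 2:
            multi_label += 1    # comment has more than one label
        for label, flag in zip(LABELS, flags):
            per_label[label] += flag
    return {
        "total": total,
        "any_label": any_label,
        "multi_label": multi_label,
        "per_label": per_label,
    }


# Tiny made-up sample standing in for the real file (not real data).
SAMPLE_CSV = """id,comment_text,toxic,severe_toxic,obscene,threat,insult,identity_hate
a1,placeholder clean comment,0,0,0,0,0,0
a2,placeholder rude comment,1,0,1,0,1,0
a3,placeholder hostile comment,1,0,0,0,0,0
a4,placeholder clean comment,0,0,0,0,0,0
"""

summary = summarize_labels(csv.DictReader(io.StringIO(SAMPLE_CSV)))
print(summary)
```

To run the same summary on the downloaded data, pass `csv.DictReader(open("train.csv", newline="", encoding="utf-8"))` instead of the sample, assuming the file is saved under that name. Dividing `any_label` by `total` gives the toxic share quoted above.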

A link to the challenge can be found here: https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data

Jackie789/Natural-Language-Processing
