Degree Of Profanity Computation For Twitter Tweets

Problem Statement

Imagine there is a file full of Twitter tweets by various users and you are provided a set of words that indicates racial slurs. Write a program that can indicate the degree of profanity for each sentence in the file. Write in any programming language (preferably in Python) - make any assumptions, but remember to state them.

Assumptions Made

i. Imagining that a dataframe df_tweets has been prepared out of the said twitter's tweets file in which there are two columns -- USERS containing user names and TWEETS containing the repective tweets. And the objective is to compute the degree of profanity for each tweet made by a certain user.

ii. racial_slurs is a list where each element represents a profane word in its base (lemmatized) form.

iii. Additionaly, I'm presuming that before degree of profanity is to be computed for a tweet, the tweet has to be cleaned -- removal of stop words accompanied by the lemmatization of the word tokens, for the least part.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Degree Of Profanity Computation For Twitter Tweets

Problem Statement

Assumptions Made

About

Releases

Packages

Languages

princebhatt9588/Degree-Of-Profanity--Computation-for-Twitter-Tweets

Folders and files

Latest commit

History

Repository files navigation

Degree Of Profanity Computation For Twitter Tweets

Problem Statement

Assumptions Made

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages