Final Project for CDT 30380

Abby Gervase

Building the Corpus

The script that makes use of the Reddit API is located in the scraper folder. If the proper information for a Reddit API project is given (username, password, etc) then it will scrape the responses for r/WritingPrompts and store them in data/texts.

The Corpus

The corpus is located in data/texts. The naming convention for the text files is prompt{prompt number}_{response number}_{final score}.txt.

Text Analysis

The Jupyter Notebooks in which I performed the actual text analysis are in the exercises folder.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data/texts		data/texts
exercises		exercises
libraries		libraries
scraper		scraper
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Final Project for CDT 30380

Abby Gervase

Building the Corpus

The Corpus

Text Analysis

About

Releases

Packages

Languages

agervase/textmining_finalproject

Folders and files

Latest commit

History

Repository files navigation

Final Project for CDT 30380

Abby Gervase

Building the Corpus

The Corpus

Text Analysis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages