PRETO: A High-performance Text Mining Tool for Preprocessing Turkish Texts
-
Updated
May 30, 2019 - Java
PRETO: A High-performance Text Mining Tool for Preprocessing Turkish Texts
Leveraged NLP techniques such as sentiment analysis and topic modeling to analyze different stand-up comedians using LDA, lemmatization, markov models, etc.
This is a project about the tidytext library created for a college project for a text mining class.
Text miner, polarity rater with results between -100% and +100%
This project builds a sentiment analysis model for music lyrics using R and R Shiny. We collect and label lyrics from Spotify and Genius APIs, clean the data, and use a Naive Bayes classifier with TF-IDF features. The model is deployed via R Shiny for interactive sentiment classification.
Python scripts used to calculate 3 basic similarity measures, suitable for ad hoc information retrieval systems: Levenshtein Edit Distance, Jaccard, and a Term-Document matrix.
A script that automatically infers the topics discussed in a collection of documents.
📈 Regression and Classification with UC Davis student quiz data and exam data
A text analysis project on collection of script dialogue between characters for the episode 4,5,6 of star wars
Importing and analyzing Twitter Data with R. Conencting to Twitter, Importing Data, Cleaning and Analyzing the Tweets.
Make a DocumentTermMatrix faster
Add a description, image, and links to the document-term-matrix topic page so that developers can more easily learn about it.
To associate your repository with the document-term-matrix topic, visit your repo's landing page and select "manage topics."