document-term-matrix

Here are 12 public repositories matching this topic...

volkantunali / preto

PRETO: A High-performance Text Mining Tool for Preprocessing Turkish Texts

text-mining preprocessing document-term-matrix turkish-nlp zemberek-library

Updated May 30, 2019
Java

akshay-madar / nlp-for-standup-comedy-analysis

Applies NLP techniques such as sentiment analysis and topic modeling to analyze stand-up comedians, using LDA, lemmatization, Markov models, and more.

nlp sentiment-analysis markov-chain corpus text-generation comedy topic-modeling standup lda tokenization lemmatization textblob document-term-matrix

Updated Apr 18, 2020
Jupyter Notebook

zuzannapiekarczyk / Tidytext

A project exploring the tidytext library, created for a college text mining class.

r tidyverse rmarkdown tidy rmd tidytext textmining document-term-matrix

Updated May 6, 2023
HTML

mehmet-kozan / polarity-rate

Text miner and polarity rater that scores text between -100% and +100%.

nlp unicode sentiment-analysis afinn document-term-matrix polarity-score

Updated Oct 20, 2022
JavaScript

yobahBertrandYonkou / music_sentiment_analysis

This project builds a sentiment analysis model for music lyrics using R and R Shiny. We collect and label lyrics from Spotify and Genius APIs, clean the data, and use a Naive Bayes classifier with TF-IDF features. The model is deployed via R Shiny for interactive sentiment classification.

sentiment-analysis spotify-api r-shiny data-cleaning r-programming genius-api document-term-matrix music-lyrics

Updated Aug 12, 2024
Jupyter Notebook
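
The description above mentions TF-IDF features, which are usually derived from a document-term matrix. The following is a minimal sketch in plain Python of how such a matrix and one common TF-IDF weighting can be computed. It is not taken from the repository (which uses R); the function names `document_term_matrix` and `tf_idf`, the whitespace tokenization, and the `log(N / df)` weighting are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def document_term_matrix(docs):
    """Return (vocab, matrix), where matrix[i][j] counts term vocab[j] in docs[i].

    Tokenization is a naive lowercase whitespace split (an assumption for
    this sketch; real pipelines usually clean and normalize text first).
    """
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    matrix = []
    for toks in tokenized:
        counts = Counter(toks)
        matrix.append([counts[term] for term in vocab])
    return vocab, matrix


def tf_idf(matrix):
    """Weight raw counts by idf = log(N / df), one of several common variants."""
    n_docs = len(matrix)
    n_terms = len(matrix[0]) if matrix else 0
    # Document frequency: number of documents containing each term.
    df = [sum(1 for row in matrix if row[j] > 0) for j in range(n_terms)]
    return [
        [row[j] * math.log(n_docs / df[j]) for j in range(n_terms)]
        for row in matrix
    ]


# Hypothetical toy corpus standing in for labeled lyrics.
lyrics = ["love love song", "sad song"]
vocab, dtm = document_term_matrix(lyrics)
weights = tf_idf(dtm)
```

With this weighting, a term that appears in every document (here `song`) gets weight zero, while terms concentrated in few documents are emphasized. A classifier such as Naive Bayes would then be trained on rows of `weights`.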

crfmc / similarity-measures

Python scripts that calculate three basic similarity measures suitable for ad hoc information retrieval systems: Levenshtein edit distance, Jaccard similarity, and a term-document matrix.

information-retrieval jaccard-similarity levenshtein-algorithm edit-distance-algorithm document-term-matrix