Text mining classes

Tasks and projects carried out as part of the course "Qualitative data analysis and text mining".

Topics covered during the classes by branch:

  • lab_1 "Regular Expression" - building regular expressions using the re library
  • lab_2 "Stemming and Lemmatization" - cleaning and processing of selected text
  • lab_3 "WordCloud" - building a word cloud from a CSV file
  • lab_4 "Tokenization and vectorization of text"
  • lab_5 "Term-Document Matrix" - operations on matrix
  • lab_6 "Visualizations" - visualization based on matrix operations (bar charts, prettytable)
  • classification "Classification" - simple news classification
  • entity_matching "Distance and similarity between documents"
  • kolokwium - mid-term test
  • project "Final project" - analysis and classification of coronavirus tweets
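
To give a rough idea of how several of these topics fit together (regex tokenization, a term-document matrix, and similarity between documents), here is a minimal sketch that uses only the Python standard library. It is not taken from any branch of this repository: the sample sentences and the helper names `tokenize`, `term_document_matrix`, and `cosine_similarity` are assumptions made for illustration, and the labs themselves may rely on other libraries.

```python
import math
import re
from collections import Counter

# Hypothetical sample documents; not taken from the course materials.
DOCS = [
    "The cat sat on the mat.",
    "The dog sat on the log.",
    "Stock markets fell sharply today.",
]


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_document_matrix(docs):
    """Return (sorted vocabulary, raw term counts).

    Each row holds the counts for one document; transpose the result
    if terms are wanted as rows instead.
    """
    counts = [Counter(tokenize(doc)) for doc in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocab] for c in counts]
    return vocab, matrix


def cosine_similarity(u, v):
    """Cosine of the angle between two count vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


vocab, matrix = term_document_matrix(DOCS)
sim_cat_dog = cosine_similarity(matrix[0], matrix[1])    # documents sharing several words
sim_cat_stock = cosine_similarity(matrix[0], matrix[2])  # documents sharing no words
```

Raw counts keep the sketch short; a weighting scheme such as TF-IDF or a stop-word filter could be swapped in without changing the overall shape.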