Tasks and projects carried out as part of the course "Qualitative data analysis and text mining".
Topics covered during the classes by branch:
lab_1
"Regular Expression" - building regular expressions usingre
librarylab_2
"Stemming and Lemmatization" - cleaning and processing of selected textlab_3
"WordCloud" - building word cloud from csv filelab_4
"Tokenization and vectorization of text"lab_5
"Term-Document Matrix" - operations on matrixlab_6
"Visualizations" - visualization based on matrix operations (bar charts, prettytable)classification
"Classification" - simple news classificationentity_matching
"Distance and similarity between documents"kolokwium
- testproject
"Final project" - analysis and classification of coronavirus tweets