Text Analytics

Language Modeling

We build bigram, trigram and linearly interpolated language models, and use them for text generation and spelling correction.
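To illustrate how linear interpolation combines the two n-gram models, here is a minimal sketch. It is not the repository's implementation: the class name `InterpolatedLM`, the padding tokens, the add-alpha smoothing and the default weight `lam=0.7` are all assumptions chosen for the example.

```python
from collections import Counter

START, END = "<s>", "</s>"  # hypothetical padding tokens


class InterpolatedLM:
    """Trigram/bigram model mixed by linear interpolation (illustrative only)."""

    def __init__(self, sentences, lam=0.7, alpha=1.0):
        self.lam = lam      # weight of the trigram estimate (assumed value)
        self.alpha = alpha  # add-alpha smoothing constant (assumed value)
        self.bi, self.tri = Counter(), Counter()
        self.bi_ctx, self.tri_ctx = Counter(), Counter()
        self.vocab = set()
        for tokens in sentences:
            padded = [START, START] + list(tokens) + [END]
            self.vocab.update(padded[2:])  # predicted symbols, END included
            for i in range(2, len(padded)):
                u, v, w = padded[i - 2], padded[i - 1], padded[i]
                self.bi[(v, w)] += 1
                self.bi_ctx[v] += 1
                self.tri[(u, v, w)] += 1
                self.tri_ctx[(u, v)] += 1

    def bigram_prob(self, v, w):
        # Smoothed estimate of P(w | v)
        size = len(self.vocab)
        return (self.bi[(v, w)] + self.alpha) / (self.bi_ctx[v] + self.alpha * size)

    def trigram_prob(self, u, v, w):
        # Smoothed estimate of P(w | u, v)
        size = len(self.vocab)
        return (self.tri[(u, v, w)] + self.alpha) / (self.tri_ctx[(u, v)] + self.alpha * size)

    def prob(self, u, v, w):
        # Linear interpolation: lam * P_tri + (1 - lam) * P_bi
        return self.lam * self.trigram_prob(u, v, w) + (1 - self.lam) * self.bigram_prob(v, w)


corpus = [["the", "cat", "sat"], ["the", "cat", "ran"], ["the", "dog", "sat"]]
lm = InterpolatedLM(corpus)
print(lm.prob(START, "the", "cat"), lm.prob(START, "the", "dog"))
```

Because both component distributions sum to one over the vocabulary and the weights sum to one, the interpolated distribution is also a valid probability distribution. In practice the weight would be tuned on held-out data.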

Source code Report

Sentiment Classification and POS Tagging tasks

We create deep learning models using the Transformers/Datasets, PyTorch and TensorFlow libraries. We also use the keras_tuner / transformers_trainer frameworks to optimize hyperparameters and model architecture.

Additional work carried out for each task:

Sentiment Analysis: Dataset selection, exploratory analysis, custom stopwords, data augmentation.
POS Tagging: Dataset selection, exploratory analysis, custom parsing, custom baseline ("smart dummy") model, local caching of heavy computations, automated results generation (Python -> LaTeX).
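The README does not define the "smart dummy" baseline. One common choice for POS tagging, assumed here purely for illustration, is a most-frequent-tag model: each known word receives the tag it carried most often in training, and unknown words receive the globally most frequent tag. The class name `MostFrequentTagBaseline` and its methods are hypothetical.

```python
from collections import Counter, defaultdict


class MostFrequentTagBaseline:
    """Tags each word with its most frequent training tag (assumed baseline)."""

    def fit(self, tagged_sentences):
        word_tags = defaultdict(Counter)
        all_tags = Counter()
        for sentence in tagged_sentences:
            for word, tag in sentence:
                word_tags[word.lower()][tag] += 1
                all_tags[tag] += 1
        # Most frequent tag per known word
        self.best_tag = {w: c.most_common(1)[0][0] for w, c in word_tags.items()}
        # Fallback for words never seen in training
        self.default_tag = all_tags.most_common(1)[0][0]
        return self

    def predict(self, words):
        return [self.best_tag.get(w.lower(), self.default_tag) for w in words]


train = [
    [("the", "DET"), ("dog", "NOUN"), ("chases", "VERB"), ("cats", "NOUN")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("dogs", "NOUN"), ("run", "VERB")],
]
baseline = MostFrequentTagBaseline().fit(train)
print(baseline.predict(["The", "dog", "barks"]))
```

A baseline of this kind gives the neural models a meaningful floor to beat, since a large share of tokens in most corpora are unambiguous.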

Each task features two IPython notebooks containing the executed code, Python source files for repeated custom tasks, and a unified report.

The reports discuss in detail the design decisions for each classifier and include graphs and aggregated results comparing each model with the previous ones.
