Skip to content

NLP Related

Dorai Thodla edited this page Sep 17, 2021 · 2 revisions

The Main Approaches to Natural Language Processing Tasks

Tags: Machine Learning, Neural Networks, NLP, Text Classification

Let's have a look at the main approaches to NLP tasks that we have at our disposal. We will then have a look at the concrete NLP tasks we can tackle with said approaches.

http://bit.ly/2EDx9iv

A Framework for Approaching Textual Data Science Tasks

Although NLP and text mining are not the same thing, they are closely related, deal with the same raw data type, and have some crossover in their uses. Let's discuss the steps in approaching these types of tasks.

http://bit.ly/2K1aOJi

Tags: Modeling, Natural Language Processing, NLP, Text Analytics, Text Mining

Application of BERT : Sentence semantic similarity

https://bit.ly/2XtMqfW

How to Create a Vocabulary for NLP Tasks in Python - KDnuggets

When performing a natural language processing task, our text data transformation proceeds more or less in this manner:

raw text corpus → processed text → tokenized text → corpus vocabulary → text representation

Keep in mind that this all happens prior to the actual NLP task even beginning.

The corpus vocabulary is a holding area for processed text before it is transformed into some representation for the impending task, be it classification, or language modeling, or something else.

The vocabulary serves a few primary purposes:

  • help in the preprocessing of the corpus text
  • serve as storage location in memory for processed text corpus
  • collect and store metadata about the corpus
  • allow for pre-task munging, exploration, and experimentation

https://bit.ly/2VMxkS2

Data Representation for Natural Language Processing Tasks

https://bit.ly/3tP6KEh

Tags: NLP, Representation, Text Mining, Word Embeddings, word2vec