Skip to content
This repository has been archived by the owner on Sep 25, 2023. It is now read-only.

Latest commit

 

History

History
93 lines (69 loc) · 4.43 KB

README.md

File metadata and controls

93 lines (69 loc) · 4.43 KB

Polyglot

Build Status Build status Coverage Status License Pharo version Pharo version Pharo version

This repository is marked as a public archive and it will be deleted in the future. It contains duplicated code that it can be found in the other pharo-ai repositories. We encourage you to look into the other NLP repositories that we have inside pharo-ai.

A library for Natural Language Processing implemented in Pharo. To get more information, check out the Polyglot Booklet.

Installation

To install Polyglot, go to the Playground (Ctrl+OW) in your fresh Pharo image and execute the following Metacello script (select it and press Do-it button or Ctrl+D):

Metacello new
  baseline: 'Polyglot';
  repository: 'github://PolyMathOrg/Polyglot/src';
  load.

List of Supported Features

  • Tokenization
  • N-grams
  • Term Frequency-Inverse Document Frequency Scoring
  • N-Gram Language Modelling
  • Stemming
  • Part of Speech Tagging
  • Named Entity Recognizer
  • Dependency Parser
  • Modified Atlas Bridge
  • Common Vector Metrics

Google Summer of Code 2019 Report

Author: Nikhil Pinnaparaju

Organisation: Pharo

Project: Polyglot

Mentors: Oleksandr Zaitsev, Alexandre Bergel

A library for Natural Language Processing implemented in Pharo.

Features Implemented

  • Tokenization
  • N-grams
  • Term Frequency-Inverse Document Frequency Scoring
  • N-Gram Language Modelling
  • Stemming
  • Part of Speech Tagging
  • Named Entity Recognizer
  • Dependency Parser
  • Modified Atlas Bridge
  • Common Vector Metrics

Code Contribution

Documentation

Blog Posts

Booklets

Project Demonstration/Presentation