simple-tokenizer

An example of developing a service as a Python package.

Installation

pip install simpletokenizer

Usage

>>> import simpletokenizer
>>> simpletokenizer.tokenize("the fox jumps over the lazy dog")
['the', 'fox', 'jumps', 'over', 'the', 'lazy', 'dog']
>>> simpletokenizer.count_tokens("the fox jumps over the lazy dog")
7
>>> simpletokenizer.get_unique_words("the fox jumps over the lazy dog")
['fox', 'jumps', 'over', 'the', 'dog', 'lazy']
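
For readers curious about what these three calls might do internally, here is a minimal sketch. It assumes whitespace splitting and first-seen ordering for unique words; neither is confirmed by this README, and the package's real implementation may differ. The order shown above for `get_unique_words` suggests the package may use an unordered set, so the order here will not necessarily match.

```python
# Hypothetical stand-ins for the functions shown in Usage.
# Assumption: tokens are whitespace-separated; the real package may differ.

def tokenize(text):
    """Split text into tokens on runs of whitespace."""
    return text.split()


def count_tokens(text):
    """Return the number of tokens, counting duplicates."""
    return len(tokenize(text))


def get_unique_words(text):
    """Return each distinct token once, in first-seen order.

    dict.fromkeys keeps insertion order, which makes the result
    deterministic. This ordering is an assumption of the sketch.
    """
    return list(dict.fromkeys(tokenize(text)))


if __name__ == "__main__":
    sentence = "the fox jumps over the lazy dog"
    print(tokenize(sentence))
    print(count_tokens(sentence))
    print(get_unique_words(sentence))
```

Using `dict.fromkeys` instead of `set` is a deliberate choice in this sketch: it keeps the output stable between runs, which is easier to test against.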