Ruby port of the NLTK Punkt sentence segmentation algorithm
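The core idea behind Punkt is that a period only ends a sentence when the token before it is not an abbreviation (Punkt learns its abbreviation list unsupervised from the corpus). A minimal, self-contained Python sketch of that idea, with a hard-coded abbreviation set standing in for the learned one:

```python
import re

# Hypothetical abbreviation set; the real Punkt algorithm learns this
# list from unannotated text rather than hard-coding it.
ABBREVIATIONS = {"dr", "mr", "mrs", "e.g", "i.e", "etc"}

def split_sentences(text):
    """Split text on ., !, ? unless the period follows a known abbreviation."""
    sentences, start = [], 0
    # Only consider terminators followed by whitespace or end of string.
    for match in re.finditer(r"[.!?](?=\s|$)", text):
        end = match.end()
        preceding = text[start:end - 1].split()
        last = preceding[-1].lower().rstrip(".") if preceding else ""
        if match.group() == "." and last in ABBREVIATIONS:
            continue  # period belongs to an abbreviation, keep scanning
        sentences.append(text[start:end].strip())
        start = end
    if text[start:].strip():
        sentences.append(text[start:].strip())
    return sentences

print(split_sentences("Mr. Smith went home. He slept!"))
```

This is only an illustration of the disambiguation step, not the full algorithm, which also handles ordinals, initials, and collocation statistics.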
Updated Jun 10, 2018 · Ruby
TUFS Asian Language Parallel Corpus
A PHP library to extract n-grams from text. Simple preprocessing tools (cleaning, tokenizing) are included.
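N-gram extraction amounts to sliding a window of length n over a preprocessed token list. A minimal Python sketch of the same pipeline (the library above is PHP; this is an independent illustration, not its API):

```python
import re
from collections import Counter

def extract_ngrams(text, n=2):
    # Simple preprocessing: lowercase, keep word characters, tokenize.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    # Slide a window of length n over the token list.
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Counting the extracted n-grams gives a frequency table.
counts = Counter(extract_ngrams("the cat sat on the mat", n=2))
print(counts.most_common(3))
```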
At the same time, flames (such as rants, taunts, and squalid phrases) are offensive or abusive phrases that might attack or offend users for a variety of reasons. An automatic classifier with a sensitivity parameter for detecting flames or abusive language would be a useful tool.
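One way such a sensitivity parameter can work is by scaling the decision threshold of a simple lexicon-based score. The sketch below is a hypothetical baseline, not the method of any particular project; the lexicon and threshold formula are assumptions for illustration:

```python
# Hypothetical flame lexicon; a real system would use a much larger,
# curated word list or a trained classifier.
FLAME_LEXICON = {"idiot", "stupid", "moron", "trash"}

def is_flame(text, sensitivity=0.5):
    """Flag text whose flame-word ratio exceeds a sensitivity-scaled threshold.

    sensitivity in (0, 1]: higher values flag text more aggressively.
    """
    tokens = [t.strip(".,!?").lower() for t in text.split()]
    if not tokens:
        return False
    flame_ratio = sum(t in FLAME_LEXICON for t in tokens) / len(tokens)
    # At sensitivity 1.0 any flame word triggers a flag; at 0.5 at least
    # a quarter of the tokens must be flame words.
    return flame_ratio > 0.5 * (1 - sensitivity)

print(is_flame("you are a stupid idiot", sensitivity=0.5))
```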
Deep Learning Nanodegree project: generate Simpsons TV scripts using part of the Simpsons dataset of scripts from 27 seasons.