Bangla NLP dataset. This repository contains sbnltk datasets, which were used in Bangla nlp toolkit - sbnltk . Also , Existing Datasets are being listed here!
OUR DATASET IS IN LFS MODE! SO YOU HAVE TO CLONE IT FOR GETTING DATA!
WE WILL SOON UPLOAD ALL DEEP LEARNING BASED DATASETS!
- Bangla Number List drive
- Bangla root word List drive
- Bangla Word List (highest to lowest occurrence) drive
- Bangla Wiki Dump word drive
- Bangla POStag static dataset(single word) drive
- Bangla NER Static Dataset(single word) drive
- Bangla Stop word list drive
- Bangla Dump Pos tag drive
- Bangla Dump question Classification Dataset drive
- Bangla Dump Sentiment Analysis drive
- Google Translation Dataset drive
- NER Existing Dataset(Modified + adding Date entity) drive
- News Article Dataset drive
- POS tag converted Data drive
- POS tag human evaluated Data drive
- DUMP NER data (active and passive both) drive
- DUMP NER data(active only) drive
- Extractive Text Summarization github
- Abstractive Text Summarization(newspaper) drive kaggle
- News Article Classification(text Classification) drive kaggle
- Topic Keywords classfication(keywords generator) drive kaggle
- Text Summarization paper cite
I am not the owner of these following datasets. It's just a collection to find amazing peoples and their works Please give them support! Your support will encourage them to do more amazing things.
- wiki Articles
- Bangladesh Protidin News paper
- Bangla NewsPaper dataset
- Banlga Largest Dataset
- 40k News Article Dataset
- All types of Wikipedia Articles
- bdNews24 largest dataset
- Bengali Text to Speech Dataset
- Bengali Automatic Speech Recognition Dataset
- Numta Handwritten Bengali Digits
- Social media Comments
- Sentiment Analysis
- News article classification
- Bangla Drama review Dataset
- Bengali News Comments
- News Headline Classification
- Bangla Newspaper article classification-Big
- Bangla youtube sentiment/emotion dataset for classification
- Multilingual dataset for sentiment 81 language - low size
- Twitter Dataset
- Articles Summarization - extractive - abstractive
- BANSData: A Dataset for Bengali Abstractive News Summarization
- Articles - 3 human evaluated summary