Skip to content

Latest commit

 

History

History
13 lines (10 loc) · 1002 Bytes

README.md

File metadata and controls

13 lines (10 loc) · 1002 Bytes

Multi dialect Arabic stop words

A large list of domain-independent multi-dialect Arabic stop words.

This repository includes two files

  1. a list of 405 domain-independent multi-dialect Arabic stop words.
  2. a lsit of 1,177 Arabic stop-words after adding the multi-dialect list to the NLTK and Alrefaie’s lists.

If you use these lists in your research, please cite the following paper:

Alharbi, Alaa, and Mark Lee. "Kawarith: an Arabic Twitter Corpus for Crisis Events." Proceedings of the Sixth Arabic Natural Language Processing Workshop. 2021

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.