Skip to content

Latest commit

 

History

History
22 lines (14 loc) · 821 Bytes

README.md

File metadata and controls

22 lines (14 loc) · 821 Bytes

extra-stopwords

This repository contains the set of stopwords I used with NLTK for the WbSrch search engine.

It contains some stopword lists from NLTK and ones cobbled together from other sources. The word lists are of varying quality. Feel free to modify them to suit your own needs -- I make no claim about their level of usefulness.

In order to use them with NLTK, they should be copied into your stopwords directory after you've set up NLTK and downloaded its own stopword list. The copy.sh file will do that, assuming NLTK's data directory is in your home directory.

License

MIT license. See the "LICENSE" file for full text.

Contibutors

The following people have contributed to improving this library:

  • Jason Champion
  • Mohammed Gholami
  • Jan Pipek
  • Pavle Vidanović
  • @BLKSerene
  • Yanaki Kolarov