This repository is created to collect a complete list of profanity for different languages to be used in e.g. profanity detection tools / automated solutions for audio and text censorship.
Here is the table with the available languages:
🇬🇧 English | 🏳️ Esperanto | 🇪🇸 Spanish | 🇮🇷 Farsi |
---|---|---|---|
🇫🇮 Finnish | 🇵🇭 Filipino | 🇫🇷 French | 🇮🇳 Hindi |
🇭🇺 Hungarian | 🇮🇹 Italian | 🇯🇵 Japanese | 🇰🇷 Korean |
🇳🇱 Dutch | 🇳🇴 Norwegian | 🇵🇱 Polish | 🇵🇹 Portuguese |
🏳️ Russian | 🇸🇪 Swedish | 🇹🇭 Thai | 🇹🇷 Turkish |
🇺🇦 Ukrainian | 🇨🇳 Chinese |
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
You can quickly start editing on GitHub Codespaces right away
You can also use vscode online editor, you just need to press the dot . key 🪄🔮
Or follow this link: github.dev/okineadev/profanity-list
You can also use the standard method:
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
) - Commit your Changes (
git commit -m 'Add some AmazingFeature'
) - Push to the Branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Also, please read our Code of Conduct, and follow it in all your interactions with the project.
- https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
- https://github.com/coffee-and-fun/google-profanity-words/blob/main/data/en.txt
- https://github.com/rominf/profanity-filter/blob/master/profanity_filter/data/en_profane_words.txt
- https://github.com/profanitas/abuse/blob/master/abuse/dataset_en.csv