This repo contains computational linguistics techniques and algorithms for processing human language. It covers data structures and algorithms for text processing, tokenization, and part-of-speech tagging, including topics such as word analogies with word2vec models. It also covers basic linguistics, probability concepts, and techniques for working with large amounts of data, and it builds familiarity with common resources such as the Penn Treebank and WordNet.
Each project has its own README to follow!
Feel free to contact me or open a ticket (PRs are always welcome!) with any questions, comments, suggestions, bug reports, etc.
This project is licensed under the terms of the MIT license.