https://towardsdatascience.com/benchmarking-python-nlp-tokenizers-3ac4735100c5 https://stackoverflow.com/questions/54941966/how-can-i-calculate-perplexity-using-nltk/55043954