Unigram tokenization: a subword tokenization algorithm used by SentencePiece. See also: N-Gram. Reference: https://huggingface.co/learn/nlp-course/en/chapter6/7
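
As a rough illustration of the core idea, a Unigram tokenizer gives each subword in its vocabulary a probability and picks the segmentation of a word whose token probabilities have the highest product. The sketch below finds that segmentation with a Viterbi-style dynamic program. It is not SentencePiece's implementation: the function name `unigram_tokenize`, the toy counts, and the vocabulary are all made up for the example, and it leaves out training (vocabulary pruning) entirely.

```python
import math


def unigram_tokenize(text, log_probs):
    """Return (tokens, log_score) for the most probable segmentation of text.

    log_probs maps each vocabulary token to its log probability.
    This is an illustrative helper, not a library API.
    """
    n = len(text)
    # best[i] = (best log score for text[:i], start index of its last token)
    best = [(-math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for end in range(1, n + 1):
        for start in range(end):
            piece = text[start:end]
            if piece in log_probs and best[start][0] > -math.inf:
                score = best[start][0] + log_probs[piece]
                if score > best[end][0]:
                    best[end] = (score, start)
    if best[n][0] == -math.inf:
        raise ValueError("text cannot be segmented with this vocabulary")
    # Walk the back-pointers from the end of the text to recover the tokens.
    tokens = []
    end = n
    while end > 0:
        start = best[end][1]
        tokens.append(text[start:end])
        end = start
    return tokens[::-1], best[n][0]


# Hypothetical toy vocabulary with made-up frequencies (assumption, not real data).
counts = {"h": 15, "u": 36, "g": 20, "hu": 15, "ug": 20, "hug": 15, "s": 5}
total = sum(counts.values())
TOY_LOG_PROBS = {tok: math.log(c / total) for tok, c in counts.items()}

tokens, score = unigram_tokenize("hugs", TOY_LOG_PROBS)
print(tokens)  # ['hug', 's']
```

Because every extra token multiplies in another probability below 1, the best segmentation tends to use the fewest, most frequent tokens the vocabulary allows.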