https://fastdatascience.com/natural-language-processing/nlp-on-under-resourced-languages/