[tlhIngan Hol] Klingon corpus tool

Iikka Hauhio iyqa at outlook.com
Thu Dec 17 14:45:21 PST 2020


Some time ago I assembled a corpus of approximately 240,000 Klingon words. It consists of publicly available texts plus a couple of texts used with permission, and it includes most of the Okrandian canon. You can search the corpus here: https://klingon-corpus.herokuapp.com/ . The search can be limited to selected sources and accepts regular expressions as queries. It also has a built-in dictionary for checking the meanings of the words it contains. I hope it is useful for language learners and researchers alike. If you have any suggestions for improving the tool, I'd be happy to hear them.

Also, I would be pleased if any of you donated texts for inclusion in the corpus. To protect the authors' copyright, I have limited the number of search results to one hundred, so it is not possible to retrieve an entire text with an empty search query. The website is meant to be a search engine, not a way to download copyrighted material.
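
To illustrate the idea, here is a minimal sketch of a regex search restricted to chosen sources and capped at one hundred results. It is not the code running on the website; the corpus layout (a dict mapping source names to lists of lines), the function name `search`, and the sample source names are all assumptions made for the example.

```python
import re
from itertools import islice

MAX_RESULTS = 100  # the cap described in the message

# Hypothetical in-memory corpus: source name -> list of text lines.
SAMPLE_CORPUS = {
    "source_a": ["nuqneH", "tlhIngan Hol Dajatlh'a'"],
    "source_b": ["Heghlu'meH QaQ jajvam"],
}


def search(corpus, pattern, sources=None, limit=MAX_RESULTS):
    """Return up to `limit` (source, line) pairs whose line matches `pattern`.

    `sources` optionally restricts the search to some source names.
    An empty pattern matches every line, so the cap is what prevents
    a whole text from being returned.
    """
    regex = re.compile(pattern)
    names = list(corpus) if sources is None else sources
    hits = (
        (name, line)
        for name in names
        for line in corpus.get(name, [])
        if regex.search(line)
    )
    # islice stops consuming the generator once the cap is reached.
    return list(islice(hits, limit))
```

For example, `search(SAMPLE_CORPUS, r"Hol", sources=["source_a"])` would look only in the hypothetical `source_a`, while an empty query against a large corpus would still return no more than one hundred lines.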

For this reason, I will not share the whole corpus publicly, but I'm willing to run analyses on it if someone wants. I have already published frequency lists of words, morphemes and syllables on the website; I hope they can be used, for example, when compiling beginners' word lists.
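
As a rough illustration of what a word frequency list involves, the sketch below counts word tokens in a few lines of text. It is only an assumption about how such a list could be built, not the method used for the published lists: the tokenizer (runs of ASCII letters and apostrophes) and the name `word_frequencies` are hypothetical, and morpheme or syllable counts would need a Klingon-specific analyzer that is not shown. Case is preserved because the standard romanization distinguishes letters such as q and Q.

```python
import re
from collections import Counter

# Assumed tokenizer: a word is a run of ASCII letters and apostrophes.
TOKEN = re.compile(r"[A-Za-z']+")


def word_frequencies(lines):
    """Return (word, count) pairs sorted from most to least frequent."""
    counts = Counter()
    for line in lines:
        # Case is kept as-is; lowercasing would merge distinct letters.
        counts.update(TOKEN.findall(line))
    return counts.most_common()
```

The most frequent entries of such a list are the natural candidates for a beginners' word list.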

Best regards,
Iikka "fergusq" Hauhio