<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Hi,</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
some time ago I made assembled a corpus containing approximately 240,000 Klingon words. I included publicly available texts and a couple of texts with permission. It includes most of the Okrandian canon. You can search the corpus here:
<a href="https://klingon-corpus.herokuapp.com/" id="LPlnk828232">https://klingon-corpus.herokuapp.com/</a> . It allows limiting search to only some sources and using regexes as search queries. It also has a builtin dictionary that can be used to check meanings
of included words. I hope it is useful for both language learners and researchers alike. If any of you here have any suggestions to improve this tool I'd be happy to hear them.<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Also, I would be pleased if any of you donated texts to me to include them in the corpus. To protect the copyright of the authors, I have limited the number of search results to one hundred. This way it is not possible to get an entire text using an empty search
query. The purpose of the website is to be a search engine, not a way to download copyrighted material.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Due to this reason, I will not publicly share the whole corpus, but I'm willing to do analysis on it if someone wants. I have already published frequency lists of words, morphemes and syllables on the web site that I hope can be used for example when crafting
beginner's word lists etc.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Best regards,</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Iikka "fergusq" Hauhio<br>
</div>
</body>
</html>