Subscribe to Blog via Email
Archive:
Month: February 2010
TLG updates
The TLG has just released a new update to its corpus. As of tonight, the automatic recognition of lemmata in the TLG which I’ve been working on has just reached 95% of all wordforms. With these two milestones, I’ll be posting a few things about the current corpus; I’ve already put up some Wordles, as […]
Comparison, TLG BC and AD
In the previous post, I used Wordle to illustrate stop words in Greek (and, by the by, the exponential distribution of function words following Zipf’s Law). After getting rid of a whole bunch of stop words, I ended up with a Wordle of the lemmata of the TLG:But I stopped short of making sense of […]