Tag: TLG

TLG Updates, May 2010

Post date: 2010-05-11 | Comments: No Comments
Posted in categories: Ancient Greek, Linguistics, Literature, Mediaeval Greek

The TLG has just released the latest updates to its text collection. This is what has been added, from the oldest to the most recent texts, with Early Modern Greek texts separate: Philodemus (i BC): On Anger (ed. Indelli, 1988) Philodemus is a Hellenistic philosopher, who we know about mainly thanks to Mt Vesuvius, carbonising […]

What are the longest words of Greek?

Post date: 2010-03-02 | Comments: 11 Comments
Posted in categories: Ancient Greek, Linguistics, Mediaeval Greek

Everyone knows (or should know) about the longest word of Greek ever—the word that broke the title bar of Wikipedia, Aristophanes’ fantastical dish of 17 ingredients at the end of the Ecclesiazusae, that lopado-temacho-thing: λοπαδοτεμαχοσελαχογαλεοκρανιολειψανοδριμυποτριμματοσιλφιολιπαρομελιτοκατακεχυμενοκιχλεπικοσσυφοφαττοπεριστεραλεκτρυονοπτοπιφαλλιδοκιγκλοπελειολαγῳοσιραιοβαφητραγανοπτερυγών (172 chars) Ah. It breaks blogspot too. 🙂 λοπαδοτεμαχοσελαχογαλεοκρανιολειψανοδριμυποτριμματοσιλφιο­λιπαρομελιτοκατακεχυμενοκιχλεπικοσσυφοφαττοπεριστεραλεκτρυονοπτο­πιφαλλιδοκιγκλοπελειολαγῳοσιραιοβαφητραγανοπτερυγών (172 chars) Have you ever wondered what the next longest words […]

New TLG words in DGE VII

Post date: 2010-02-15 | Comments: 1 Comment
Posted in categories: Ancient Greek, Linguistics

As I posted last month, the new volume of DGE (Diccionario Griego–Español) has appeared, spanning ἐκπελλεύω–ἔξαυος. As with any lexicographic work of an older language, some philology and textual emendation has been involved; this paper by Eugenio Luján Martínez gives four such instances, in Epicurus, Aretaeus, Nicander, and Galen. I have gone through this volume […]

Comparison, TLG BC and AD: log-likelihood

Post date: 2010-02-06 | Comments: 3 Comments
Posted in categories: Ancient Greek, Linguistics, Mediaeval Greek

Helma Dik left a comment on my post on comparing TLG AD and BC through Wordle, suggesting I use Dunning’s Log-Likelihood measure of differential word frequencies in corpora, as Wordled by Martin Mueller. That lets you work out what the real shifts in frequency are, rather than trying to eyeball them through the aggregate word […]
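
For readers who have not met the measure, here is a minimal sketch of Dunning's log-likelihood (G2) for one word across two corpora, computed from the 2x2 contingency table (word vs. all other words, corpus A vs. corpus B). The function name, parameter names, and counts are illustrative assumptions; this is not the code used for the TLG comparison.

```python
import math


def log_likelihood(count_a, count_b, total_a, total_b):
    """Dunning's G2 for a word seen count_a times in corpus A
    (total_a tokens) and count_b times in corpus B (total_b tokens).

    Hypothetical helper for illustration only.
    """
    n = total_a + total_b
    # Observed 2x2 table: rows = (this word, every other word),
    # columns = (corpus A, corpus B).
    observed = [
        [count_a, count_b],
        [total_a - count_a, total_b - count_b],
    ]
    row_totals = [count_a + count_b, n - (count_a + count_b)]
    col_totals = [total_a, total_b]

    g2 = 0.0
    for i in range(2):
        for j in range(2):
            o = observed[i][j]
            if o > 0:  # a zero cell contributes nothing to the sum
                expected = row_totals[i] * col_totals[j] / n
                g2 += o * math.log(o / expected)
    return 2.0 * g2


# Made-up counts: a word seen 50 times in one corpus and 10 in another,
# each corpus 1,000 tokens long.
score = log_likelihood(50, 10, 1000, 1000)
```

A score near zero means the word is about equally frequent in both corpora. Larger scores mark a bigger shift, though the score itself does not say which corpus favours the word; compare the relative frequencies for that.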

TLG updates

Post date: 2010-02-02 | Comments: 5 Comments
Posted in categories: Linguistics, Mediaeval Greek, Modern Greek

The TLG has just released a new update to its corpus. As of tonight, the automatic recognition of lemmata in the TLG which I’ve been working on has just reached 95% of all wordforms. With these two milestones, I’ll be posting a few things about the current corpus; I’ve already put up some Wordles, as […]

Comparison, TLG BC and AD

Post date: 2010-02-01 | Comments: 6 Comments
Posted in categories: Ancient Greek, Linguistics

In the previous post, I used Wordle to illustrate stop words in Greek (and, by the by, the exponential distribution of function words following Zipf’s Law). After getting rid of a whole bunch of stop words, I ended up with a Wordle of the lemmata of the TLG: But I stopped short of making sense of […]

Wordle and Greek stop words

Post date: 2010-01-31 | Comments: 7 Comments
Posted in categories: Ancient Greek, Linguistics

Some of you may be familiar with Wordle, an online tool which displays the words in a text with different sizes, depending on their frequency. Wordle is a convenient tool for seeing what the frequently mentioned concepts are in a text, so it gets a fair amount of use in blogs. It’s the same concept […]

The 23 to 29 Apolloniuses of Classical Literature

Post date: 2009-10-12 | Comments: 6 Comments
Posted in categories: Ancient Greek, Linguistics

I’m parking this posting here for lack of somewhere else to park it. (It’s not strictly language-related, but I’m realising philology posts are probably better pitched here than in The Other Place.) In my day-job capacity, I’m posting on the fluidity of identity in repositories—how, particularly if you’re relying on computer deduplication of identity, there […]

Lerna VIIc: Variants

Post date: 2009-07-21 | Comments: No Comments
Posted in categories: Ancient Greek, Linguistics, Mediaeval Greek

The various counts of lemmata that I’ve been putting out for the last while have made little mention of the difficulty in deciding whether two forms belong to variants of the same lemma, or distinct lemmata. The judgement call is difficult enough within a homogeneous language, with slight variations in derivational morphology. It’s even worse […]

Lerna VIIb: Lemma counts and proportion of text recognised

Post date: 2009-07-15 | Comments: 3 Comments
Posted in categories: Ancient Greek, Linguistics, Mediaeval Greek

We can keep dredging lemmata up to move towards a target of 300,000. But of course for a living language, as Modern Greek now is and as Ancient Greek once was, there is no ceiling in lemmata: people can always make up new words, and do. And because dictionaries will never exhaust what words people […]
