
Morphological tagging of ancient Greek linguistic corpora has been under way for nearly 50 years. Notable projects include CCAT/CATSS (1977-), its commercial spinoffs (BibleWorks, 1992-; Logos, 1992-; Accordance, 1994-), TLG (2006-), PROIEL (2007-), AGDT 1.0 (2009-), AGDT 2.0 (2014-), Pedalion (2019-), GLAUx (2021-), and OGA (2023-). Together with the Diorisis Ancient Greek Corpus (2018-), the more recent projects have built corpora of tens of millions of tokens. In light of all these developments, what should we do and where should we go? Continue building international collaborations, curate customized datasets, and make linguistic phenomena more easily searchable and citable! Two new corpora of 32M+ tokens each (AGDTmini and CATnaPS) are released here, and a starter Jupyter Notebook is provided for getting started with each repository.
Linguistics/statistics & numerical data, Linguistics/history, Greek World/history, FOS: Languages and literature, Linguistics, Linguistics/education
