Are automatic methods for cognate detection good enough for phylogenetic reconstruction in historical linguistics?

Preprint, Conference object English OPEN
Rama, Taraka; List, Johann-Mattis; Wahle, Johannes; Jäger, Gerhard;
(2018)
  • Related identifiers: doi: 10.18653/v1/N18-2063
  • Subject: Computer Science - Computation and Language
    acm: ComputingMethodologies_PATTERNRECOGNITION

We evaluate the performance of state-of-the-art algorithms for automatic cognate detection by comparing how useful automatically inferred cognates are for the task of phylogenetic inference compared to classical manually annotated cognate sets. Our findings suggest that... View more
  • References (37)
    37 references, page 1 of 4

    Enrique Amigo´ , Julio Gonzalo, Javier Artiles, and Felisa Verdejo. 2009. A comparison of extrinsic clustering evaluation metrics based on formal constraints. Information retrieval, 12(4):461-486.

    Remco Bouckaert, Philippe Lemey, Michael Dunn, Simon J. Greenhill, Alexander V. Alekseyenko, Alexei J. Drummond, Russell D. Gray, Marc A. Suchard, and Quentin D. Atkinson. 2012. Mapping the origins and expansion of the Indo-European language family. Science, 337(6097):957-960.

    Claire Bowern and Quentin D. Atkinson. 2012. Computational phylogenetics of the internal structure of Pama-Nguyan. Language, 88:817-845.

    Lyle Campbell and William J. Poser. 2008. Language classification: History and Method. Cambridge University Press.

    Will Chang, Chundra Cathcart, David Hall, and Andrew Garrett. 2015. Ancestry-constrained phylogenetic analysis supports the Indo-European steppe hypothesis. Language, 91(1):194-244.

    Michael A. Covington. 1996. An algorithm to align words for historical comparison. Computational Linguistics, 22(4):481-496.

    Michael Dunn. 2012. Indo-European lexical cognacy database (IELex).

    Michael Dunn, Simon J. Greenhill, Stephen C. Levinson, and Russell D. Gray. 2011. Evolved structure of language shows lineage-specific trends in wordorder universals. Nature, 473(7345):79-82.

    Sean R. Eddy. 2004. Where did the BLOSUM62 alignment score matrix come from? Nature Biotechnology, 22(8):1035-1036.

    George F Estabrook, FR McMorris, and Christopher A Meacham. 1985. Comparison of undirected phylogenetic trees based on subtrees of four evolutionary units. Systematic Biology, 34(2):193-200.

  • Related Research Results (2)
    Inferred by OpenAIRE
    software
    Phylostar/Autocogphylo: Autocogphylo (2018)
    73%
    dataset
    On the Accuracy of Language Trees (2015)
    56%
  • Related Organizations (2)
  • Metrics
Share - Bookmark