The Cognitive Plausibility of Statistical Classification Models: Comparing Textual and Behavioral Evidence

Article English OPEN
Klavan, J. ; Divjak, D. (2016)
  • Publisher: De Gruyter

Usage-based linguistics abounds with studies that use statistical classification models to analyse either textual corpus data or behavioral experimental data. Yet, before we can draw conclusions from statistical models of empirical data that we can feed back into cognitive linguistic theory, we need to assess whether the text-based models are cognitively plausible and whether the behavior-based models are linguistically accurate. In this paper, we review four case studies that evaluate statistical classification models of richly annotated linguistic data by explicitly comparing the performance of a corpus-based model to the behavior of native speakers. The data come from four different languages (Arabic, English, Estonian, and Russian) and pertain to both lexical and syntactic near-synonymy. We show that behavioral evidence is needed to fine-tune and improve statistical models built on corpus data. We argue that methodological pluralism and triangulation are the key to a cognitively realistic linguistic theory.