
We use n-gram language models to investigate how far language approximates an optimal code for human communication in terms of Information Theory, and what differences there are between Learner proficiency levels. Although the language of lower level learners is simpler, it is less optimal in terms of information theory, and as a consequence more difficult to process.
General Language Studies and Linguistics, Jämförande språkvetenskap och allmän lingvistik, proficiency levels, spoken corpora, 10097 English Department, surprisal, L2, 820 English & Old English literatures, 11551 Zurich Center for Linguistics, 1712 Software, 1709 Human-Computer Interaction, 10105 Institute of Computational Linguistics, 1711 Signal Processing, 1203 Language and Linguistics, 820 English & Old English literatures, 2611 Modeling and Simulation
General Language Studies and Linguistics, Jämförande språkvetenskap och allmän lingvistik, proficiency levels, spoken corpora, 10097 English Department, surprisal, L2, 820 English & Old English literatures, 11551 Zurich Center for Linguistics, 1712 Software, 1709 Human-Computer Interaction, 10105 Institute of Computational Linguistics, 1711 Signal Processing, 1203 Language and Linguistics, 820 English & Old English literatures, 2611 Modeling and Simulation
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
