Applying automatically parsed corpora to the study of language variation.

Publication: Conference object, Part of book or chapter of book, Article | 01 Jan 2014 | Netherlands | Publisher: Dublin City University and Association for Computational Linguistics

Authors: Bloem, J.; Versloot, A.; Weerman, F.

handle: 11245/1.418113, 20.500.11755/ed2163cc-72c3-457b-9698-aa367dd1d8dd

Abstract

In this work, we discuss the benefits of using automatically parsed corpora to study language variation. The study of language variation is an area of linguistics in which quantitative methods have been particularly successful. We argue that the large datasets that can be obtained using automatic annotation can help drive further research in this direction, providing sufficient data for the increasingly complex models used to describe variation. We demonstrate this by replicating and extending a previous quantitative variation study that used manually and semi-automatically annotated data. We show that while the study cannot be replicated completely due to limitations of the existing automatic annotation, we can draw at least the same conclusions as the original study. In addition, we demonstrate the flexibility of this method by extending the findings to related linguistic constructions and to another domain of text, using additional data.

Country

Netherlands

Related Organizations

Royal Netherlands Academy of Arts and Sciences (KNAW)
Netherlands
University of Amsterdam
Netherlands

Keywords

410, 004

Fields of Science

social sciences

psychology and cognitive sciences

Related to Research communities

Netherlands Research Portal