Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Closed Access logo, derived from PLoS Open Access logo. This version with transparent background. http://commons.wikimedia.org/wiki/File:Closed_Access_logo_transparent.svg Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Archivio istituziona...arrow_drop_down
image/svg+xml Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Closed Access logo, derived from PLoS Open Access logo. This version with transparent background. http://commons.wikimedia.org/wiki/File:Closed_Access_logo_transparent.svg Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao
https://doi.org/10.4324/978131...
Part of book or chapter of book . 2022 . Peer-reviewed
Data sources: Crossref
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Corpus linguistics

Authors: Bernardini Silvia; Ferraresi Adriano;
Abstract

A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. Thanks to software applications designed specifically for searching through corpora, known as concordancers or corpus query tools, it is possible to obtain information about patterns occurring in a single text or across sets of texts, that would almost certainly escape us if we only read the texts. At their simplest, corpus methods allow users to find out which words are used most frequently in a given corpus (wordlists), or more frequently in one corpus compared to another that acts as a baseline (keywords); users can also search for words that tend to go together more often than would be expected (collocations) or for repeated word sequences (variously called clusters, n-grams, or lexical bundles). Through the provision of information about word frequencies and about syntagmatic relationships established on the level of discourse (Saussure 1971/ 1916: 170ff.), corpora have revolutionised linguistics, allowing researchers to tap into a major new source of linguistic evidence, thus relaxing ‘the stranglehold of intuition’ (Sinclair 1991: 7) and the exclusive focus on the abstract paradigms of traditional grammar. The first modern corpora were developed in the 1970s and 1980s as a reaction against the methods of so-called ‘armchair linguistics’ (Fillmore 1992), which relied on the linguist’s intuition, or on the intuition of a few informants, to describe aspects of language. At the time, linguistics was heavily influenced by generativist views (e.g. Chomsky 1986), and language as an object of study was largely synonymous with a speaker’s linguistic competence. This in turn referred to knowledge of the grammaticality of a given construction, for which the intuition of a competent speaker was considered an adequate source of evidence. With the growing importance accorded to pragmatics and sociolinguistics, a shift occurred from linguistic competence to communicative competence, or competence on the contextual adequacy of language choices (Hymes 1972). More recently, usage-based linguistic approaches have become mainstream. These postulate that ‘usage events define and continuously redefine the language system in a dynamic way’ (Tummers et al. 2005: 228). In these approaches language performance, or actual samples of authentic language usage, have become the main object of linguistic analysis. Anticipating and accompanying these theoretical developments, in the last 50 years corpus methods have grown in importance and nowadays occupy a central position in linguistics. In the words of Stubbs (2009: 117), ‘[c]orpora are just data and quantitative methods are just methods, but their combination has led to a major shift in theory’. The applied branches of the discipline, such as first- and second- language acquisition, terminology and lexicography, and indeed the study of translation, have in turn discovered corpora, and are currently using them as a fundamental resource for studying the products of these activities, and to obtain indirect evidence about their underlying processes.

Keywords

translation, corpus, corpora, parallel corpora, comparable corpora, concordances, collocations, wordlists, keywords, n-grams, clusters, lexical bundles, sintagmatico relations

  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    46
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Top 1%
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Top 1%
Powered by OpenAIRE graph
Found an issue? Give us feedback
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
46
Top 1%
Average
Top 1%
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!