
Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.
FOS: Computer and information sciences, Linguistics and Language, CR-I.2.7, LANGUAGE STYLE, Lt3, Computational social science, Computational linguistics, Languages and Literatures, Language and Linguistics, Social media, LINGUISTICS, Artificial Intelligence, ESHCC HIS, Computer Science - Computation and Language, IDENTIFICATION, LEXICAL VARIATION, Computer Science Applications, Sociolinguistics, Computational linguistics. Natural language processing, IDENTITY, GENDER, P98-98.5, Computation and Language (cs.CL)
| selected citations | 100 |
| popularity | Top 1% |
| influence | Top 10% |
| impulse | Top 10% |
