Can Large Language Models Transform Computational Social  Science?

Name: Can Large Language Models Transform Computational Social Science?
Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Computational linguistics. Natural language processing, P98-98.5, Computation and Language (cs.CL), Machine Learning (cs.LG)

Caleb Ziems; William Held; Omar Shaikh; Jiaao Chen; Zhehao Zhang; Diyi Yang

Found an issue? Give us feedback

Computational Lingui...arrow_drop_down

Computational Linguistics

Article . 2024 . Peer-reviewed

License: CC BY NC ND

Data sources: Crossref

arXiv.org e-Print Archive

Preprint . 2023

Data sources: arXiv.org e-Print Archive

Computational Linguistics

Article . 2024

Data sources: DOAJ

https://dx.doi.org/10.48550/ar...

Article . 2023

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

Can Large Language Models Transform Computational Social Science?

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2024Embargo end date: 01 Jan 2023 English Publisher:MIT PressJournal:Computational Linguistics, volume 50, pages 237-291 (issn: 0891-2017, eissn: 1530-9312,

Copyright policy )

Authors: Caleb Ziems; William Held; Omar Shaikh; Jiaao Chen; Zhehao Zhang; Diyi Yang;

doi: 10.1162/coli_a_00502 , 10.48550/arxiv.2305.03514

arXiv: 2305.03514

Can Large Language Models Transform Computational Social Science?

- Summary
- Subjects
- Metrics

Abstract

Abstract Large language models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the computational social science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 language models on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers’ gold references. We conclude that the performance of today’s LLMs can augment the CSS research pipeline in two ways: (1) serving as zero-shot data annotators on human annotation teams, and (2) bootstrapping challenging creative generation tasks (e.g., explaining the underlying attributes of a text). In summary, LLMs are posed to meaningfully participate in social science analysis in partnership with humans.

Related Organizations

Georgia Institute of Technology
United States
Stanford University
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Computational linguistics. Natural language processing, P98-98.5, Computation and Language (cs.CL), Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	235
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 0.1%

Found an issue? Give us feedback

235

Top 0.1%

Top 1%

Top 0.1%

Green

gold