descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 30 Dec 2024 English Publisher:Springer Science and Business Media LLCJournal:Nature Communications, volume 15 (eissn: 2041-1723,

Authors: Lukas Galke; Yoav Ram; Limor Raviv;

doi: 10.1038/s41467-024-55158-1

pmid: 39738033

pmc: PMC11685529

Deep neural networks and humans both benefit from compositional language structure

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

AbstractDeep neural networks drive the success of natural language processing. A fundamental property of language is its compositional structure, allowing humans to systematically produce forms for new meanings. For humans, languages with more compositional and transparent structures are typically easier to learn than those with opaque and irregular structures. However, this learnability advantage has not yet been shown for deep neural networks, limiting their use as models for human language learning. Here, we directly test how neural networks compare to humans in learning and generalizing different languages that vary in their degree of compositional structure. We evaluate the memorization and generalization capabilities of a large language model and recurrent neural networks, and show that both deep neural networks exhibit a learnability advantage for more structured linguistic input: neural networks exposed to more compositional languages show more systematic generalization, greater agreement between different agents, and greater similarity to human learners.

Related Organizations

Max Planck Society
Germany
Tel Aviv University
Israel
Max Planck Institute for Psycholinguistics
Netherlands
University of Southern Denmark
Denmark
University of Glasgow
United Kingdom

Keywords

Neural Networks, Science, Q, Learning/physiology, Linguistics, Article, Computer, Deep Learning, Humans, Learning, Neural Networks, Computer, Language, Natural Language Processing

1 Research products, page 1 of 1

easy2deeplearn software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average