Comparing the semantic structures of lexicon of Mandarin and English

Keywords: Consciousness. Cognition, distributional semantics, Language and Literature, semantic vectors, P, mental lexicon, Procrustes analysis, clustering, BF309-499, semantic profiling

Yi Yang; R. Harald Baayen


Language and Cognition
Article . 2025 . Peer-reviewed
License: CC BY
Data sources: Crossref, DOAJ, European Union Open Data Portal

Open Science Framework
Other literature type . 2024
Data sources: Datacite

Publication: Article, Other literature type. 01 Jan 2025. English.
Publisher: Cambridge University Press (CUP)
Journal: Language and Cognition, volume 17 (ISSN: 1866-9808, eISSN: 1866-9859)

doi: 10.1017/langcog.2024.47 , 10.17605/osf.io/w79n6

Abstract

This paper presents a cross-language study of lexical semantics within the framework of distributional semantics. We used a wide range of predefined semantic categories in Mandarin and English and compared how these categories cluster in FastText word embeddings. Three dimensionality reduction techniques were applied to map the 300-dimensional FastText vectors onto two-dimensional planes: multidimensional scaling (MDS), principal components analysis (PCA), and t-distributed stochastic neighbor embedding (t-SNE). The results show that t-SNE provides the clearest clustering of semantic categories, improving markedly on PCA and MDS. In both languages, we observed similar differentiation between verbs, adjectives, and nouns, as well as between concrete and abstract words. In addition, the methods applied in this study, especially Procrustes analysis, make it possible to trace subtle differences in the structure of the semantic lexicons of Mandarin and English.
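The workflow named in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the vectors below are random stand-ins for 300-dimensional FastText embeddings, the three toy categories, the perplexity value, and the helper names `toy_vectors` and `project_2d` are all assumptions made for illustration, and the abstract does not say exactly how Procrustes analysis was applied. The sketch projects two paired sets of vectors to two dimensions with PCA, MDS, and t-SNE (scikit-learn), then uses `scipy.spatial.procrustes` to measure how well one 2D configuration can be aligned with the other.

```python
import numpy as np
from scipy.spatial import procrustes
from sklearn.decomposition import PCA
from sklearn.manifold import MDS, TSNE

# Hypothetical stand-in data: 3 semantic categories x 20 words, 300 dimensions.
# Real use would load FastText vectors for paired Mandarin and English words.
N_PER_CAT, DIM = 20, 300
centers = np.random.default_rng(0).normal(scale=3.0, size=(3, DIM))

def toy_vectors(seed):
    """Sample noisy points around the shared category centers."""
    rng = np.random.default_rng(seed)
    return np.vstack([c + rng.normal(size=(N_PER_CAT, DIM)) for c in centers])

def project_2d(X):
    """Reduce X to two dimensions with each of the three techniques."""
    return {
        "pca": PCA(n_components=2).fit_transform(X),
        "mds": MDS(n_components=2, random_state=0).fit_transform(X),
        "tsne": TSNE(n_components=2, perplexity=10, random_state=0).fit_transform(X),
    }

lang_a = toy_vectors(1)  # stand-in for one language
lang_b = toy_vectors(2)  # stand-in for the other; row i pairs with row i of lang_a

proj_a = project_2d(lang_a)
proj_b = project_2d(lang_b)

# Procrustes analysis: standardize both configurations, then find the rotation,
# reflection, and scaling of the second that best matches the first.
# The disparity is the remaining sum of squared differences (0 = identical shape).
for name in ("pca", "mds", "tsne"):
    _, _, disparity = procrustes(proj_a[name], proj_b[name])
    print(f"{name}: Procrustes disparity = {disparity:.3f}")
```

Procrustes analysis requires the two configurations to have the same shape with rows in corresponding order, so a real comparison would need a pairing between the two languages, for example translation-equivalent words or matched category centroids. A lower disparity indicates that the two 2D layouts are closer in shape after alignment.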

Related Organizations

University of Tübingen
Germany

Impact by BIP!

Selected citations: 1
Popularity: Average
Influence: Average
Impulse: Average

Access route: gold

Related to Research communities

Digital Humanities and Cultural Heritage