Human vocalization corpus: recordings of infant-directed and adult-directed speech and song in 21 societies

Name: Human vocalization corpus: recordings of infant-directed and adult-directed speech and song in 21 societies
Keywords: language, infants, speech, audio, parents, song, music

Hilton, Courtney; Moser, Cody; Mehr, Samuel

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Audiovisual . 2020

License: CC BY NC SA

Data sources: Datacite

ZENODO

Audiovisual . 2020

License: CC BY NC SA

Data sources: Datacite

Human vocalization corpus: recordings of infant-directed and adult-directed speech and song in 21 societies

appsOther research productkeyboard_double_arrow_right Audiovisual 11 Apr 2020Publisher:Zenodo

Authors: Hilton, Courtney; Moser, Cody; Mehr, Samuel;

doi: 10.5281/zenodo.5525161 , 10.5281/zenodo.5525160

Human vocalization corpus: recordings of infant-directed and adult-directed speech and song in 21 societies

- Summary
- Subjects
- Metrics

Abstract

This repository contains a corpus of 1615 audio recordings of speech and song collected in 21 societies, first reported in Moser et al. (2020; bioRxiv) and later published in Hilton & Moser et al. (2022; Nature Human Behaviour). For assistance using any of this, contact Cody Moser (cmoser2@ucmerced.edu), Courtney Hilton (courtney.hilton@auckland.ac.nz), and Samuel Mehr (mehr@hey.com). Two versions of the audio are included: raw audio (`IDS-corpus-raw.zip`) and audio that was edited to prepare the recordings for automatic acoustic feature extraction (`IDS-corpus-edited.zip`). `IDS-textGrids.zip` contains annotation files from Praat's silence detection method, which were manually reviewed for accuracy. These files are used with the audio extraction scripts associated with the project (see code linked in paper) to build the edited audio files. `IDS-fieldsites.csv` contains some fieldsite-level metadata; additional metadata is in the Supplementary Information of the paper. In the two .zip archives, filenames have the format XXXYYZ.wav, where "XXX" is a fieldsite code, "YY" is a participant number, and "Z" is a vocalization type. Fieldsite codes are: MBE: Mbendjele BaYaka HAD: Hadza NYA: Nyangatom TOP: Toposa BEJ: Beijing JEN: Jenu Kurubas MEN: Mentawai Islanders KRA: Krakow LIM: Rural Poland TUR: Turku USD: San Diego TOR: Toronto VAN: Tannese Vanuatuans PNG: Enga WEL: Wellington ARA: Arawak TSI: Tsimane SPA: Sápara & Achuar QUE: Quechua ACO: Afrocolombians MES: Colombian Mestizos Participant numbers are padded integers, starting with 01, and are unique within fieldsites. Vocalization types are: A: infant-directed song B: infant-directed speech C: adult-directed song D: adult-directed speech In a few cases, participants vocalized in a different language than was expected, given the primary language of their fieldsite (e.g., when the participant was multilingual, or if they sang a song that contains multiple languages, as in The Beatles' "Michelle"). The file `IDS-unexpectedLanguages.csv` at https://github.com/themusiclab/infant-speech-song/blob/main/data/IDS-unexpectedLanguages.csv contains an inventory of these examples from the English-speaking fieldsites. This issue only affects a small minority of the recordings, as it was typically avoided by the researchers collecting the recordings.

Related Organizations

University of Auckland
New Zealand
University of California, Merced
United States

Keywords

language, infants, speech, audio, parents, song, music

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Usage byUsageCounts

visibility	views	3K
download	downloads	92

3K
views
92
downloads
Powered by

Found an issue? Give us feedback

visibility

download

0

Average

3K

92