ZENODO
Audiovisual . 2020
License: CC BY NC SA
Data sources: Datacite

Human vocalization corpus: recordings of infant-directed and adult-directed speech and song in 21 societies

Authors: Hilton, Courtney; Moser, Cody; Mehr, Samuel

Abstract

This repository contains a corpus of 1615 audio recordings of speech and song collected in 21 societies, first reported in Moser et al. (2020; bioRxiv) and later published in Hilton & Moser et al. (2022; Nature Human Behaviour). For assistance using any of this, contact Cody Moser (cmoser2@ucmerced.edu), Courtney Hilton (courtney.hilton@auckland.ac.nz), and Samuel Mehr (mehr@hey.com).

Two versions of the audio are included: raw audio (`IDS-corpus-raw.zip`) and audio that was edited to prepare the recordings for automatic acoustic feature extraction (`IDS-corpus-edited.zip`).

`IDS-textGrids.zip` contains annotation files from Praat's silence detection method, which were manually reviewed for accuracy. These files are used with the audio extraction scripts associated with the project (see code linked in paper) to build the edited audio files.

`IDS-fieldsites.csv` contains some fieldsite-level metadata; additional metadata is in the Supplementary Information of the paper.

In the two .zip archives, filenames have the format XXXYYZ.wav, where "XXX" is a fieldsite code, "YY" is a participant number, and "Z" is a vocalization type.

Fieldsite codes are:

  • MBE: Mbendjele BaYaka
  • HAD: Hadza
  • NYA: Nyangatom
  • TOP: Toposa
  • BEJ: Beijing
  • JEN: Jenu Kurubas
  • MEN: Mentawai Islanders
  • KRA: Krakow
  • LIM: Rural Poland
  • TUR: Turku
  • USD: San Diego
  • TOR: Toronto
  • VAN: Tannese Vanuatuans
  • PNG: Enga
  • WEL: Wellington
  • ARA: Arawak
  • TSI: Tsimane
  • SPA: Sápara & Achuar
  • QUE: Quechua
  • ACO: Afrocolombians
  • MES: Colombian Mestizos

Participant numbers are padded integers, starting with 01, and are unique within fieldsites.

Vocalization types are:

  • A: infant-directed song
  • B: infant-directed speech
  • C: adult-directed song
  • D: adult-directed speech

In a few cases, participants vocalized in a different language than was expected, given the primary language of their fieldsite (e.g., when the participant was multilingual, or if they sang a song that contains multiple languages, as in The Beatles' "Michelle"). The file `IDS-unexpectedLanguages.csv` at https://github.com/themusiclab/infant-speech-song/blob/main/data/IDS-unexpectedLanguages.csv contains an inventory of these examples from the English-speaking fieldsites. This issue only affects a small minority of the recordings, as it was typically avoided by the researchers collecting the recordings.
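
The XXXYYZ.wav naming scheme described above can be decoded programmatically. The following is a minimal Python sketch, not part of the project's own code: the function name `parse_recording_name`, the `Recording` type, and the example filename `WEL01A.wav` are hypothetical, and the pattern assumes uppercase codes and exactly two-digit participant numbers, as the "YY" placeholder suggests.

```python
import re
from typing import NamedTuple

# Fieldsite codes and vocalization types as listed in the abstract.
FIELDSITES = {
    "MBE": "Mbendjele BaYaka", "HAD": "Hadza", "NYA": "Nyangatom",
    "TOP": "Toposa", "BEJ": "Beijing", "JEN": "Jenu Kurubas",
    "MEN": "Mentawai Islanders", "KRA": "Krakow", "LIM": "Rural Poland",
    "TUR": "Turku", "USD": "San Diego", "TOR": "Toronto",
    "VAN": "Tannese Vanuatuans", "PNG": "Enga", "WEL": "Wellington",
    "ARA": "Arawak", "TSI": "Tsimane", "SPA": "Sápara & Achuar",
    "QUE": "Quechua", "ACO": "Afrocolombians", "MES": "Colombian Mestizos",
}
VOCALIZATION_TYPES = {
    "A": "infant-directed song",
    "B": "infant-directed speech",
    "C": "adult-directed song",
    "D": "adult-directed speech",
}

# Assumption: three uppercase letters, two digits, one letter A-D, ".wav".
_NAME_PATTERN = re.compile(r"^([A-Z]{3})(\d{2})([A-D])\.wav$")


class Recording(NamedTuple):
    fieldsite_code: str
    fieldsite: str
    participant: int
    vocalization: str


def parse_recording_name(filename: str) -> Recording:
    """Split a corpus filename into fieldsite, participant, and type."""
    # Drop any directory prefix, e.g. a folder inside the .zip archive.
    name = filename.rsplit("/", 1)[-1]
    match = _NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"not in XXXYYZ.wav format: {filename!r}")
    code, participant, vtype = match.groups()
    if code not in FIELDSITES:
        raise ValueError(f"unknown fieldsite code: {code!r}")
    return Recording(code, FIELDSITES[code], int(participant),
                     VOCALIZATION_TYPES[vtype])


# Hypothetical filename, used only to illustrate the format.
print(parse_recording_name("WEL01A.wav"))
```

Because participant numbers are unique only within a fieldsite, a participant is best identified by the pair (`fieldsite_code`, `participant`) rather than by the number alone.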

Keywords

language, infants, speech, audio, parents, song, music
