Downloads provided by UsageCounts
This repository contains a corpus of 1615 audio recordings of speech and song collected in 21 societies, first reported in Moser et al. (2020; bioRxiv) and later published in Hilton & Moser et al. (2022; Nature Human Behaviour). For assistance using any of this, contact Cody Moser (cmoser2@ucmerced.edu), Courtney Hilton (courtney.hilton@auckland.ac.nz), and Samuel Mehr (mehr@hey.com). Two versions of the audio are included: raw audio (`IDS-corpus-raw.zip`) and audio that was edited to prepare the recordings for automatic acoustic feature extraction (`IDS-corpus-edited.zip`). `IDS-textGrids.zip` contains annotation files from Praat's silence detection method, which were manually reviewed for accuracy. These files are used with the audio extraction scripts associated with the project (see code linked in paper) to build the edited audio files. `IDS-fieldsites.csv` contains some fieldsite-level metadata; additional metadata is in the Supplementary Information of the paper. In the two .zip archives, filenames have the format XXXYYZ.wav, where "XXX" is a fieldsite code, "YY" is a participant number, and "Z" is a vocalization type. Fieldsite codes are: MBE: Mbendjele BaYaka HAD: Hadza NYA: Nyangatom TOP: Toposa BEJ: Beijing JEN: Jenu Kurubas MEN: Mentawai Islanders KRA: Krakow LIM: Rural Poland TUR: Turku USD: San Diego TOR: Toronto VAN: Tannese Vanuatuans PNG: Enga WEL: Wellington ARA: Arawak TSI: Tsimane SPA: Sápara & Achuar QUE: Quechua ACO: Afrocolombians MES: Colombian Mestizos Participant numbers are padded integers, starting with 01, and are unique within fieldsites. Vocalization types are: A: infant-directed song B: infant-directed speech C: adult-directed song D: adult-directed speech In a few cases, participants vocalized in a different language than was expected, given the primary language of their fieldsite (e.g., when the participant was multilingual, or if they sang a song that contains multiple languages, as in The Beatles' "Michelle"). The file `IDS-unexpectedLanguages.csv` at https://github.com/themusiclab/infant-speech-song/blob/main/data/IDS-unexpectedLanguages.csv contains an inventory of these examples from the English-speaking fieldsites. This issue only affects a small minority of the recordings, as it was typically avoided by the researchers collecting the recordings.
language, infants, speech, audio, parents, song, music
language, infants, speech, audio, parents, song, music
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 3K | |
| downloads | 92 |

Views provided by UsageCounts
Downloads provided by UsageCounts