The Growing Collection. This dataverse is a repository for collections of audio & video recordings and transcriptions that have been recorded and released according to the terms of the Growing Collection. Each speaker has been asked whether they would prefer their audio to be identifiable or anonymous. To gain access to audio you must agree to the terms of the Growing Collection

Audio Collections (Words)
Spring-Village Corpus of minimal pairs in Singapore Mandarin Chinese (Mar 2022)
• Pitch-Peach Corpus of minimal pairs and homophones in Singapore English (under construction)
• TOWRE Skilled Adult Readers of Singapore English (under construction)
• Early Word Lists (under construction)
• /i/ /a/ /u/ SESAME Picture Card Corpus - Adults speaking English, Mandarin, Malay and Tamil (coming soon)
Zoo-Lǎohǔ Word Set Cross linguistic lexical priming word set for animacy judgements (English and Mandarin Chinese)

Audio Collections (Sentences/Narrations/Conversations)
Laksa Corpus (Archived Jan 2020)
Leaf & Stone Corpus (Archived Jan 2020)
Green Grass Park Picture Description Corpus in Singapore Mandarin (Feb 2022)
• SESAME Topic Prompts (under construction)

Video Collections
• Dog-Dark Corpus of hand-shapes in SgSL Corpus (Coming Soon)
• SESAME Topic Prompts SgSL Corpus (Under Development)
• "What a Scary Storm!" SgSL Corpus (Under Development)

Synthesized speech derived from natural speech tokens
Boat-Vote continuum for categorical perception studies

MERLIon Challenge Collections
• MERLIon CCS Challenge 2023

Corpus Administration
Growing Collection Release Form (Archived Jan 2020)
• Growing Collection Corpus metadata template (Coming soon)

Terms of use: To access recordings in these collections, users must agree to the following terms: Recordings of audio and video must be treated with respect, and should not be presented in any context which might cause harm or embarrassment to the speaker. For example recordings should not be associated with assessments of racial prejudice, evaluations of likely criminality, sexual orientation, religious affiliation, or any other sensitive material. Recordings should not be paired with distressing or unpleasant stimuli in another sensory domain (e.g., unpleasant pictures, unpleasant smells). No individual should be identified as ‘bad at’ any aspect of the task. Where Usernames have been given, Usernames must be presented alongside any vocal samples used as illustrations of method or results. For example, named or listed in the credits of a documentary; named in a digital file published as supplementary material in a journal article; or listed in live demonstrations (e.g., Presentation at academic conferences, Public science lectures). Any use of files from the Growing Collection must be credited as specified in the Terms of Use for that dataset.

Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 9 of 9 Results
Apr 18, 2024 - Cross Linguistic Priming Word Lists
Chay, Zhane T.; Loh, Annabel; Woon, Fei Ting; Styles, Suzy J., 2024, "Zoo-Lǎohǔ word set: A cross linguistic lexical priming word set for animacy judgements (English and Mandarin Chinese)", https://doi.org/10.21979/N9/FEUSIO, DR-NTU (Data), V1, UNF:6:dNuEEAOY9voHbVGHeBWsbg== [fileUNF]
Audio tokens selected and edited from BLIP Lab's Singapore Early Word List - Audio Recordings. Selection performed by Tong, Zhane C. in 2023, under the supervision of SJS. Original audio recordings conducted by Woon Fei Ting and Annabel Loh under the supervision of Suzy J Styles...
Apr 17, 2024
Paca, Angelie M. G; Pan, Lei; Styles, Suzy J, 2024, "Boat-Vote continuum for categorical perception studies", https://doi.org/10.21979/N9/PNBXYU, DR-NTU (Data), V1
This corpus contains audio recordings released by participants under the terms of the Growing Collection. To access this audio, you must agree to the terms of use.
Aug 11, 2023 - MERLIon CCS Challenges
Chua, Victoria Yi Han; Garcia Perera, Leibny Paola; Khudanpur, Sanjeev; Khong, Andy W. H.; Dauwels, Justin; Woon, Fei Ting; Styles, Suzy J, 2023, "Development and Evaluation data for Multilingual Everyday Recordings - Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge", https://doi.org/10.21979/N9/ANXS8Z, DR-NTU (Data), V1, UNF:6:QFBERdU0YulYhMohwDaNWg== [fileUNF]
The inaugural Multilingual Everyday Recordings - Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge focuses on developing robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous...
Aug 11, 2023 - MERLIon CCS Challenges
Chua, Victoria Yi Han; Styles, Suzy J, 2023, "MERLIon CCS Challenge Development and Evaluation Datasets Open Preview (Documentation)", https://doi.org/10.21979/N9/4RHC3D, DR-NTU (Data), V1, UNF:6:gMeqVZk9eCvbqN6QXFyuWg== [fileUNF]
The inaugural Multilingual Everyday Recordings - Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge focuses on developing robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous...
Mar 29, 2022 - Spring Village Corpus of minimal pairs in Mandarin Chinese
Goh, Hannah L; Woon, Fei Ting; Styles, Suzy J, 2022, "Spring Village Corpus of minimal pairs in Mandarin Chinese - Singaporean Adults", https://doi.org/10.21979/N9/ZTMPML, DR-NTU (Data), V1
This corpus contains audio recordings released by participants under the terms of the Growing Collection. To access this audio, you must agree to the terms of use. TBC (brief description drawn from Corpus Doc)
Feb 15, 2022 - Green Grass Park Picture Description Corpus
Pan, Lei; Styles, Suzy J, 2022, "Green Grass Park Picture Description Corpus - Singapore Mandarin Adults", https://doi.org/10.21979/N9/OPSN64, DR-NTU (Data), V1
This corpus contains audio recordings released by participants under the terms of the Growing Collection. To access this audio, you must agree to the terms of use. This Corpus contains audio recordings and associated information created using the Green Grass Park (Mandarin Chines...
Jan 13, 2020 - Leaf & Stone Corpus - Singapore
Styles, Suzy J; Travers Kumar, Juanita; Kovic, Vanja, 2020, "Leaf & Stone Corpus (Singapore 2016) Audio", https://doi.org/10.21979/N9/OF5ZDK, DR-NTU (Data), V1, UNF:6:dmhxhSKAdqqaI1hZczAT9w== [fileUNF]
Participants reading aloud from the experimental storybooks 'A Fine Day for a New Hat (Singapore English Version)' by Suzy J Styles, Jelena Sucevic & Vanja Kovic, adapted to Singaporean English by Juanita Travels Kumar.
Jan 13, 2020
Styles, Suzy J, 2020, "Growing Collection of Human Voices - Audio Release Form", https://doi.org/10.21979/N9/I6KXC6, DR-NTU (Data), V1
The Growing Collection It is one of our goals to build up a collection of recordings from different communities around the world, which will help us to answer questions about the function and features of speech produced for different listeners. We are particularly interested in h...
Jan 13, 2020 - Laksa Corpus
Styles, Suzy J; Bin Mustaffa, M Asyraf, 2020, "Laksa Corpus Audio Recordings", https://doi.org/10.21979/N9/DHYM9H, DR-NTU (Data), V1
Audio recordings for the following preregistered study: Bin Mustaffa, MA & Styles SJ (2018), "Preregistration Documents: Ordering Laksa - Preregistered Design for Speech Elicitation in Singapore English Diglossia", DR-NTU (Data). https://doi.org/10.21979/N9/WW6ZHP Following the p...
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.