audio/vnd.wave - 42.6 KB - MD5: d3bb3e8a0a280a8eb7e40cc44b6ab227
audio/vnd.wave - 42.6 KB - MD5: ffd9b4fdcc1a6f38461063f868b6f584
audio/vnd.wave - 42.7 KB - MD5: 122edafcff871524564ca0a51b9e463a
Aug 11, 2023 - MERLIon CCS Challenges
Chua, Victoria Yi Han; Garcia Perera, Leibny Paola; Khudanpur, Sanjeev; Khong, Andy W. H.; Dauwels, Justin; Woon, Fei Ting; Styles, Suzy J, 2023, "Development and Evaluation data for Multilingual Everyday Recordings - Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge", https://doi.org/10.21979/N9/ANXS8Z, DR-NTU (Data), V1, UNF:6:QFBERdU0YulYhMohwDaNWg== [fileUNF]
The inaugural Multilingual Everyday Recordings - Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge focuses on developing robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous...
Tabular Data - 20.4 KB - 3 Variables, 247 Observations - UNF:6:bDmCMlXNpTBHaqDY+57ZdA==
Contains the timestamps of evaluated regions for language diarization in each audio recording in the MERLIon CCS Challenge development set.
Tabular Data - 10.5 KB - 1 Variable, 151 Observations - UNF:6:DBZ0LJYDQuBp+JpHC4e2wQ==
Contains the filenames of all audio recordings in the MERLIon CCS Challenge development set.
Plain Text - 4.1 KB - MD5: 2a4512e853e7ffdd47d4bb24550c55ca
Contains the release notes and dataset description of the MERLIon CCS Challenge development set.
Tabular Data - 13.2 KB - 5 Variables, 151 Observations - UNF:6:cor05m8DAi1IsQux4AFq2w==
Contains the total length of English and Mandarin speech (in milliseconds) and the number of English and Mandarin segments in each audio recording in the MERLIon CCS Challenge development set.
Unknown - 1000.0 MB - MD5: 6b49a389e5d0463f19364bd54d7de3d2
The MERLIon CCS Challenge Development Set Audio and Metadata is split into 5 parts. This is part 1 of 5. Download all 5 parts and extract them together using 7-Zip.
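Each file entry in this listing carries an MD5 checksum, so a downloaded part can be checked before extraction. The following is a minimal sketch in Python, not a procedure documented by the dataset authors. The local file name merlion_dev_part1.7z and the helper names are hypothetical, since the listing does not give file names; the checksum is the one shown for part 1 above.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a ~1000 MB archive part is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed_md5):
    """Compare a file's MD5 with the value shown in the listing
    (surrounding whitespace and letter case are ignored)."""
    return md5_of_file(path) == listed_md5.strip().lower()


if __name__ == "__main__":
    # Hypothetical local file name; the listing does not state one.
    part_1 = Path("merlion_dev_part1.7z")
    # MD5 listed for part 1 of 5 in the entry above.
    listed = "6b49a389e5d0463f19364bd54d7de3d2"
    if part_1.exists():
        status = "OK" if matches_listed_md5(part_1, listed) else "MISMATCH"
        print(f"{part_1}: {status}")
    else:
        print(f"{part_1} not found; download it first.")
```

A mismatch usually points to an incomplete or corrupted download, in which case that part should be fetched again before extracting the archive.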