Datasets
Bangor Talk Siarad Welsh-English corpus
License: GPL-3.0
Locale: cym
Task: ASR
Format: MP3, CHA, TSV
Size: 2.13 GB
Test
License: EUPL-1.2
Locale: Test
Task: LID
Format: Test
Size: 50.79 MB
swdad
License: Apache-2.0
Locale: dadawd
Task: NLP
Format: adwad
Size: 15.36 MB
awdadad
License: Apache-2.0
Locale: en-US
Task: NLP
Format: wadada
Size: 249.04 MB
Govtube - Kuña Rembiasa
License: CC0-1.0
Locale: es-PY, gn-PY
Task: ASR
Format: TSV, MP3
Size: 52.52 MB
A new test with the new flow v2
License: BSD-3-Clause
Locale: en-US
Task: MT
Format: Unknown
Size: 249.04 MB
versions
License: BSD-3-Clause
Locale: mul
Task: ML
Format: MP3
Size: 15.36 MB
Wonderful new dataset
License: BSD-3-Clause
Locale: en-US
Task: LID
Format: MP3
Size: 1.82 MB
Long Other Information Description Dataset
License: Apache-2.0
Locale: en
Task: NLP
Format: WAV
Size: 4.20 MB
Dataset with long & short desc
License: CC-SA-1.0
Locale: en-US
Task: CV
Format: MP3
Size: 15.36 MB
dawdawd
License: CC-BY-ND-4.0
Locale: en-US
Task: NLP
Format: adwad
Size: 231.89 MB
Dataset with long desc
License: CC-BY-SA-4.0
Locale: en-US
Task: ASR
Format: MP3
Size: 15.36 MB
New Dev Dataset
License: CC-SA-1.0
Locale: en-US
Task: TTS
Format: MP3
Size: 15.36 MB
New Dev Dataset
License: Apache-2.0
Locale: en-US
Task: NLP
Format: MP3
Size: 15.36 MB
Common Voice Scripted Speech 24.0 - Zaza
License: CC0-1.0
Locale: zza
Task: ASR
Format: MP3
Size: 50.79 MB
Common Voice Scripted Speech 24.0 - Zulu
License: CC0-1.0
Locale: zu
Task: ASR
Format: MP3
Size: 7.57 MB
Common Voice Scripted Speech 24.0 - Copainalá Zoque
License: CC0-1.0
Locale: zoc
Task: ASR
Format: MP3
Size: 204.05 MB
Common Voice Scripted Speech 24.0 - Chinese (Taiwan)
License: CC0-1.0
Locale: zh-TW
Task: ASR
Format: MP3
Size: 2.93 GB
Common Voice Scripted Speech 24.0 - Chinese (Hong Kong)
License: CC0-1.0
Locale: zh-HK
Task: ASR
Format: MP3
Size: 3.40 GB
Common Voice Scripted Speech 24.0 - Chinese (China)
License: CC0-1.0
Locale: zh-CN
Task: ASR
Format: MP3
Size: 21.31 GB
Common Voice Scripted Speech 24.0 - Tamazight
License: CC0-1.0
Locale: zgh
Task: ASR
Format: MP3
Size: 39.47 MB
Common Voice Scripted Speech 24.0 - Cantonese
License: CC0-1.0
Locale: yue
Task: ASR
Format: MP3
Size: 5.98 GB
Common Voice Scripted Speech 24.0 - Yoruba
License: CC0-1.0
Locale: yo
Task: ASR
Format: MP3
Size: 162.93 MB
Common Voice Scripted Speech 24.0 - Yiddish
License: CC0-1.0
Locale: yi
Task: ASR
Format: MP3
Size: 42.69 MB