Datasets

Filters:
Common Voice

Example Dataset Upload - 2025 10 23

Example Dataset Upload - 2025 10 23
License Icon

License: cc-0

Locale Icon

Locale: en-US

Task Icon

Task: NLP

Format Icon

Format: mp3

Size Icon

Size: 72.21 MB

Community

Test dataset - random

This is a test dataset that I will search for on my computer.
License Icon

License: CC-BY-4.0

Locale Icon

Locale: nhi

Task Icon

Task: NLP

Format Icon

Format: wav,conllu

Size Icon

Size: 330.70 KB

Common Voice

newest test

test
License Icon

License: CC0-1.0

Locale Icon

Locale: en-US

Task Icon

Task: CV

Format Icon

Format: tar.gz

Size Icon

Size: 72.21 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Kuku

A collection of spontaneous spoken phrases in Kuku.
License Icon

License: CC0-1.0

Locale Icon

Locale: ukv

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 237.60 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Amba (Uganda)

A collection of spontaneous spoken phrases in Amba (Uganda).
License Icon

License: CC0-1.0

Locale Icon

Locale: rwm

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 265.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Sena

A collection of spontaneous spoken phrases in Sena.
License Icon

License: CC0-1.0

Locale Icon

Locale: seh

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 4.40 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Sabah Malay

A collection of spontaneous spoken phrases in Sabah Malay.
License Icon

License: CC0-1.0

Locale Icon

Locale: msi

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 277.20 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Western Penan

A collection of spontaneous spoken phrases in Western Penan.
License Icon

License: CC0-1.0

Locale Icon

Locale: pne

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 247.40 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Toba

A collection of spontaneous spoken phrases in Toba.
License Icon

License: CC0-1.0

Locale Icon

Locale: tob

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 172.50 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Scots

A collection of spontaneous spoken phrases in Scots.
License Icon

License: CC0-1.0

Locale Icon

Locale: sco

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 228 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Ruuli

A collection of spontaneous spoken phrases in Ruuli.
License Icon

License: CC0-1.0

Locale Icon

Locale: ruc

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 365.20 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Central Melanau

A collection of spontaneous spoken phrases in Central Melanau.
License Icon

License: CC0-1.0

Locale Icon

Locale: mel

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 208.60 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Tooro

A collection of spontaneous spoken phrases in Tooro.
License Icon

License: CC0-1.0

Locale Icon

Locale: ttj

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 272.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Bukar-Sadung Bidayuh

A collection of spontaneous spoken phrases in Bukar-Sadung Bidayuh.
License Icon

License: CC0-1.0

Locale Icon

Locale: sdo

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 200.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Turkish

A collection of spontaneous spoken phrases in Turkish.
License Icon

License: CC0-1.0

Locale Icon

Locale: tr

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 3.10 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Malay (Malaysia)

The collection of spontaneous spoken phrases in Malay (Malaysia).
License Icon

License: CC0-1.0

Locale Icon

Locale: ms-MY

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 126.10 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Southwestern Tlaxiaco Mixtec

A collection of spontaneous spoken phrases in Southwestern Tlaxiaco Mixtec.
License Icon

License: CC0-1.0

Locale Icon

Locale: meh

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 201.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Papantla Totonac

A collection of spontaneous spoken phrases in Papantla Totonac.
License Icon

License: CC0-1.0

Locale Icon

Locale: top

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 205.70 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Mainstream Kenyah

A collection of spontaneous spoken phrases in Mainstream Kenyah.
License Icon

License: CC0-1.0

Locale Icon

Locale: xkl

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 212.30 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Sa'ban

A collection of spontaneous spoken phrases in Sa'ban.
License Icon

License: CC0-1.0

Locale Icon

Locale: snv

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 212.90 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Russian

A collection of spontaneous spoken phrases in Russian.
License Icon

License: CC0-1.0

Locale Icon

Locale: ru

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 5.40 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Michoacán Mazahua

A collection of spontaneous spoken phrases in Michoacán Mazahua.
License Icon

License: CC0-1.0

Locale Icon

Locale: mmc

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 225.70 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Galician

A collection of spontaneous spoken phrases in Galician.
License Icon

License: CC0-1.0

Locale Icon

Locale: gl

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 21.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Bodo

A collection of spontaneous spoken phrases in Bodo.
License Icon

License: CC0-1.0

Locale Icon

Locale: brx

Task Icon

Task: ASR

Format Icon

Format: MP3

Size Icon

Size: 1.30 MB