[DataSet] Commonly used artificial intelligence data sets for speech

Common AI datasets for speech

Dataset category Common Datasets
Multilingual Speech Datasets Mozilla Common Voice dataset; Tatoeba dataset
English Speech Dataset VOiCES Dataset Dataset;LibriSpeech Dataset;2000 HUB5 English Dataset;VoxForge Dataset;VoxCeleb Dataset;TIMIT Dataset;CHIME Dataset;TED-LIUM Dataset;Google AudioSet Dataset;CCPE Dataset;FreSTAmerican English Corpus Dataset dataset; CSTR VCTK dataset; LibriTTScorpus dataset; TheAMI Corpus dataset
Chinese Speech Dataset Free ST Chinese Mandarin Corpus dataset;Primewords Chinese CorpusSet1 dataset;THCHS30 dataset;ST-CMDS dataset;MAGICDATAMandarin Chinese Read Speech Corpus dataset;ASHELL datasetMobvoiHotwords dataset;Aidatatang dataset
Other Language Datasets Vystadial dataset;ALFFA dataset;Heroico dataset;Tunisian MSA dataset;Afirican Accented French dataset;Afican Accented French dataset;ParlamentParla dataset;TEDx Spanish Corpus dataset

Guess you like

Origin blog.csdn.net/qq_44824148/article/details/129859693