css10¶

Created by Kyubyong Park and Thomas Mulc

version	1.0.0
license	CC0-1.0
usage	unrestricted
languages	zho, nld, fin, fra, deu, ell, hun, jpn, rus, spa
format	wav
channel	1
sampling rate	22050
bit depth	32
duration	5 days 19:41:56.206122452
files	63857, duration distribution: 1.0 s 39.5 s
repository	audb-public

Description¶

CSS10 is a collection of single speaker speech data for 10 languages. Each of them consists of audio files recorded by a single volunteer and their aligned text sourced from LibriVox. The dataset contains: 6h for Chinese, 13h for Dutch, 10h for Finnish, 19h for French, 16h for German, 4h for Greek, 10h for Hungarian, 15h for Japanese, 21h for Russian, 24h for Spanish.

Example¶

19demarzo/19demarzo_0073.wav

../_images/css10-1.0.0-player-waveform.png

Tables¶

Click on a row to toggle a preview.

files

filewise

transcription, normalized-transcription, language, speaker

file	transcription	normalized-transcription	language	speaker
call_to_arms/call_to_arms_0000.wav	《呐喊》自序·鲁迅·我在年青时候也曾经做过许多梦，	nà hǎn zì xù · lǔ xùn · wǒ zài nián qīng shí hou yě céng jīng zuò guò xǔ duō mèng ，	chinese	Jing Li
call_to_arms/call_to_arms_0001.wav	后来大半忘却了，但自己也并不以为可惜。所谓回忆者，虽说可以使人欢欣，有时也不免使人寂寞，	hòu lái dà bàn wàng què liào ， dàn zì jǐ yě bìng bù yǐ wéi kě xī 。 suǒ wèi huí yì zhě ， suī shuō ...	chinese	Jing Li
call_to_arms/call_to_arms_0002.wav	使精神的丝缕还牵着己逝的寂寞的时光，又有什么意味呢，而我偏苦于不能全忘却，	shǐ jīng shen dì sī lǚ huán qiān zháo jǐ shì dì jì mò dì shí guāng ， yòu yǒu shén me yì wèi ní ， ...	chinese	Jing Li
call_to_arms/call_to_arms_0003.wav	这不能全忘的一部分，到现在便成了《呐喊》的来由。我有四年多，曾经常常，	zhè bù néng quán wàng dì yī bù fen ， dào xiàn zài biàn chéng liào nà hǎn dì lái yóu 。 wǒ yǒu sì n...	chinese	Jing Li
call_to_arms/call_to_arms_0004.wav	——几乎是每天，出入于质铺和药店里，	— — jī hū shì měi tiān ， chū rù yú zhí pù hé yào diàn lǐ ，	chinese	Jing Li
63857 rows x 4 columns

Schemes¶

ID	Dtype	Labels
language	str	chinese, dutch, finnish, french, german, greek, hungarian, japanese, russian, spanish
normalized-transcription	str
speaker	str	Bart de Leeuw, Diana Majlinger, Gilles G. Le Blanc, Harri Tapani Ylilammi, Hokuspokus, Jing Li, Mark Chulsky, Rapunzelina, Tux, ekzemplaro
transcription	str