speech-accent-archive¶

Created by Rachael Tatman, Steven H. Weinberger, et al.

version	2.2.0
license	CC-BY-NC-SA-4.0
usage	research
languages	eng
format	wav
channel	1
sampling rate	16000
bit depth	16
duration	0 days 16:29:32.808937500
files	2138, duration distribution: 15.8 s 106.6 s
segments	17688, duration distribution: 0.8 s 7.0 s
repository	audb-public

Description¶

This dataset contains 2138 speech samples, each from a different talker reading the same reading passage. Talkers come from 177 countries and have 214 different native languages. Each talker is speaking in English. ### All read the following passage: “Please call Stella. Ask her to bring these things with her from the store: Six spoons of fresh snow peas, five thick slabs of blue cheese, and maybe a snack for her brother Bob. We also need a small plastic snake and a big toy frog for the kids. She can scoop these things into three red bags, and we will go meet her Wednesday at the train station.” ### This dataset was collected by many individuals under the supervision of Steven H. Weinberger. The most up-to-date version of the archive is hosted at http://accent.gmu.edu/. If you use this dataset in your work, please include the following citation: Weinberger, S. (2013). Speech accent archive. George Mason University.

Example¶

wav/dutch18.wav

../_images/speech-accent-archive-2.2.0-player-waveform.png

Tables¶

Click on a row to toggle a preview.

accent.test

segmented

tone, native_language, sex

file	start	end	tone	native_language	sex
wav/amharic19.wav	0 days 00:00:00.860000	0 days 00:00:01.920000	neutral	amharic	female
wav/amharic19.wav	0 days 00:00:02.260000	0 days 00:00:04.680000	neutral	amharic	female
wav/amharic19.wav	0 days 00:00:05.280000	0 days 00:00:07.360000	neutral	amharic	female
wav/amharic19.wav	0 days 00:00:07.700000	0 days 00:00:09.700000	neutral	amharic	female
wav/amharic19.wav	0 days 00:00:10.060000	0 days 00:00:12.140000	neutral	amharic	female
2597 rows x 3 columns

files

filewise

age, age_onset, birthplace, native_language, sex, speaker, country, duration

file	age	age_onset	birthplace	native_language	sex	speaker	country	duration
wav/afrikaans1.wav	27	9	virginia, south africa	afrikaans	female	1	south africa	0 days 00:00:20.819562500
wav/afrikaans2.wav	40	5	pretoria, south africa	afrikaans	male	2	south africa	0 days 00:00:22.021250
wav/afrikaans3.wav	43	4	pretoria, transvaal, south africa	afrikaans	male	418	south africa	0 days 00:00:26.932250
wav/afrikaans4.wav	26	8	pretoria, south africa	afrikaans	male	1159	south africa	0 days 00:00:23.536312499
wav/afrikaans5.wav	19	6	cape town, south africa	afrikaans	male	1432	south africa	0 days 00:00:20.297125
2138 rows x 8 columns

segments

segmented

content

file	start	end	content
wav/afrikaans1.wav	0 days 00:00:00.960000	0 days 00:00:03.740000	speech
wav/afrikaans1.wav	0 days 00:00:04.100000	0 days 00:00:06.380000	speech
wav/afrikaans1.wav	0 days 00:00:06.760000	0 days 00:00:10.800000	speech
wav/afrikaans1.wav	0 days 00:00:11.220000	0 days 00:00:14.600000	speech
wav/afrikaans1.wav	0 days 00:00:14.900000	0 days 00:00:19.740000	speech
17688 rows x 1 column

Schemes¶

ID	Dtype	Labels
age	int
age_onset	int
birthplace	str
content	str	speech
country	str
duration	time
native_language	str
sex	str	female, male
speaker	int
tone	str	neutral