speech-accent-archive

Created by Rachael Tatman, Steven H. Weinberger, et al.

version

2.2.0

license

CC-BY-NC-SA-4.0

usage

research

languages

eng

format

wav

channel

1

sampling rate

16000

bit depth

16

duration

0 days 16:29:32.808937500

files

2138, duration distribution: 15.8 s speech-accent-archive-2.2.0-file-duration-distribution 106.6 s

segments

17688, duration distribution: 0.8 s speech-accent-archive-2.2.0-segment-duration-distribution 7.0 s

repository

audb-public

Description

This dataset contains 2138 speech samples, each from a different talker reading the same reading passage. Talkers come from 177 countries and have 214 different native languages. Each talker is speaking in English. ### All read the following passage: “Please call Stella. Ask her to bring these things with her from the store: Six spoons of fresh snow peas, five thick slabs of blue cheese, and maybe a snack for her brother Bob. We also need a small plastic snake and a big toy frog for the kids. She can scoop these things into three red bags, and we will go meet her Wednesday at the train station.” ### This dataset was collected by many individuals under the supervision of Steven H. Weinberger. The most up-to-date version of the archive is hosted at http://accent.gmu.edu/. If you use this dataset in your work, please include the following citation: Weinberger, S. (2013). Speech accent archive. George Mason University.

Example

wav/dutch18.wav

../_images/speech-accent-archive-2.2.0-player-waveform.png

Tables

Click on a row to toggle a preview.

ID

Type

Columns

accent.test

segmented

tone, native_language, sex

file

start

end

tone

native_language

sex

wav/amharic19.wav

0 days 00:00:00.860000

0 days 00:00:01.920000

neutral

amharic

female

wav/amharic19.wav

0 days 00:00:02.260000

0 days 00:00:04.680000

neutral

amharic

female

wav/amharic19.wav

0 days 00:00:05.280000

0 days 00:00:07.360000

neutral

amharic

female

wav/amharic19.wav

0 days 00:00:07.700000

0 days 00:00:09.700000

neutral

amharic

female

wav/amharic19.wav

0 days 00:00:10.060000

0 days 00:00:12.140000

neutral

amharic

female

2597 rows x 3 columns

files

filewise

age, age_onset, birthplace, native_language, sex, speaker, country, duration

file

age

age_onset

birthplace

native_language

sex

speaker

country

duration

wav/afrikaans1.wav

27

9

virginia, south africa

afrikaans

female

1

south africa

0 days 00:00:20.819562500

wav/afrikaans2.wav

40

5

pretoria, south africa

afrikaans

male

2

south africa

0 days 00:00:22.021250

wav/afrikaans3.wav

43

4

pretoria, transvaal, south africa

afrikaans

male

418

south africa

0 days 00:00:26.932250

wav/afrikaans4.wav

26

8

pretoria, south africa

afrikaans

male

1159

south africa

0 days 00:00:23.536312499

wav/afrikaans5.wav

19

6

cape town, south africa

afrikaans

male

1432

south africa

0 days 00:00:20.297125

2138 rows x 8 columns

segments

segmented

content

file

start

end

content

wav/afrikaans1.wav

0 days 00:00:00.960000

0 days 00:00:03.740000

speech

wav/afrikaans1.wav

0 days 00:00:04.100000

0 days 00:00:06.380000

speech

wav/afrikaans1.wav

0 days 00:00:06.760000

0 days 00:00:10.800000

speech

wav/afrikaans1.wav

0 days 00:00:11.220000

0 days 00:00:14.600000

speech

wav/afrikaans1.wav

0 days 00:00:14.900000

0 days 00:00:19.740000

speech

17688 rows x 1 column

Schemes

ID

Dtype

Labels

age

int

age_onset

int

birthplace

str

content

str

speech

country

str

duration

time

native_language

str

sex

str

female, male

speaker

int

tone

str

neutral