Datasets

Datasets available with audb as of Jan 26, 2024. For each dataset, the latest version is shown.

name

description

license

version

schemes

air

The Aachen Impulse Response (AIR) database is a set of impulse responses that were measured in a wide variety of rooms. The initial aim of the AIR …

MIT

1.4.2

azimuth, distance, mode, reverberation-time, room

cough-speech-sneeze

Cough-speech-sneeze: a data set of human sounds This dataset was collected by Dr. Shahin Amiriparian. It contains samples of human speech, coughing…

CC-BY-4.0

2.0.1

category

crema-d

CREMA-D: Crowd-sourced Emotional Mutimodal Actors Dataset CREMA-D is a data set of 7,442 original clips from 91 actors. These clips were from 48 m…

Open Data Commons Open Database License (ODbL) v1.0

1.2.0

emotion: [anger, disgust, fear, happiness, neutral, no_agreement, sadness], speaker: [age, sex, race, ethnicity], corrupted, emotion.agreement, emotion.intensity, emotion.level, sentence, votes

emodb

Berlin Database of Emotional Speech. A German database of emotional utterances spoken by actors recorded as a part of the DFG funded research proje…

CC0-1.0

1.4.1

emotion: [anger, boredom, disgust, fear, happiness, sadness, neutral], speaker: [age, gender, language], age, confidence, gender, language, transcription

micirp

The Microphone Impulse Response Project (MicIRP) contains impulse response data for vintage microphones. The impulse response files were created us…

CC-BY-SA-4.0

1.0.0

manufacturer

musan

The goal of this corpus is to provide data for music/speech discrimination, speech/nonspeech detection, and voice activity detection. The corpus is…

CC-BY-4.0

1.0.0

artist, background_noise, composer, gender, genre, language, vocals

vadtoolkit

VAD Toolkit: A Database for Voice Activity Detection At each environment, conversational speech by two Korean male speakers was recorded. The groun…

GPLv3

1.1.0

noise