Datasets¶
Datasets available with audb as of Jan 26, 2024. For each dataset, the latest version is shown.
name |
description |
license |
version |
schemes |
---|---|---|---|---|
The Aachen Impulse Response (AIR) database is a set of impulse responses that were measured in a wide variety of rooms. The initial aim of the AIR … |
1.4.2 |
azimuth, distance, mode, reverberation-time, room |
||
Cough-speech-sneeze: a data set of human sounds This dataset was collected by Dr. Shahin Amiriparian. It contains samples of human speech, coughing… |
2.0.1 |
category |
||
CREMA-D: Crowd-sourced Emotional Mutimodal Actors Dataset CREMA-D is a data set of 7,442 original clips from 91 actors. These clips were from 48 m… |
1.2.0 |
emotion: [anger, disgust, fear, happiness, neutral, no_agreement, sadness], speaker: [age, sex, race, ethnicity], corrupted, emotion.agreement, emotion.intensity, emotion.level, sentence, votes |
||
Berlin Database of Emotional Speech. A German database of emotional utterances spoken by actors recorded as a part of the DFG funded research proje… |
1.4.1 |
emotion: [anger, boredom, disgust, fear, happiness, sadness, neutral], speaker: [age, gender, language], age, confidence, gender, language, transcription |
||
The Microphone Impulse Response Project (MicIRP) contains impulse response data for vintage microphones. The impulse response files were created us… |
1.0.0 |
manufacturer |
||
The goal of this corpus is to provide data for music/speech discrimination, speech/nonspeech detection, and voice activity detection. The corpus is… |
1.0.0 |
artist, background_noise, composer, gender, genre, language, vocals |
||
VAD Toolkit: A Database for Voice Activity Detection At each environment, conversational speech by two Korean male speakers was recorded. The groun… |
1.1.0 |
noise |