data-public

Datasets available with audb as of Jul 27, 2024 in the repository data-public. For each dataset, the latest version is shown.

name

description

license

version

schemes

air

The Aachen Impulse Response (AIR) database is a set of impulse responses that were measured in a wide variety of rooms. The initial aim of the AIR …

MIT

1.4.2

azimuth, distance, mode, reverberation-time, room

cough-speech-sneeze

Cough-speech-sneeze: a data set of human sounds This dataset was collected by Dr. Shahin Amiriparian. It contains samples of human speech, coughing…

CC-BY-4.0

2.0.1

category

crema-d

CREMA-D: Crowd-sourced Emotional Mutimodal Actors Dataset CREMA-D is a data set of 7,442 original clips from 91 actors. These clips were from 48 m…

Open Data Commons Open Database License (ODbL) v1.0

1.3.0

emotion: [anger, disgust, fear, happiness, neutral, no_agreement, sadness], speaker: [age, sex, race, ethnicity], corrupted, emotion.agreement, emotion.intensity, emotion.level, sentence, votes

emodb

Berlin Database of Emotional Speech. A German database of emotional utterances spoken by actors recorded as a part of the DFG funded research proje…

CC0-1.0

1.4.1

emotion: [anger, boredom, disgust, fear, happiness, sadness, neutral], speaker: [age, gender, language], age, confidence, gender, language, transcription

micirp

The Microphone Impulse Response Project (MicIRP) contains impulse response data for vintage microphones. The impulse response files were created us…

CC-BY-SA-4.0

1.0.0

manufacturer

musan

The goal of this corpus is to provide data for music/speech discrimination, speech/nonspeech detection, and voice activity detection. The corpus is…

CC-BY-4.0

1.0.0

artist, background_noise, composer, gender, genre, language, vocals

vadtoolkit

VAD Toolkit: A Database for Voice Activity Detection At each environment, conversational speech by two Korean male speakers was recorded. The groun…

GPLv3

1.1.0

noise