data-public¶

Datasets available with audb as of Nov 04, 2024 in the repository data-public. For each dataset, the latest version is shown.

name	description	license	version	schemes
air	The Aachen Impulse Response (AIR) database is a set of impulse responses that were measured in a wide variety of rooms. The initial aim of the AIR …	MIT	1.4.2	azimuth, distance, mode, reverberation-time, room
cough-speech-sneeze	Cough-speech-sneeze: a data set of human sounds This dataset was collected by Dr. Shahin Amiriparian. It contains samples of human speech, coughing…	CC-BY-4.0	2.0.1	category
crema-d	CREMA-D: Crowd-sourced Emotional Mutimodal Actors Dataset CREMA-D is a data set of 7,442 original clips from 91 actors. These clips were from 48 m…	Open Data Commons Open Database License (ODbL) v1.0	1.3.0	emotion: [anger, disgust, fear, happiness, neutral, no_agreement, sadness], speaker: [age, sex, race, ethnicity], corrupted, emotion.agreement, emotion.intensity, emotion.level, sentence, votes
emodb	Berlin Database of Emotional Speech. A German database of emotional utterances spoken by actors recorded as a part of the DFG funded research proje…	CC0-1.0	1.4.1	emotion: [anger, boredom, disgust, fear, happiness, sadness, neutral], speaker: [age, gender, language], age, confidence, gender, language, transcription
micirp	The Microphone Impulse Response Project (MicIRP) contains impulse response data for vintage microphones. The impulse response files were created us…	CC-BY-SA-4.0	1.0.0	manufacturer
musan	The goal of this corpus is to provide data for music/speech discrimination, speech/nonspeech detection, and voice activity detection. The corpus is…	CC-BY-4.0	1.0.0	artist, background_noise, composer, gender, genre, language, vocals
vadtoolkit	VAD Toolkit: A Database for Voice Activity Detection At each environment, conversational speech by two Korean male speakers was recorded. The groun…	GPLv3	1.1.0	noise