fsdnoisy18k¶
Created by Eduardo Fonseca, Mercedes Collado, Manoj Plakal, Daniel P. W. Ellis, Frederic Font, Xavier Favory, and Xavier Serra
version |
1.0.0 |
license |
|
usage |
unrestricted |
languages |
|
format |
wav |
channel |
1 |
sampling rate |
44100 |
bit depth |
16 |
duration |
1 days 18:29:50.962131518 |
files |
18532, duration distribution: 0.3 s |
repository |
audb-public |
Description¶
FSDnoisy18k is an audio dataset collected with the aim of fostering the investigation of label noise in sound event classification. It contains 42.5 hours of audio across 20 sound classes, including a small amount of manually-labeled data and a larger quantity of real-world noisy data. FSDnoisy18k contains 18,532 audio clips (42.5h) unequally distributed in the 20 classes drawn from the AudioSet Ontology. The audio clips are of variable length ranging from 300ms to 30s, and each clip has a single ground truth label (singly-labeled data). Files are released under either CC-BY or CC0. The individual license for each clip is available from the files table. FSDnoisy18k is an expandable dataset that features a per-class varying degree of types and amount of label noise. The dataset allows investigation of label noise as well as other approaches, from semi-supervised learning, e.g., self-training to learning with minimal supervision.
Tables¶
Click on a row to toggle a preview.
ID |
Type |
Columns |
---|---|---|
files |
filewise |
license |
test |
filewise |
label |
train |
filewise |
label, manually_verified, noisy_small |
Schemes¶
ID |
Dtype |
Labels |
Mappings |
---|---|---|---|
categories |
str |
Acoustic guitar, Bass guitar, Clapping, Coin (dropping), Crash cymbal, Dishes, pots, and pans, Engine, […], Piano, Rain, Slam, Squeak, Tearing, Walk, footsteps, Wind, Writing |
child_ids, citation_uri, description, id, positive_examples, restrictions |
license |
str |
CC-BY-3.0, CC0-1.0 |
✓ |
manually_verified |
bool |
||
noisy_small |
bool |