wham
Created by Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow, Ethan Manilow, Jonathan Le Roux
version | 1.0.0 |
license | |
usage | research |
languages | |
format | wav |
channel | 2 |
sampling rate | 16000 Hz |
bit depth | 32 |
duration | 3 days 09:41:18.774249999 |
files | 28000, durations from 3.4 s to 47.7 s |
repository | audb-public |
Description
The noise audio was collected at various urban locations throughout the San Francisco Bay Area in late 2018. The environments primarily consist of restaurants, cafes, bars, and parks. Audio was recorded using an Apogee Sennheiser binaural microphone on a tripod between 1.0 and 1.5 meters off the ground. The noise dataset has been processed to remove any segments containing intelligible speech. The average clip duration is 10.5 seconds with the shortest clip being 3.4 seconds and the longest 47.7 seconds.
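As a quick sanity check, the stated average clip duration can be recomputed from the total duration and file count listed in the metadata. The sketch below only does that arithmetic; the variable names are illustrative and not part of any dataset tooling.

```python
from datetime import timedelta

# Values copied from the metadata on this page.
total = timedelta(days=3, hours=9, minutes=41, seconds=18.774249999)
n_files = 28000

# Mean clip duration in seconds: total duration divided by number of files.
mean_seconds = total.total_seconds() / n_files
print(f"mean clip duration: {mean_seconds:.1f} s")  # prints "mean clip duration: 10.5 s"
```

The result agrees with the 10.5 second average given in the description.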
Tables
ID | Type | Columns |
---|---|---|
dev | filewise | noise-band, file-id, l-to-r-width, reverberation, location, day |
test | filewise | noise-band, file-id, l-to-r-width, reverberation, location, day |
train | filewise | noise-band, file-id, l-to-r-width, reverberation, location, day |
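One possible way to inspect a single split table is through the `audb` Python package. The following is a minimal sketch, assuming `audb` is installed, that the dataset is published under the name `wham` in version `1.0.0` as listed on this page, and that the repository is reachable. The helper name `load_split` is hypothetical.

```python
DATASET = "wham"   # dataset name as shown in the page title (assumed to be the audb name)
VERSION = "1.0.0"  # version listed in the metadata above


def load_split(table_id: str = "train"):
    """Return one split table ("train", "dev" or "test") as a pandas DataFrame."""
    # Third-party dependency, imported lazily so that defining the
    # helper does not require audb or network access.
    import audb

    db = audb.load(
        DATASET,
        version=VERSION,
        tables=[table_id],   # fetch only the requested table
        only_metadata=True,  # skip downloading the audio files
        verbose=False,
    )
    return db[table_id].get()
```

Calling `load_split("dev")` should then give a file-indexed table with the columns listed above. Dropping `only_metadata=True` would also download the audio, which is considerably larger.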
Schemes
ID | Dtype | Labels |
---|---|---|
day | str | day1, day2, day3, day4 |
file-id | str | |
l-to-r-width | float | 0.15, 0.16, 0.17 |
location | str | loc00, loc01, loc02, loc03, loc04, loc05, loc06, […], loc36, loc37, loc38, loc39, loc40, loc41, loc42, loc43 |
noise-band | int | 0, 1, 2, 3 |
reverberation | str | high, low, medium |
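The labelled schemes can be used to check that a row of table metadata only contains permitted values. Below is a small sketch of such a check in plain Python. It covers only the schemes whose label sets are fully listed above (`day`, `noise-band`, `reverberation`); `location` is left out because its label list is elided, and `file-id` and `l-to-r-width` are skipped as well. The function name and the example rows are made up for illustration.

```python
# Allowed labels, copied from the schemes table above.
SCHEMES = {
    "day": {"day1", "day2", "day3", "day4"},
    "noise-band": {0, 1, 2, 3},
    "reverberation": {"high", "low", "medium"},
}


def invalid_columns(row: dict) -> list[str]:
    """Return the names of columns whose value is not an allowed label."""
    return [
        column
        for column, allowed in SCHEMES.items()
        if column in row and row[column] not in allowed
    ]


# Hypothetical rows, not taken from the dataset.
good_row = {"day": "day2", "noise-band": 1, "reverberation": "medium"}
bad_row = {"day": "day2", "noise-band": 7, "reverberation": "medium"}

print(invalid_columns(good_row))  # prints []
print(invalid_columns(bad_row))   # prints ['noise-band']
```

Columns without a scheme entry in `SCHEMES` are ignored rather than rejected, so the check stays usable on rows that carry extra fields.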