vadtoolkit¶
Created by Kim Jaeseok
version |
1.1.0 |
license |
|
source |
|
usage |
commercial |
languages |
kor |
format |
wav |
channel |
1 |
sampling rate |
16000 |
bit depth |
16, 32 |
duration |
0 days 02:00:09.703062500 |
files |
4, duration distribution: 1801.5 s 1804.0 s |
segments |
588, duration distribution: 0.0 s 51.1 s |
repository |
|
published |
2024-01-02 by audeering |
Description¶
VAD Toolkit: A Database for Voice Activity Detection At each environment, conversational speech by two Korean male speakers was recorded. The ground truth labels are manually annotated. Because the recording was carried out in the real world, unexpected noises are included to the dataset such as the crying of baby, the chirping of insects, mouse click sound, and etc..
Tables¶
ID |
Type |
Columns |
---|---|---|
segments |
segmented |
noise |
Schemes¶
ID |
Dtype |
Labels |
Mappings |
---|---|---|---|
noise |
int |
0, 1, 2, 3 |
✓ |