Robustness low quality phone¶
w2v2-L |
hubert-L |
wavlm |
data2vec |
|
---|---|---|---|---|
Overall Score |
75.0% passed tests (3 passed / 1 failed). |
50.0% passed tests (2 passed / 2 failed). |
75.0% passed tests (3 passed / 1 failed). |
50.0% passed tests (2 passed / 2 failed). |
Change Ccc Low Quality Phone¶
Data |
Change CCC Low Quality Phone |
|||
---|---|---|---|---|
w2v2-L |
hubert-L |
wavlm |
data2vec |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
-0.02 |
-0.05 |
-0.05 |
-0.05 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
-0.05 |
-0.05 |
0.01 |
-0.05 |
mean |
-0.04 |
-0.05 |
-0.02 |
-0.05 |
Percentage Unchanged Predictions Low Quality Phone¶
Data |
Percentage Unchanged Predictions Low Quality Phone |
|||
---|---|---|---|---|
w2v2-L |
hubert-L |
wavlm |
data2vec |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.64 |
0.62 |
0.58 |
0.64 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.53 |
0.68 |
0.68 |
0.65 |
mean |
0.58 |
0.65 |
0.63 |
0.65 |
Visualization¶
Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.
w2v2-L |
hubert-L |
wavlm |
data2vec |
---|---|---|---|