Robustness low quality phone¶
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
---|---|---|---|---|
Overall Score |
25.0% passed tests (1 passed / 3 failed). |
100.0% passed tests (4 passed / 0 failed). |
25.0% passed tests (1 passed / 3 failed). |
0.0% passed tests (0 passed / 4 failed). |
Change Ccc Low Quality Phone¶
Data |
Change CCC Low Quality Phone |
|||
---|---|---|---|---|
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
-0.00 |
-0.03 |
-0.07 |
-0.05 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
-0.08 |
-0.03 |
-0.05 |
-0.07 |
mean |
-0.04 |
-0.03 |
-0.06 |
-0.06 |
Percentage Unchanged Predictions Low Quality Phone¶
Data |
Percentage Unchanged Predictions Low Quality Phone |
|||
---|---|---|---|---|
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.29 |
0.58 |
0.50 |
0.37 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.33 |
0.59 |
0.54 |
0.39 |
mean |
0.31 |
0.58 |
0.52 |
0.38 |
Visualization¶
Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.
CNN14 |
w2v2-b |
hubert-b |
axlstm |
---|---|---|---|