Robustness low quality phone¶
w2v2-L |
hubert-L |
wavlm |
data2vec |
|
---|---|---|---|---|
Overall Score |
25.0% passed tests (1 passed / 3 failed). |
75.0% passed tests (3 passed / 1 failed). |
100.0% passed tests (4 passed / 0 failed). |
75.0% passed tests (3 passed / 1 failed). |
Change Ccc Low Quality Phone¶
Data |
Change CCC Low Quality Phone |
|||
---|---|---|---|---|
w2v2-L |
hubert-L |
wavlm |
data2vec |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
-0.06 |
-0.05 |
-0.01 |
-0.02 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
-0.03 |
-0.01 |
-0.01 |
-0.01 |
mean |
-0.04 |
-0.03 |
-0.01 |
-0.01 |
Percentage Unchanged Predictions Low Quality Phone¶
Data |
Percentage Unchanged Predictions Low Quality Phone |
|||
---|---|---|---|---|
w2v2-L |
hubert-L |
wavlm |
data2vec |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.39 |
0.75 |
0.82 |
0.52 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.39 |
0.74 |
0.76 |
0.42 |
mean |
0.39 |
0.74 |
0.79 |
0.47 |
Visualization¶
Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.
w2v2-L |
hubert-L |
wavlm |
data2vec |
---|---|---|---|