Robustness low quality phone

Overall scores

CNN14

w2v2-b

hubert-b

axlstm

Overall Score

25.0% passed tests (1 passed / 3 failed).

100.0% passed tests (4 passed / 0 failed).

25.0% passed tests (1 passed / 3 failed).

0.0% passed tests (0 passed / 4 failed).

Change Ccc Low Quality Phone

Threshold: -0.05

Data

Change CCC Low Quality Phone

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.00

-0.03

-0.07

-0.05

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.08

-0.03

-0.05

-0.07

mean

-0.04

-0.03

-0.06

-0.06

Percentage Unchanged Predictions Low Quality Phone

Threshold: 0.5

Data

Percentage Unchanged Predictions Low Quality Phone

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.29

0.58

0.50

0.37

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.33

0.59

0.54

0.39

mean

0.31

0.58

0.52

0.38

Visualization

Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

CNN14

w2v2-b

hubert-b

axlstm

../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard8.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard9.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard10.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard11.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard16.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard17.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard18.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard19.png