Robustness spectral tilt

Overall scores

CNN14

w2v2-b

hubert-b

axlstm

Overall Score

12.5% passed tests (1 passed / 7 failed).

75.0% passed tests (6 passed / 2 failed).

100.0% passed tests (8 passed / 0 failed).

62.5% passed tests (5 passed / 3 failed).

Change Ccc Downward Tilt

Threshold: -0.05

Data

Change CCC Downward Tilt

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.06

-0.00

-0.00

-0.01

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.06

-0.01

-0.00

-0.02

mean

-0.06

-0.01

0.00

-0.01

Change Ccc Upward Tilt

Threshold: -0.05

Data

Change CCC Upward Tilt

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.03

-0.01

-0.00

0.03

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.08

-0.01

-0.01

0.01

mean

-0.03

-0.01

-0.01

0.02

Percentage Unchanged Predictions Downward Tilt

Threshold: 0.8

Data

Percentage Unchanged Predictions Downward Tilt

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.56

0.95

0.95

0.83

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.39

0.92

0.97

0.68

mean

0.48

0.94

0.96

0.76

Percentage Unchanged Predictions Upward Tilt

Threshold: 0.8

Data

Percentage Unchanged Predictions Upward Tilt

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.20

0.72

0.83

0.44

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.25

0.72

0.87

0.46

mean

0.23

0.72

0.85

0.45

Visualization Downward Tilt

Difference of predictions for original audio and audio with a downward spectral tilt. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

CNN14

w2v2-b

hubert-b

axlstm

../../../_images/visualization-downward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard22.png
../../../_images/visualization-downward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard23.png
../../../_images/visualization-downward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard24.png
../../../_images/visualization-downward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard25.png
../../../_images/visualization-downward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard22.png
../../../_images/visualization-downward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard23.png
../../../_images/visualization-downward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard24.png
../../../_images/visualization-downward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard25.png
../../../_images/visualization-upward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard22.png
../../../_images/visualization-upward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard23.png
../../../_images/visualization-upward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard24.png
../../../_images/visualization-upward-tilt_iemocap-2.3.0-emotion.dimensions.test.gold_standard25.png
../../../_images/visualization-upward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard22.png
../../../_images/visualization-upward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard23.png
../../../_images/visualization-upward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard24.png
../../../_images/visualization-upward-tilt_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard25.png